I find it wild that the training process can do such things as forcing it repurp... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		samus on May 12, 2024 \| parent \| context \| favorite \| on: Vision Transformers Need Registers I find it wild that the training process can do such things as forcing it repurpose background areas to begin with. The authors just observed abd optimized what the model was already doing by itself.

jebarker on May 12, 2024 [–]

I agree, the most interesting thing about the paper is the default behavior of the network as it tries to compress the data.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact