Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
samus
on May 12, 2024
|
parent
|
context
|
favorite
| on:
Vision Transformers Need Registers
I find it wild that the training process can do such things as forcing it repurpose background areas to begin with. The authors just observed abd optimized what the model was already doing by itself.
jebarker
on May 12, 2024
[–]
I agree, the most interesting thing about the paper is the default behavior of the network as it tries to compress the data.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: