Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I find it wild that the training process can do such things as forcing it repurpose background areas to begin with. The authors just observed abd optimized what the model was already doing by itself.


I agree, the most interesting thing about the paper is the default behavior of the network as it tries to compress the data.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: