
I agree with Halvar about all of this, but would want to call out that his "matmul interleaved with nonlinearities" is reductive: a frontier model is a higher-order thing than that, a network of those matmul+nonlinearity chains, iterated.

