Hacker News

I think part of why tokenization is a problem for math here is that the model doesn't seem to carry overflow into the token to its left. That said, I haven't worked with GPT in enough detail to do a deeper analysis than that hunch, so take my comment with a couple of grains of salt.
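To make the hunch concrete, here is a rough Python sketch of adding two numbers chunk by chunk, with and without a carry into the left chunk. The fixed, right-aligned 3-digit chunks are an assumption for illustration only, not how any real GPT tokenizer splits numbers, and the names `split_chunks` and `add_chunked` are made up for this example.

```python
def split_chunks(digits: str, size: int = 3) -> list[str]:
    """Split a digit string into right-aligned chunks of `size` digits.

    Hypothetical stand-in for multi-digit tokens; real tokenizers differ.
    """
    chunks = []
    while digits:
        chunks.append(digits[-size:])
        digits = digits[:-size]
    return chunks[::-1]  # most significant chunk first


def add_chunked(a: str, b: str, size: int = 3, carry_enabled: bool = True) -> str:
    """Add two non-negative decimal strings one chunk at a time.

    With carry_enabled=False, overflow from a chunk is dropped instead of
    being passed to the chunk on its left, which mimics the suspected failure.
    """
    width = max(len(a), len(b))
    width = -(-width // size) * size  # round up to a multiple of `size`
    xs = split_chunks(a.zfill(width), size)
    ys = split_chunks(b.zfill(width), size)

    base = 10 ** size
    carry = 0
    out = []
    # Walk from the least significant chunk to the most significant one.
    for x, y in zip(reversed(xs), reversed(ys)):
        total = int(x) + int(y) + carry
        carry = total // base if carry_enabled else 0
        out.append(str(total % base).zfill(size))
    if carry:
        out.append(str(carry))

    result = "".join(reversed(out)).lstrip("0")
    return result or "0"


print(add_chunked("1999", "1"))                       # carry propagated
print(add_chunked("1999", "1", carry_enabled=False))  # carry dropped
```

Under these assumptions, the second call gets every chunk right locally but loses the overflow from the rightmost chunk, so the error only shows up in sums where a chunk boundary needs a carry.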

