
Phi is notorious for benchmark overfitting. It's good, but not as good as it looks on the charts. On the LMSYS leaderboard it places a whole 23 spots behind Llama-3-8B, which it also claims to soundly beat on the charts above. So YMMV.

