Hacker News
DeepSWE: Training an Open-Sourced Coding Agent by Scaling RL (pretty-radio-b75.notion.site)
3 points by sijuntan 8 months ago | past | 1 comment
DeepCoder: A Fully Open-Source 14B Coder at O3-Mini Level (pretty-radio-b75.notion.site)
15 points by sijuntan 11 months ago | past | 6 comments
DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL (pretty-radio-b75.notion.site)
322 points by sijuntan on Feb 11, 2025 | past | 127 comments
DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL (pretty-radio-b75.notion.site)
19 points by mluo on Feb 10, 2025 | past
