
> They set up a limited number of agents to run in parallel (often just one),

Most of what people use agents for day to day can be one-shotted, though, and even collating and rating 10 results would be costly.

If I had a harness for evaluating the results and VC-level money, I'd be throwing an army at well-defined experimental tasks as well.


