Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

One thing I am now wondering is why bother with the Selenium script at all? Why not have the AI model describe the same things it would do in a Selenium script in detailed natural language, you could always store that in a DB or file with more efficiency than storing the video, and you could just feed the natural language description to a model for automation? And the major benefit is it is much easier for humans to review and modify if needed.


Eventually; openai, adept, etc. are working on these types of agents. But currently, name a model that can replace selenium (ie. engage with the browser)


Selenium scripts are already so flaky. Why would you want to add more ambiguity to them?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: