Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

None of them do it well from our experience. We had to write our own custom pipeline with a mixture of legacy CV approaches to handle this (AI contract analysis). We constantly benchmark every new multimodal and VLM model that comes out and are consistently disappointed.


If someone releases a benchmark/dataset, I'm sure that significantly increases the chances of one of these AI labs training on the task.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: