Exactly. I have written software for legal document search. Most of what the users get is in PDF, and it’s a major PITA to get data out of those files. Forget about tables. Just try to extract plain text without garbled characters and you will lose your mind.

