Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The question is not about parsing. It is about tokenizing XHTML. So you are suggesting to write a hand-rolled tokenizer instead of using regexes for tokenization? Why is that better? That is exactly the kind of task regexes excel at.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: