
goto11 on Jan 31, 2019, on: Why isn't the internet more fun and weird?

The question is *not about parsing*. It is about tokenizing XHTML. So you are suggesting writing a hand-rolled tokenizer instead of using regexes for tokenization? Why is that better? Tokenization is exactly the kind of task regexes excel at.
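To make the point concrete, here is a rough sketch of regex-based tokenization in Python. It assumes a small XHTML subset (comments, tags with quoted attributes, and text) and skips CDATA, doctypes, and processing instructions. The names `TOKEN_RE` and `tokenize` are made up for illustration, so treat this as a sketch and not a complete XHTML tokenizer.

```python
import re

# Hypothetical token grammar for a small XHTML subset (an assumption,
# not the full spec). Each alternative is a named group; the name of the
# group that matched becomes the token type.
TOKEN_RE = re.compile(r"""
    (?P<comment><!--.*?-->)
  | (?P<close_tag></[A-Za-z][\w:.-]*\s*>)
  | (?P<open_tag><[A-Za-z][\w:.-]*(?:\s+[\w:.-]+\s*=\s*(?:"[^"]*"|'[^']*'))*\s*/?>)
  | (?P<text>[^<]+)
""", re.VERBOSE | re.DOTALL)


def tokenize(source):
    """Split source into (token_type, lexeme) pairs, left to right."""
    pos = 0
    tokens = []
    while pos < len(source):
        match = TOKEN_RE.match(source, pos)
        if match is None:
            # Nothing in the grammar matches here, e.g. a stray '<'.
            raise ValueError(f"unexpected input at offset {pos}")
        tokens.append((match.lastgroup, match.group()))
        pos = match.end()
    return tokens


if __name__ == "__main__":
    for token in tokenize('<p class="x">Hi<br/></p>'):
        print(token)
```

Note that this only produces a flat token stream. Checking that tags nest and close properly is still the parser's job, which is the part regexes are not suited for.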