
Confession: I am guilty of using regex superpowers to extract data from URLs. Will check out w3lib soon!
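
For anyone in the same boat, here is a rough sketch of the non-regex route. It uses Python's standard `urllib.parse` rather than w3lib itself, so it says nothing about w3lib's own API. The `extract_url_parts` helper and the example URL are made up for illustration.

```python
from urllib.parse import urlsplit, parse_qs


def extract_url_parts(url):
    """Split a URL into host, path segments and query parameters.

    Hypothetical helper for illustration only; not part of w3lib.
    """
    parts = urlsplit(url)
    return {
        "host": parts.hostname,
        # Drop empty segments caused by leading or trailing slashes.
        "path_segments": [seg for seg in parts.path.split("/") if seg],
        # parse_qs maps each key to a list, since keys may repeat.
        "query": parse_qs(parts.query),
    }


# Made-up URL, just to show the shape of the result.
info = extract_url_parts("https://example.com/products/42?ref=hn&page=2")
print(info["path_segments"], info["query"])
```

The parser deals with percent-encoding and repeated keys, which is usually where hand-rolled regexes start to break.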


I've had to write some gnarly XPath expressions to extract data with Scrapy.

> //b[contains(.,'City')]/following-sibling::a[not(preceding-sibling::b[contains(.,'Country')])]/text()
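
Read aloud, that expression says: take every `<a>` that follows a `<b>` containing "City" among its siblings, skip any `<a>` that already has a `<b>` containing "Country" before it, and return the text. Below is a rough plain-Python equivalent, assuming flat, well-formed markup. It walks siblings with the standard library's `xml.etree.ElementTree` instead of evaluating the XPath (ElementTree does not support these axes), and the sample markup and the `links_between_labels` name are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Invented sample markup; real pages will be messier.
SAMPLE = """
<div>
  <b>City:</b> <a href="/c/1">Lisbon</a>, <a href="/c/2">Porto</a>
  <b>Country:</b> <a href="/k/1">Portugal</a>
</div>
"""


def links_between_labels(root, start="City", stop="Country"):
    """Collect text of <a> siblings after a <b> label containing `start`
    and before any <b> label containing `stop`.

    Hypothetical helper; only the leading text of each <a> is returned.
    """
    results = []
    for parent in root.iter():
        seen_start = seen_stop = False
        for child in parent:
            if child.tag == "b":
                label = "".join(child.itertext())
                seen_start = seen_start or start in label
                seen_stop = seen_stop or stop in label
            elif child.tag == "a" and seen_start and not seen_stop:
                if child.text:
                    results.append(child.text)
    return results


cities = links_between_labels(ET.fromstring(SAMPLE))
print(cities)
```

In a real Scrapy spider the XPath one-liner is still the shorter option; the loop is only meant to spell out what it selects.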


You're not alone on that! :)



