https://www.findlectures.com Right now it has an index of ~70k conference talks ...

WmyEE0UsWAwC2i · on Feb 10, 2017

I didn't know i wanted this.

How do you scrap all that data?

garysieling · on Feb 13, 2017

I wrote up some more info here on data acquisition here (just notes right now) - https://www.findlectures.com/articles/2017/01/22/Software-Ar...

WmyEE0UsWAwC2i · on Feb 21, 2017

Thanks for the write up!

garysieling · on Feb 10, 2017

I started out scraping sites manually, and started automating more pieces (a lot of sites use wordpress, so they are pretty structured). I'm working on a talk on the subject, so I'll have an article soon that explains better :)

zump · on Feb 11, 2017

How do you pay for the bandwidth costs for scraping?

garysieling · on Feb 11, 2017

I'm doing everything over my home network

zump · on Feb 12, 2017

ANd how much does that cost?

debamitro · on Feb 12, 2017

what is the stack you used?

garysieling · on Feb 13, 2017

It's mostly Node right now. I started documenting it in more detail (just notes right now) - https://www.findlectures.com/articles/2017/01/22/Software-Ar...