I’ve done my fair share of scraping, and I’ve learned that at large scale there are many cross-cutting, repetitive concerns: caching, fetching HTML (preferably in parallel), throttling, retries, navigation, emitting the output as a dataset…
My library, Skyscraper [0], attempts to help with these. It’s written in Clojure (based on Enlive or Reaver, both counterparts to Beautiful Soup), but the principles should be readily transferable everywhere.
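To give a flavor of the approach, here’s a minimal sketch of a Skyscraper-style scrape: processors describe how to turn one fetched page into data and/or further pages to visit, and the library handles caching, parallelism, and retries around them. The selectors, cache keys, and namespace below are illustrative and from memory, so they may not match the current API exactly:

```clojure
(ns example.scraper
  (:require [skyscraper.core :refer [defprocessor scrape]]
            [net.cgrand.enlive-html :as enlive]))

;; A processor turns one fetched page into a seq of context maps.
;; Maps with a :processor key are further pages to visit; the rest
;; become output rows. Option names here are illustrative.
(defprocessor :index
  :cache-template "example/index"          ; cache key, so reruns skip the fetch
  :process-fn (fn [res context]
                (for [a (enlive/select res [:article :a])]
                  {:title     (enlive/text a)
                   :url       (get-in a [:attrs :href])
                   :processor :article}))) ; navigation: hand each link onward

(defprocessor :article
  :cache-template "example/article/:title"
  :process-fn (fn [res context]
                [{:body (enlive/text
                         (first (enlive/select res [:div.body])))}]))

;; Kick off the scrape from a seed context; the result is a lazy seq
;; of maps, ready to emit as a dataset (CSV, EDN, a database, ...).
(comment
  (doall (scrape [{:url "https://example.com/", :processor :index}])))
```

The seed context plus per-page `:processor` keys is what makes navigation declarative: the scrape is a traversal of a tree of pages, and the cross-cutting concerns attach to the traversal rather than to each site-specific extractor.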
In developing this, what were some of the sites you used to test it, what data (and in what format) did you want to extract, and which of those sites was the most challenging?
My most extensive use of Skyscraper to date has been to produce a structured dataset of proceedings, including individual voting results, of Central European parliaments (~500K total pages scraped, ~100M entries). I’ll do a full writeup at some point.
[0]: https://github.com/nathell/skyscraper