Hacker News new | past | comments | ask | show | jobs | submit login
Ask HN: Hacker News content license and distribution
1 point by donretag on Feb 15, 2024 | hide | past | favorite
I am working on a project that utilizes data from Hacker News. Wanted to share these insights, but wondering if I was able to share the content itself.

The content are all the whoishiring comments. There are currently many projects that use this data. Data was grabbed via the firebase API and is about 89K user comments. Currently in a pickle format, but can also be json (CSV does not work well with nested content and data frames). Would also like to share the scripts to bootstrap the data from scratch, but it does take a while and would not want others hitting the API with the same script.

The are many HN datasets on Kaggle. Does not mean they are not in violation. Ultimately, would like the data to be in Kaggle and scripts to parse in git. Sorry for the repost, only got one single comment previously.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: