Two questions, curious as I'm as well in the process of indexing HN, and your API may help me avoid this:
- how much content are you actually indexing ? Do you keep every single post or only the ones that do it on the home page or ask HN ? How far in time did you go ?
- do you have some way to implement a full-text search (eg: posts that contain a specific word, to be accurate) ?
I don't store or save any data, other than an in memory cache. I just scrape, process, and output the data. Since I'm not storing data, I have nothing to search.
Two questions, curious as I'm as well in the process of indexing HN, and your API may help me avoid this:
- how much content are you actually indexing ? Do you keep every single post or only the ones that do it on the home page or ask HN ? How far in time did you go ?
- do you have some way to implement a full-text search (eg: posts that contain a specific word, to be accurate) ?