Hacker News new | past | comments | ask | show | jobs | submit login

How could you use this?

Creative ideas please!




Write a better page rank algorithm?


I think an open source google could be a pretty great project, I would imagine its been tried before, but by seperating out the steps where these guys crawl, other people build indexes, and others handle lookups, it sounds more reasonable than one project taking on the whole thing


The biggest problem here is hosting the index... in RAM...

It takes A LOT of machines to power a modern search engine which serves any real amount of traffic. One key component of an open source search engine would be a sort-of peer-to-peer distributed infrastructure. When I suggested this in an earlier thread, people were quick to point out the liability concerns here... but maybe it could work somehow... but then how do you get people to sign up for it?

That said, I think this is incredibly interesting stuff. I would really love to see open source, peer-served web utilities. For example, I'd want access to many of the components of a web search engine, not just the search results themselves. Things like a language model for spell checking or word segmentation. Or a set analysis tool for detecting synonyms.


Ya. Thats true. But purely from an data standpoint, having the same index as Google has would be valuable. Just because you can do so many things with it. Like, figuring out how to increase the page rank of your site!


http://www..aspseek.org/ Open Source Implementation of PageRank


Why not use Google?


Why not use Windows?


I do. I also see nothing wrong with that.


Query for content inside of <script> or other similar tags which Google and all other search engines I have encountered ignore.


Draw a map of the WWW?




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: