http://www.80legs.com - web-scale web-crawling for everyone. Launched in September and growing revenue now. We've solved a lot of big data store issues in our back-end. Challenges have shifted from technical to business :)
Extractiv - web-listening and content-extraction that combines semantic analysis and web-scale reach for a complete picture of what information is on the web and what the web is discussing. The core technology is working, currently turning it into a real product.
While in principle I think the approach is great, I have recently learned that some of your "volunteers" are in fact people infected with spyware. Can't find the corresponding article atm, though :-(
Yeah I met these guys when they presented at Rice a while back. I can tell you they aren't doing anything with Spyware. It was actually really frustrating because they had to keep answering this question over and over.
They're not infected with spyware. Some of the computing power comes from the users of freeware applications, who are asked _during install_ if they want to enable the grid system that powers us.
The article I read was about one of these freeware applications (something starting with "G", and a green Logo or mascot I think, I can not remember unfortunately), which installs the spyware if the users does "Express Install". Buried in the TOS is "we may use your computing power". It was by a third company, which in turn is supposed to be used by 80legs. Perhaps 80legs is not even aware of the shady practices of that company.
You're talking about Digsby. If you managed to follow the story to its conclusion, you'll see that they dramatically changed the install process to follow Plura's (our sister company) affiliate agreement.
Have you thought about the economics of having a program where people earn money from allowing use of there processing power vs paying for it regularly/ having your own processing power.
Even if it was barely worth it for the user after factoring in electricity costs the novelty factor might attract people.
Extractiv - web-listening and content-extraction that combines semantic analysis and web-scale reach for a complete picture of what information is on the web and what the web is discussing. The core technology is working, currently turning it into a real product.