Would be really nice to see a demo of RethinkDB under real load. All of the RethinkDB slides and info I have seen generally have 1-4 servers and maybe 50 MB of data at most. At that scale, you might as well just be using text files...
Anybody know of any demos of RethinkDB handling, say, 100 GB of data? And running decent queries on it?
slava @ rethink here. Let me explain the state of affairs on this.
The underlying storage engine was tested on commodity systems and super-duper enterprisey storage systems, and can do hundreds of thousands of ops/second on tens of terabytes of data (that required pretty beefy setups, though). When we added clustering on top of the storage engine, we avoided thinking about performance too much (in the interest of shipping), so everything slowed down significantly. Here's our (rough) roadmap:
- New protocol buffer API and some more checklist features (1.4)
- Secondary indexes, huge ReQL improvements (1.5)
- Performance and scalability (1.6)
We'll be doing scalability and performance demos that I hope will be really impressive, but it'll take ~4 months to get there.