"I think one interesting project in the near future could be to try and build a ...

elvinyung · on Nov 11, 2017

Oh, I don't mean a database frontend that can handle both OLTP and OLAP workloads, usually by having some kind of OLAP column-store and some kind of OLTP main memory row-store. I know there's a lot of those (not only HANA, but also MemSQL, SQL Server, etc.)

The fun thing to try and imagine here is having literally the same physical data format that works for both kinds of workloads.

dustingetz · on Nov 12, 2017

You actually don't need to have the same storage data layout if you use a time series as a starting point; because you can maintain different data layouts in parallel, and the time dimension permits strong consistency across them all.

If this is what you mean by a "database frontend", I am really confused as to why you object to this?

I think this property of time series is going to prove very important in the 2020s

elvinyung · on Nov 12, 2017

Again, I don't care about distributed consistency here, nor is it mutually exclusive with what I'm talking about.

The question we're trying to answer is whether there exists at all a storage engine, at the scope of a single node (obviously generalizable/scalable), that can fit all use cases well. Once you figure out the answer to that, obviously having one storage engine is simpler than having two storage engines.

dustingetz · on Nov 14, 2017

Parallel but consistent indexes could be done in a single process too. The tradeoff is natural reads at the cost of extra constant factor of write computation and storage.

Which is exactly the tradeoff RDBMS indexes already make.

elvinyung · on Nov 15, 2017

That's still not what I'm talking about. https://cs.brown.edu/~ugur/fits_all.pdf

dustingetz · on Nov 16, 2017

thank you for that link !!