This is wonderful work, and it is the foundation for yet more wonderful work in the future. However, as much as I am a huge PostgreSQL fanboy and a dedicated MongoDB peanut-thrower, I worry that the caption applied to this post is misleading.
Once this work is released, PostgreSQL will be faster than current versions of MongoDB at carrying out queries for documents which look for a given value at a given path (or perhaps more generally, values satisfying a given predicate at paths satisfying a given predicate).
But that has never been one of MongoDB's important strengths, has it? The operations for which MongoDB is optimised are inserting documents, retrieving documents by ID, and conducting map-reduces across the whole database. Yes, it has the ability to query by values in leaves, and it can use indices to make that faster, but that's a bit of a second-tier, bag-on-the-side feature, isn't it? If that was the main thing you needed to do, you wouldn't have chosen MongoDB, right?
Or just a single transaction with synchronous_commit turned off!
> Specifies whether transaction commit will wait for WAL records to be written to disk before the command returns a "success" indication to the client.
> This parameter can be changed at any time; the behavior for any one transaction is determined by the setting in effect when it commits.
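For illustration, a minimal sketch of that per-transaction setting (the table and columns here are made up):

  BEGIN;
  -- only this transaction skips the wait for the WAL flush;
  -- a crash can lose the last few commits, but it cannot corrupt the data
  SET LOCAL synchronous_commit TO OFF;
  INSERT INTO page_view_log (url, viewed_at) VALUES ('/home', now());
  COMMIT;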
The feature works per table and is off by default. Unlogged tables have another nice property: if the server crashes (or anything else happens that might have left the data corrupted), the table comes back completely empty. This makes it clear to users that unlogged tables are for caching or other data you can afford to lose.
It also means that you can be sure that the data you read from an unlogged table is the data that you put there. I much prefer this over the possibility of reading corrupt data without any chance of detecting it.
In my case, the use-case of an unlogged table is a table that practically works like a materialized view (until postgres learns to refresh them without an exclusive lock and maybe even automatically as the underlying data changes). If the table is empty, I can easily recreate the data, so I don't care about any kind of WAL logging or replication. If the table's there, perfect. I'll use it instead of a ton of joins and an equally big amount of application logic.
If the data isn't there, well, then I run that application logic, which causes the joins to happen. That's way, way slower, but the result is the same.
The advantage of using an unlogged table here is that it's cheaper to update and it doesn't generate replication traffic (unlogged tables are not replicated).
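A rough sketch of that pattern, with made-up table names (the cache gets rebuilt from the base tables whenever it turns up empty):

  CREATE UNLOGGED TABLE report_cache AS
    SELECT o.customer_id, sum(o.total) AS lifetime_total
    FROM orders o
    GROUP BY o.customer_id;

  -- after a crash the table survives but is empty, so just repopulate it
  INSERT INTO report_cache
    SELECT o.customer_id, sum(o.total)
    FROM orders o
    GROUP BY o.customer_id;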
It is not stupid; it actually makes sense for ETL workflows, where you load raw data into unlogged tables and transform it in the database into the final (logged) representation.
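Something along these lines, where only the final table is WAL-logged (the names and file path are only illustrative):

  CREATE UNLOGGED TABLE staging_events (happened_at text, user_id text, payload text);
  COPY staging_events FROM '/tmp/events.csv' WITH (FORMAT csv);

  -- transform into the durable, logged representation
  INSERT INTO events (happened_at, user_id, payload)
  SELECT happened_at::timestamptz, user_id::int, payload
  FROM staging_events;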
> I'm now immune to MongoDB bashing now but jesus keep up with updates.
We do want Jesus to keep up with the updates, for sure, but this isn't just about a random slip-up; it goes a bit deeper than that. This wasn't an unfortunate "oh, we meant to do all we could to keep your data safe" bug, it was a deliberate design decision.
Dishonesty and shadiness were the problem. When they finally started shipping with better defaults, did we see an apology? Was anyone responsible terminated? Nope. Not that I remember.
Now, it could all be a statistical fluke: people just randomly decided to bash MongoDB more than, say, Basho, Cassandra, PostgreSQL, HBase and other database products? Odd, isn't it. Or isn't it more rational to conclude that it isn't random at all, and that there is a reason for it?
When developers sell to developers, a certain level of honesty is expected. If I claim I am building a credit card processing API but deliberately add a feature that randomly double-charges to gain extra performance, you should be very upset about it. Yes, it's in the documentation, in fine print on page 156: "oh yeah, enable safe transactions, you should have read that". As a result you should probably never buy or use my product, and should stop trusting me with credit card processing from then on.
The bottom line is, they are probably good guys to have beers with, but I wouldn't trust them with storing data.
Your analogy rests on the absurd assumption that mission critical components don't need to be fully vetted and understood.
Shady? Yes. Dishonest? No. To be sure, both of these attributes suck, but beyond that there is an absolute world of difference between the two. One passively leaves knowledge gaps that even novice consumer programmers would uncover (the source is wide open after all, and speaking from personal experience, any claims of hyper performance should arouse enough suspicion to inspire some basic research), while the other asserts no knowledge gaps and actively inhibits knowledge discovery.
That said, they fell very far from the ideal, and for that they should be faulted. They're now paying the price by living in a dense fog of distaste and disgust. I don't think anyone believes it's a "statistics fluke", but it's likely that a mob/tribal mentality plays some role (this is an industry notoriously susceptible to cargo cults) and that many individuals on the burn-MongoDB-at-the-stake bandwagon do not actually have deep--or moderate!--experience with it and competing products. Also, would you chalk it up to statistical anomaly that they were able to close funding and that many large scale systems use their product?
Isn't it possible that the company had shameful marketing practices that repel the programmer psyche, but that the product itself has a unique enough featureset (schemaless yet semi-structured querying, indexing, dead simple partitioning and replication, non-proprietary management language, simple backups, all at the cost of RAM/SSD) to make it compelling for some problem classes?
Unacknowledged writes are disturbing but this can happen in any system: data hits the disk, and just before the server can send an ack to the client, it crashes.
When map/reduce comes into play, speed isn't usually the main issue. It's a way to do work in parallel - which may be done to increase speed, but more often than not it's done because there's too much work to be done any other way.
Put another way, I see map/reduce as a background task - building indexes, reporting, etc. I don't think speed is as important there as it is for a foreground task like GET, PUT and DELETE.
For these operations, especially by key, Postgres has always been right up there. What's happening now is that Postgres is getting really good at NoSQL features - documents, key/value pairs, deep querying, etc. - good enough that if you have Postgres you can do pretty much whatever you could have done with Mongo, while still making use of everything else it has to offer.
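To make that concrete, here's a small sketch of the document-ish querying that's already possible with hstore plus a GIN index (the table and keys are invented for the example):

  CREATE EXTENSION IF NOT EXISTS hstore;
  CREATE TABLE docs (id serial PRIMARY KEY, attrs hstore);
  CREATE INDEX docs_attrs_idx ON docs USING gin (attrs);

  -- containment query, answered via the GIN index
  SELECT id FROM docs WHERE attrs @> 'status => active, plan => pro';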
You mischaracterize one of MongoDB's main draws. It's not just indexing, but indexing over a schemaless dataset. Mongo isn't unique in providing this, but combined with other features it is compelling for certain scenarios.
That said, if Postgres matches/exceeds enough Mongo features, it could certainly outshine Mongo for those scenarios.
AFAIK, map-reduce is relatively expensive on Mongo... typically recommended only for background execution, nothing synchronous.
I've harped on this many times in the past. I love PostgreSQL, but where is the replication that matches MongoDB for ease of setup? Please answer "you must not be keeping up with PostgreSQL lately" and I'll eat my words!
IMHO this seems like building a better apple by making it taste like an orange. This could definitely be useful for storing JSON from JavaScript frameworks like Backbone/Angular/Ember, but I don't see how this is going to improve the lives of PostgreSQL users in general. Hopefully this leads to some weird, cool, emergent use case that makes PostgreSQL way better than other databases, but I'm not sure at this point that it will. Does anybody have more context on the long game for this approach? I'm not sure what the goal is for adding this feature.
Look at some of the linked PDFs. It's the indexing technique and on-disk format that are the enablers for several notable use cases; personally, I'm looking forward to a time when I can use pg for full-text search without having to resort to another external system like Lucene or Sphinx. What it won't have is the query "smarts" that Lucene or Sphinx have today, but you need speed first.
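For what it's worth, the built-in full-text search already looks roughly like this (a minimal sketch with invented names); the GIN work is what should make the index side of it fast:

  CREATE TABLE articles (id serial PRIMARY KEY, body text);
  CREATE INDEX articles_fts_idx ON articles
    USING gin (to_tsvector('english', body));

  SELECT id FROM articles
  WHERE to_tsvector('english', body) @@ to_tsquery('english', 'postgres & json');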
I'm very excited to put my 3m documents into hstore, as they are currently horrible, unqueryable, (mostly) unwritable blobs of compressed, b64-encoded, serialized Ruby hash structures (long story, predating NoSQL).
This makes me very, very happy.
Edit:
I should point out why:
- my document structures pretty much match 1:1 with hstore now (we have nested arrays of hashes)
- I can query & report on the data with normal tools (sketched below)
- I can update individual fields on my data (currently impossible)
- I can do relational joins & such-like in the db (currently impossible)
- I can have sensible transactions
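A sketch of what the flat-hstore versions of those operations look like today (the nested variant from the talk isn't released yet, and all the names here are invented):

  -- query & report with normal tools
  SELECT count(*) FROM documents WHERE meta -> 'state' = 'archived';

  -- update an individual field in place
  UPDATE documents
  SET meta = meta || hstore('state', 'active')
  WHERE id = 42;

  -- join against relational tables
  SELECT d.id, u.email
  FROM documents d
  JOIN users u ON u.id = (d.meta -> 'owner_id')::int;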
This is impressive but not really surprising given the quality of the PostgreSQL database. Quote:
"We added performance comparison with MongoDB. MongoDB is very slow on loading data (slide 59) - 8 minutes vs 76s,
seqscan speed is the same - about 1s, index scan is very fast - 1ms vs 17 ms with GIN fast-scan patch. But we managed to create new opclass (slides 61-62) for hstore using hashing of full-paths concatenated with values and got 0.6ms, which is faster than mongodb !"
But note it's at the expense of a very large index file - 800MB (vs Mongo's 100MB). Although Pg's index covers the entire JSON structure, whereas Mongo's index only covers the leaves being searched, so Pg's will optimize a larger variety of queries.
800MB is for the old GIN index, the new experimental one is 350MB, and has not yet undergone more space optimizations. They are also not really comparable: Mongo only indexes a few select keys, whereas the new GIN index does many.
It takes a non-0 amount of time to load large indices into memory (upwards of 5 seconds, depending on your choice of disks and arrays), and to iterate through large indices once they are in memory.
Such large indices also limit how many of them can be stored in memory at the same time - making transactions against those indices slower over time if they have to be loaded and unloaded from memory frequently.
You should always choose indexes with care, because indexes not only cost space, they cost time. And as you said, time is expensive.
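If you want to see what an index is actually costing you space-wise, something like this does the trick (the index name is made up):

  SELECT pg_size_pretty(pg_relation_size('docs_attrs_idx'));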
> It takes a non-0 amount of time to load large indices into memory (upwards of 5 seconds, depending on your choice of disks and arrays), and to iterate through large indices once they are in memory.
This is why MongoDB recommends that all indices fit in memory.
Great work and research; horrible submission headline.
The news here is that, in addition to the huge market for traditional DBs, postgres is going to compete in a serious way on MongoDB's home turf. As that becomes more apparent, it will validate postgres's flexibility/adaptability and cast doubt over special-purpose database systems and NoSQL.
MongoDB still has a story around clustering, of course. But that story is somewhat mixed (as all DB clustering stories are); and postgres is not standing still on that front, either.
(Disclaimer: I'm a member of the postgres community.)
Hey Jeff, been a postgresql fan for a long time, but clustering is my number one issue. What is coming down the pipe for clustering/replication for PostgreSQL?
Logical replication will be a big deal over the next couple of releases. I'm not sure exactly what will be available when, but I know that good foundations are being built now.
Jeff, first, thanks for the response and I'm heartened by what you say.
If I could suggest, please look at how easy it is to configure replication in MongoDB. If you make it that easy, even if it's just for the JSON stores (I know that maybe you can't make it as easy for relational tables), I will move my product to PostgreSQL in a heartbeat.
I have used and promoted PostgreSQL throughout my career, but I'm currently stuck with having to deliver HA in a product for redistribution to customers, and Slony can't cut it for our use cases; it's too complicated. I think the key is that we are a vendor and we need to provide a database with our products, so it's not me who's responsible for configuring HA, it's our users. And replication is what I'm concerned about 90% of the time, not necessarily horizontal scale-out (i.e. we're not using sharding in 90% of our cases, we just need HA).
SELECT json_field FROM data WHERE (json_field->>'age')::int > 15
Part of the performance increase for hstore is improvements for GIN indexes, and according to the author can be applied to json. So yes you can use indexes on your hstore or json documents.
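As a sketch of what's already possible for the query above (independent of the new GIN work): 9.3's json operators can be used in a plain expression index, assuming, as I recall, that the extraction operators are immutable:

  CREATE INDEX data_age_idx ON data (((json_field->>'age')::int));

  SELECT json_field FROM data WHERE (json_field->>'age')::int > 15;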
Yes, for a project I'm working on, I had a JSON array of objects that each had a key called name.
I have fast indexed queries that return any rows containing an object with a given name, plus full text search on that name, along with a few other things.
Which is not surprising, as this is a 'real' database. I read a review of MongoDB internals and it did not sound very positive. While you can create a NoSQL store quickly, doing it well with any db takes time, which PostgreSQL has definitely had.
Just to be clear, MongoDB is a fine db for certain scenarios and I am using it in production.
If you want more of the detail and background around this, which came from a talk by the author of the blog post, you can check out his slides at http://www.sai.msu.su/~megera/postgres/talks/hstore-dublin-2.... One of the key things not in the blog post but addressed in the slides is that there will be a new index type that pushes the performance gains even further.
Well, if I make the same sacrifices I make to use Mongo I don't really have a reason to use PostgreSQL.
No, but I really like the recent enhancements in PostgreSQL. Failover is nearly as easy as in MongoDB; however, all this doesn't play so nicely yet if you are using stuff like extensions (PostGIS) or your own functions, and it still isn't really an out-of-the-box experience.
I agree: if you use it with the same limits and the performance gain is worth it, you might as well be using Postgres. However, a lot of this actually only changed in the recent releases. It's all still fairly new.
Also, there are still a number of things that are basically missing, like out-of-the-box upserts (we are using a function for this, but it's more of a hack), and if you are still somewhat in development, a lot of little changes get really hard in PostgreSQL. Converting your data structure, even with stuff like CTEs and the surrounding functionality, can become really challenging, especially when you think there must be an easier way.
While it is easier to modify structures in MongoDB, it is sometimes actually harder to aggregate them. Using stuff like map/reduce (or even the lighter-weight aggregation framework) frequently feels like overkill.
I think, however, it really depends on the kind of data you are dealing with. That's why we are using a hybrid system right now. Both systems are actually evolving really quickly, and if you have the joy of using their most recent versions, you're always excited about new releases.
About the upsert function, have you found a way to do this generically, or are you generating (at least) one function per table from a template? I've found the hackiness of this to be easily the worst part about using Postgres.
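For reference, the kind of per-table function being discussed is roughly the loop-with-exception pattern from the Postgres docs; this is only a sketch against an invented counters(key, value) table:

  CREATE FUNCTION upsert_counter(k text, delta int) RETURNS void AS $$
  BEGIN
      LOOP
          UPDATE counters SET value = value + delta WHERE key = k;
          IF FOUND THEN RETURN; END IF;
          BEGIN
              INSERT INTO counters (key, value) VALUES (k, delta);
              RETURN;
          EXCEPTION WHEN unique_violation THEN
              -- someone else inserted concurrently; loop and retry the UPDATE
          END;
      END LOOP;
  END;
  $$ LANGUAGE plpgsql;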
You didn't answer my question at all. You claimed postgresql was an enormous challenge to scale horizontally. I asked how. There are still lots of reasons to use postgresql, I assumed you knew this since you expressed that you wanted it to be easier to scale horizontally.
If you want to scale out using a replica-strategy you really want to have the most recent version of PostgreSQL. It made things like log shipping pretty much an automatic experience. It still has rough edges, but the latest version really makes a difference here. See the release notes of PostgreSQL.
You will however find plenty of information on the wiki. Sometimes it is not completely up to date because of so much going on on that front.
You can use third-party solutions too, but they usually come with a number of caveats and general problems. However, it depends on what you do with your database. If you don't write your own functions, use extensions, or have fancy transactions, they will work just fine. If you do, sticking to the built-in functionality of Postgres is the safer strategy. If you are dealing with extensions, etc., you should really be ready to get dedicated support for these things.
However, it really depends on what you do. For everything standard it won't cause too much trouble once the first setup is done.
The problem is he doesn't know he is making the sacrifices.
MongoDB's marketing is sloppy: it promises "webscale" while hand-waving partition tolerance away.
After the whole "let's disable durable writes to make our benchmarks faster" episode, I just can't see trusting them with data I actually want to read back from the database. It might be good for probabilistic storage or stats reporting. It sort of became an issue of trust more than anything.
Implementing our doc store on Postgres a while ago in lieu of switching to MongoDB suddenly seems like a very good call.
We had nothing against Mongo (frankly, we never did a deep analysis of how it would have turned out). It was simply the db we already had at hand, and we knew it well and trusted it.
It would be great if they could run these benchmarks with mongodb using TokuMX.
I think Tokutek's fractal trees would make an exciting data store for PostgreSQL, but benchmarks are difficult to find... Which is perceived to be faster these days, generally: MySQL+InnoDB, MySQL+TokuDB or PostgreSQL?
What I did: 1) use the latest Postgres (9.3); 2) use Rails 4, which provides string/integer arrays, hstore and UUIDs out of the box; 3) use post_json (https://github.com/webnuts/post_json).
These tools made it simpler to do things like tagging, user-defined data attributes, hierarchical data, flexible configuration data.
It was always possible to store much of this data using text fields and JSON.parse (or some such), but the new Postgres stuff makes the flexible datatypes queryable. And as built-in support for flexible data types comes online, you can throw away your custom serializers.
There are other Postgres datatypes that I haven't used - for geocoding/geosearching, network addresses, etc.
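On the plain SQL side, the flexible column types mentioned above look something like this (a sketch; it assumes the hstore and uuid-ossp extensions, and the table itself is invented):

  CREATE EXTENSION IF NOT EXISTS hstore;
  CREATE EXTENSION IF NOT EXISTS "uuid-ossp";

  CREATE TABLE widgets (
      id       uuid PRIMARY KEY DEFAULT uuid_generate_v4(),
      tags     text[],   -- string array
      settings hstore,   -- user-defined key/value attributes
      config   json      -- flexible configuration data
  );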
I was in the audience!
It was a great session - you're a funny guy, you explained the concepts well, and there were some really interesting real- (or alternate :p) world use cases for all this stuff.