Distributed systems theory for the distributed systems engineer

jandrewrogers · on Aug 10, 2014

This is a pretty limited view of distributed systems engineering, particularly at the high-end. Some topics that are essential for designing efficient, massively distributed systems:

- Greedy routing theory i.e. protocol design that is robust in a Nash equilibria sense. It still amazes me how many distributed systems are built on protocol designs that are provably unstable and inefficient.

- Space decomposition data structures. The only class of data structure that is efficiently distributable at massive scales due to their mathematical relationship to space-filling curves. Hash tables are often inadvertently implemented as space decomposition structures but there is a much richer and more expressive universe of data structures that exist.

- Game theoretic schedule awareness. This is a concept sometimes used in massively parallel systems for HPC but distributed systems designers seem to be unaware of it. It allows extremely high throughput by eliminating much of the need for distributed coordination with respect to accessing shared resources because every process can dynamically schedule its operations based on its modeling of the decisions of other schedulers it interacts with such that it almost never conflicts. I've never seen a paper on it but people have designed systems based on it (hell, I've designed a number of massively parallel systems based on it).

- Practical topology. On the surface it is about understanding how to maximize the throughput of a fabric of switch fabrics and most messaging patterns used in distributed systems today are naive at an elementary level; HPC has a much better understanding of packet flow optimization. There is also the more theoretical algebraic topology that hints how you efficiently do computational operations over space decomposition structures (mentioned above) with minimal data motion. Algebraic topology is enormously relevant to massively distributed and parallel databases but I rarely meet people who understand it outside of Oracle and similar (not that most of them are using it well).

A lot of distributed systems were engineered without any awareness of these things. Most of their fundamental weaknesses follow from that. The kinds of distributed systems that you can design if you really know the above topics greatly exceed the capabilities of popular distributed systems used today.

To reduce it to a simple metric: if you can show me a competitive algorithm design for the Graph500 benchmark, you actually understand massively scalable distributed systems. The efficiency at scale demonstrated in that benchmark is so far beyond common distributed systems because the designers of the top entries actually understand the above points I raised.

tburns · on Aug 10, 2014

Any lists/specific resources you would suggest that cover the things you mention well? This is a topic I'm actively trying to break into the basics of :)

gima · on Aug 10, 2014

Instead of providing resources, continuation, or even references to these topics for interested minds to dive into, I find bashing and gloating. Or am I reading this completely wrong?

Don't get me wrong, it's good to point out the writing's shortcomings, but as you appear to be knownledgeable in the field, please don't force others to begin from the bottom. Share your knownledge.

Game_Ender · on Aug 10, 2014

Took a little bit of reading but the OP is founder of company [0] making a massively scalable geospatial database. This is the closest article [1] I can find which has anything close to details. It lists the following technologies:

* A low level stack that bypasses the OS for at least disk IO (supposed to provide a 2-3x throughput increase)

* "polymorphic" space-filling curves for distributed parallel indexing

* hyper-dimensional spatial sieves [2]

* Allen’s Interval Algebra to parallelise SQL statements

More discussion on some of the OP's previous hacker news posts [3].

[0] - http://spacecurve.com/

[1] - http://www.it-director.com/blogs/Bloor_IM_Blog/2014/7/how-do...

[2] - http://www.google.com/patents/US20090182837

[3] - https://news.ycombinator.com/item?id=7482151

zodiac · on Aug 10, 2014

I see a listing of what the link lacks. It's not the parent's responsibility to provide references without being asked to do so - gathering resources for this kind of post can easily take 2-3x the effort of writing up the original post, especially if the parent's author hasn't studied the subject formally recently. Also, he doesn't know that there's demand for more references without first posting, and if we expect all such posts to contain references, we discourage their authors from writing them.

jandrewrogers · on Aug 10, 2014

Most of these topics have scant literature, or what is there is very theoretical in nature rather than something directly reducible to practice.

Greedy routing theory does have a lot of literature around it because it is used a lot in Layer-2 and Layer-3 packet routing protocols to optimize aggregate throughput of inherently decentralized systems. A lot of the robustness of modern IP networks are explained by this. However, above Layer-3 and particularly at the application level you almost never see properly designed distributed protocols; the people that design L2 routing protocols are not the people that design distributed systems and the knowledge is not transferred. (Admittedly this is a difficult mathematical area to understand with a lot of open problems. I know just enough to make good design decision but otherwise do not understand the underlying math.)

I've stated many times that the HPC algorithm people could learn a lot from the people that design distributed databases and that distributed databases theory can learn a lot from the people that design massively parallel HPC algorithms. As far as I can tell, those two groups of people do not talk to each other. I just happen to have done considerable R&D in both fields so I see what people in both domains are missing.

I've recently been tasked with writing about some of these topics, which should be interesting. My intent was not to bash or gloat but to highlight the point that we can do much better than we (or even I) are currently doing with distributed systems.

blutoot · on Aug 10, 2014

Will you open source SpaceCurve so that we can learn from the implementation of all of the points you mentioned?

i0exception · on Aug 10, 2014

I've found these video lectures by professor Seif Haridi to be really useful. http://www.ict.kth.se/courses/ID2203/video_lectures.html

Edit : https://www.youtube.com/playlist?list=PL700757A5D4B3F368

shmerl · on Aug 10, 2014

Just for the reference, with AdblockPlus enabled, the Youtube link images on that page are hidden (at first I couldn't find any videos).

jaekwon · on Aug 10, 2014

You must go beyond FLP impossibility. The problem with just stopping at understanding FLP is that it assumes too little about what is available. No assumptions of synchrony, no assumptions of clocks, then sure, consensus is impossible in the presence of faults. The real interesting question is, what realistic assumptions can I make to overcome FLP? I believe the CAP theorem is similar.

Also, if you're interested in cryptocurrencies or distributed systems with "greedy" participants, that's another class of problem that goes a level beyond byzantine consensus. The difference is that you cannot assume that even the "good" participants are running the officially sanctioned software, but rather you must assume that there may be colluders who attempt to game the network.

Byzantine generals problem + game theory -> ?

HenryR · on Aug 10, 2014

Here's one paper on the intersection of Byzantine fault tolerance, altruiusm and rational behaviour from UT Austin: https://www.cs.utexas.edu/lasr/download.php?uid=63

streametry · on Aug 10, 2014

The article doesn't directly mention network partitions and split-brain syndrome. Luckily there is Aphyr's Jepsen series [1] with both theory and practice.

[1] http://aphyr.com/tags/Jepsen

krat0sprakhar · on Aug 10, 2014

This is really awesome. I've been spending my nights on improving my understanding and was wishing for a resource exactly like this. I've been going through Prof Aspnes (of Yale) notes - http://www.cs.yale.edu/homes/aspnes/classes/469/notes-2011.p...

A question to the HN folk - For a programmer who spends his day time writing web applications and no systems experience, are there any good project ideas you guys have to better understand Distributed systems? I was thinking of implementing Paxos in Go / Scala but something more practical would be better.

owlish · on Aug 10, 2014

I've been looking at MIT's Distributed Systems class which uses Go (posted here about a year ago).

Original Link: http://css.csail.mit.edu/6.824/2014/labs/lab-1.html

HN comments: https://news.ycombinator.com/item?id=5192650

bochi · on Aug 10, 2014

This article is the best introduction to the topic in my opinion: http://steve.vinoski.net/pdf/IC-Rediscovering-Distributed-Sy...

Another good reference for understanding distributed databases is: http://cs-www.cs.yale.edu/homes/dna/papers/abadi-pacelc.pdf

blutoot · on Aug 10, 2014

I would like to know if any site reliability engineer or sys-admin or someone in any (dev)ops role ever seeks the help of distributed systems theory. I have seen many of the real-life systems that are listed in the article thrown in as a recommended reading for preparation for many job interviews. I've just never managed to figure out in what situations or scenarios (especially troubleshooting-related) the knowledge of the design or the design principles behind a distributed system comes really handy for a person in any of the roles I mentioned above. Any enlightenment will be greatly appreciated.

(EDIT: I guess what I'm trying to get at is that it would be nice to read a detailed post about how someone who maintains a big distributed system handled a major troubleshooting or scaling problem and how, if any, distributed systems principle or theory came handy in that exercise.)

ajdecon · on Aug 10, 2014

I don't have time to write a detailed post, but would a book recommendation work? :)

"The Practice of Cloud System Administration: Designing and Operating Large Distributed Systems", coming out next month, provides a good viewpoint on how distributed systems design knowledge is useful to someone in a sysadmin role. (I've read a preview via Safari Online and found it an excellent resource.)

[0] http://www.amazon.com/The-Practice-Cloud-System-Administrati...

blutoot · on Aug 10, 2014

Thanks! I bookmarked this one a few weeks ago and can't wait for it to come out next month.