Sandstorm – An open source platform for personal servers

amirmc · on Jan 30, 2015

Things like Sandstorm are really important as they allow people options without becoming a full-blown sysadmin. Just look at how many crowd funding ideas are based on 'personal cloud' concepts.

However, we also need to work on the fundamental problems to make it easier to build decentralised products in the first place (not everything is a web-app). Namely, how such apps are built, how they store/sync data, and how we deal with identity. The current tools simply aren't designed for the world we're heading towards, so we need to re-evaluate our assumptions. On top of this is the need for business models that don't rely on mass data collection (eg advertising) -- we can't rely on everything being open source but the underlying infrastructure must be.

There are many ways forward and the particular approach I'm taking is based on unikernels and creating a modern stack to deal with the above issues directly. There's more info at http://amirchaudhry.com/brewing-miso-to-serve-nymote/

If anyone happens to be at FOSDEM this weekend I'd be happy to chat about these ideas in person.

lewisl9029 · on Jan 30, 2015

In terms of storage, here's something I've been working with for one of my current projects:

http://remotestorage.io/

Open protocol for synchronized personal storage. It uses a decentralized model where users provide and pay for their own storage. Could be game-changing if it takes off.

edwintorok · on Jan 30, 2015

I've quickly read through your RFC, and since I've recently added support for Content-Range to LibreS3[0] that is the first thing I looked for, and sure enough you support it.

Although it is not clear to me why you need to use webfinger to announce Content-Range support. There is Accept-Ranges header, or you can detect that you get a 200 instead of a 206 reply on a Range request for GET.
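
A minimal sketch of the two checks mentioned above, in Python. The function names are made up for illustration, and the inputs are just a status code and a header mapping from whatever HTTP client is in use; nothing here is taken from the remoteStorage spec or from LibreS3.

```python
from typing import Mapping


def advertises_ranges(headers: Mapping[str, str]) -> bool:
    """True if a response advertises byte-range support via Accept-Ranges."""
    # Header names are case-insensitive, so normalise them first.
    normalized = {name.lower(): value for name, value in headers.items()}
    units = normalized.get("accept-ranges", "")
    return "bytes" in [unit.strip().lower() for unit in units.split(",")]


def honoured_range(status: int) -> bool:
    """True if a GET sent with a Range header was answered with a partial body.

    206 means the server returned only the requested range; a plain 200
    means it ignored the Range header and sent the whole representation.
    """
    return status == 206
```

Either check alone is enough for a client to decide whether to fall back to full downloads, which is why a separate announcement mechanism looks redundant for GET.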

For PUT RFC7233 says "A server MUST ignore a Range header field received with a request method other than GET." so I'm not sure how that would work there, can you give an example?

[0] http://blog.skylable.com/2015/01/build-your-own-cdn-with-sky...

lewisl9029 · on Jan 30, 2015

I should have probably made it a bit more clear that I'm only working WITH remoteStorage on one of my side projects, and am not one of its developers. =)

kentonv · on Jan 30, 2015

remoteStorage is an elegant solution in a lot of ways, but there are some problems:

* Apps written in this style have a very hard time being real-time collaborative. Maybe this will be solved by WebRTC, but it's a lot easier with a server component. (Sandstorm has several real-time collaborative apps, like EtherPad, EtherCalc, and Wave. Currently you can share these apps with other users by copy/pasting the URL; the sharing model will get more sophisticated eventually).

* You need to set up a storage server for the apps to use, or use one of the big providers that like to do data mining. You then need to connect the app to the correct server, which is extra busy-work.

* The permission model of these storage servers isn't terribly sophisticated. Often you'll end up granting the app broader permissions than you really want.

* There is, of course, nothing stopping the app from storing data to other places as well, nor is there any isolation between separate documents opened by the same app. (On Sandstorm, each document is a separate instance of the app in a separate sandbox which cannot communicate with the outside world without permission.)

Edit: Missed that you said you're working on RemoteStorage. Probably would have phrased differently if I noticed. Wasn't meant to be an attack. Sorry! (We actually like remoteStorage apps a lot in that they are often easily ported to Sandstorm. :) )

lewisl9029 · on Jan 30, 2015

Actually you got it right the first time: Working with* not on. =) My wording could probably have been a bit more precise.

Having worked with it for the past few months, I do agree with a lot of your sentiments on the shortcomings of the server-less application model, but in many applications where user privacy is a high priority (which is increasingly the case since the Snowden revelations), having a fully portable, standards-based storage solution that you can host on your own if you choose is incredibly compelling.

With the remoteStorage-based serverless app I'm working on right now, it's been quite a challenge trying to reach feature parity with existing client-server apps on the market today. But once we even get close to achieving feature parity, I believe we'll have a very compelling solution.

nileshtrivedi · on Jan 30, 2015

There is also https://tent.io/ It seemed quite promising to me but there haven't been updates from the team lately.

seagreen · on Jan 30, 2015

They got into YC with Flynn (https://flynn.io/) which kept them busy for a while, but Tent 0.4 development is starting to pick up again.

wyldfire · on Jan 30, 2015

I think Storj may have some similarities.

http://storj.io/

mempko · on Jan 30, 2015

I love the idea of unikernels! I took a much higher-level approach for p2p apps with Fire★ (http://firestr.com), which is GUI-centric and has a built-in app editor (all application code is viewable/editable).

My only concern with the unikernel approach is that you can end up with a system where the code is not viewable and editable.

Have you thought about that concern?

paulproteus · on Jan 30, 2015

I'm at FOSDEM! I'm on #fosdem and #sandstorm on freenode -- my nick is paulproteus. Also I work on+for Sandstorm.

greggman · on Jan 31, 2015

I know this is going to sound ridiculous and maybe this is the wrong topic. I'm at a spot where I wish there was a closer to turn key framework for my project.

I wrote 3 interdependent web servers and have them all running on the same virtual machine on digital ocean with varnish. Now I need to upgrade ubuntu and I don't want them to go down. So, apparently I need at least 3 machines if not more. One to run varnish or something similar, to direct to the other 2. Some way to bleed people over to one of the 2 machines. Once everyone is on one machine I can upgrade the unused machine. After that I can bleed them all back and then upgrade the other.

Is this too small a niche? Is the answer I should have used "google cloud"? It just seems like each step is so much work. I certainly learned a lot (vagrant, ansible, and other stuff) although all of that knowledge will be probably obsolete before I need to do this again.

Is there something I should have looked at? Sandstorm seems one level down. Like it's more like a replacement for the old lamp/cpanel isps.

e12e · on Jan 31, 2015

In your scenario, you need "just" two machines - and really only one running most of the time. You have varnish+appservers running on one vm now. What you need is one more vm, running the same stack. Depending on how you handle sessions this could be as easy as:

    0: reduce ttl of dns to ~60s
    1: bring up vm2==vm1
    2: upgrade vm2
    2.1: test vm2
    2.2: shutdown vm2, take DO snapshot
    2.3: instantiate vm/droplet of vm2
    3: point/change dns from vm1 to vm2
    4: wait for traffic to die on vm1
    5: shutdown vm1
    6: possibly increase ttl in dns again

Next round, same procedure. This is not ready for HA -- easiest way there is running 2 vms like now, but with haproxy in front (so the vm has ha-varnish-3appservers), along with heart-beat and a shared ip. I don't think DO supports that -- and it is probably not worth the hassle if you can live with dns changeover-time==downtime (say, minutes modulo wrongly configured dns caches).

With a couple of dedicated ips, you can usually get away with vm2 taking vm1's ip (and vice-versa) -- but again that depends on your session set-up and/or if dropping sessions is ok. With sessions in some kind of replicated cache/db (eg: redis, mysql) -- you could probably set vm2 to slave over/replicate sessions, then do the dns [ed: or ip] switch[over].
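
For the DNS-based procedure in the numbered steps above, the waiting in steps 0, 3 and 4 can be estimated with a little arithmetic. This is a rough sketch only: the class and field names are invented for illustration, times are plain seconds, and the `grace` margin for resolvers that ignore TTLs is a guess you would have to pick yourself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Cutover:
    old_ttl: int  # TTL (seconds) the record had before step 0
    low_ttl: int  # reduced TTL set in step 0, e.g. 60
    grace: int    # extra margin for caches that overstay the TTL

    def earliest_dns_switch(self, ttl_lowered_at: float) -> float:
        """Step 3 should wait until copies cached with the old TTL expire."""
        return ttl_lowered_at + self.old_ttl

    def earliest_vm1_shutdown(self, dns_switched_at: float) -> float:
        """Step 5 should wait until copies cached with the low TTL expire."""
        return dns_switched_at + self.low_ttl + self.grace


plan = Cutover(old_ttl=3600, low_ttl=60, grace=300)
switch_at = plan.earliest_dns_switch(ttl_lowered_at=0.0)
shutdown_at = plan.earliest_vm1_shutdown(dns_switched_at=switch_at)
```

In practice step 4 ("wait for traffic to die") is better confirmed from vm1's access logs than from the clock alone; the numbers only give a lower bound.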

greggman · on Jan 31, 2015

this is all great. thanks!!

My real point was why isn't this a finished product yet kind of like Sandstorm? Basically why hasn't someone made a system that just handles this for me. I start n services, when I want to upgrade something it provides a simple UI or a couple of command line options to just do it. Why is it all done by hand?

It seems like enough devs might need this now-a-days.

Note: I get why. I guess my point is there's an opportunity here and Sandstorm made me think of it.

ayrx · on Jan 31, 2015

There is. It's called a PaaS. You write the app, the PaaS provider handles the rest.

olalonde · on Jan 31, 2015

I'm using Docker + Fig. The nice thing is that you can exactly replicate your production environment on your local machine and the Dockerfile syntax is really simple. The not so nice thing is that there's no "simple" way to run docker containers on multiple physical/virtual machines right now (that should become easier once Docker team releases their cluster solution). It would be nice to have a fig registry where people can publish complete Docker based systems (e.g. varnish + nginx + Node.js server) but I don't think there is such thing available at the moment. I wrote a bit about that idea here: http://syskall.com/crazy-and-not-so-crazy-startup-ideas-2015...

Also, I asked a similar question on HN one month ago and got some interesting replies: https://news.ycombinator.com/item?id=8805952

mixmastamyk · on Jan 31, 2015

Heroku might be an option (in the future).

jackweirdy · on Jan 30, 2015

Will you be talking about Mirage/other bits of your work at FOSDEM, or are you “just” attending?

amirmc · on Jan 30, 2015

Not speaking but doing a demo at the Xen booth on Saturday (likely 1pm-3pm but email me to be sure).

The repo has details about the demo if you're interested (though it's probably also a spoiler) https://github.com/amirmc/fosdemo

themoonbus · on Jan 30, 2015

Here is how to write a landing page.

Big letters: "Sandstorm makes it easier to do XYZ" or "Sandstorm lets you do XYZ by doing XYZ"

E.g. "Sandstorm lets you run your own personal web applications without needing a background in IT!"

or "Sandstorm lets you install personal web apps as easily as you install mobile apps!"

3 examples of what this could actually mean for 80% of your users "Run your own Dropbox!" "Host your own WordPress Blog!" "Get a mailbox to match your personalized email address!"

THEN drill down into what it actually is (Sandstorm is an open source platform that makes it easier to run and manage your own personal server, yadda yadda), and its more specific features, such as usability, security, etc.

(This advice operates under the assumption that "individuals" are your main target audience.)

larrybud · on Jan 30, 2015

+1 I went to Sandstorm's home page, and spent the first 5 minutes trying to figure out what it is & what it does.

"Sandstorm is an open source platform for personal servers". Ok, fine, but what is it REALLY? What does it do? Why is it better than (hosting / VPS hosting / AWS / Docker / PaaS)? Give me some examples of what I can do with it.

Sorry, you lost me with your home page.

CalRobert · on Jan 30, 2015

It's better because the internet was supposed to be a bunch of computers talking to each other. It was a beautiful vision. Instead, it's been centralized on two levels:

FB/Google/Instagram etc. serving content, and AWS/DigitalOcean owning the hardware for those intrepid individuals who want to roll their own solutions.

The internet wasn't supposed to be Amazon, Google, and Facebook all talking to each other. It's scary that ISPs don't even want you to host your own (modest) server. It's SUPPOSED to be a bunch of computers networked together! Sandstorm makes it easier to live that vision where you own the hardware, or at LEAST have full control over your cloud. It doesn't necessarily need to be your home computer - a colo'ed odroid (or RPi if your needs are modest) would do the trick too. As more and more of the internet is gobbled up by VPS services I think it's important that the average Jane or Joe can still put together their own website, blog, game server, etc. and not be reliant on a company for it.

apenguin · on Jan 31, 2015

Unfortunately, I don't see how this solves the problem. The main problem for me is this part of my ISP's ToS: "Users may not run any type of server on the system."

Further, every ISP I've ever had has had some such clause. I'd have to get a business plan to actually be allowed to run a server. So who is this, or any home server software, even for?

(Or at least, who in the US)

kentonv · on Jan 31, 2015

Sandstorm is not necessarily about running the server at home (though you can). It's more about being able to choose what is on your server and control how your data is stored and accessed, whether on a home machine or running in a datacenter.

ausjke · on Jan 31, 2015

I think I can use apt-get/rpm/etc to control what's on my server already these days?

kentonv · on Feb 1, 2015

Sure, but only if all of the following are true:

1) You understand how to use the Unix shell and everything else that goes into maintaining a Unix machine.

2) You have the time to do it. (This is what has always stopped me, FWIW.)

3) You are willing to spend money on a machine that has sufficient resources to be responsive when you use it but sits idle 99% of the time since you're the only user.

These obstacles are what drive people to SaaS, where they no longer have freedom to install arbitrary software.

CalRobert · on Jan 31, 2015

Unfortunately we lost all of the small independent ISPs that offered any semblance of competition. I just host my own stuff anyway with an ISP known to be pretty relaxed about it. You're right, though, it's a major problem.

roadnottaken · on Jan 30, 2015

I had the same experience. Still don't get it.

kentonv · on Jan 30, 2015

Question: Did either of you try the demo?

(I know it was kind of broken under all the traffic earlier.)

Usually the demo is what makes people "get it".

It's been surprisingly difficult to find a sentence or two that describe Sandstorm in a way that is effective on everyone. For any text we use, different sets of people get it or are confused. :/

themoonbus · on Jan 30, 2015

People on the internet are easily distracted and have short attention spans. You want them to get interested enough to actually run your demo. I'm not going to take 10 minutes to delve deeper unless you hook me to begin with.

Also, you don't need to have text that appeals to everyone (there is no "average" user), but you should be able to write text that appeals to at least one of your groups (individuals, developers, enterprise). The two sentences you have currently are so generic that they don't say anything at all. An open source platform? An open source platform that does what?

Target the group with your messaging that you are targeting with your platform. Sure, sandstorm could be used by any of them, but which group is MOST important to your platform?

rpdillon · on Jan 30, 2015

10 minutes? The demo allows you to set up a Wordpress blog in literally 10 seconds (it's four clicks and no typing or scrolling -- not even to log in). I'm not sure why that's so onerous, even for folks with short attention spans.

themoonbus · on Jan 30, 2015

That's such a great line that you just wrote: "You can setup a WordPress blog in 10 seconds". Why don't you say that under the demo link? Or say, "Try our demo. It takes 10 seconds to install WordPress" or whatever app.

It's not onerous at all, but you have to get people to the point where they're actually at the demo. My "10 minutes" was based on the thought process that goes through my head when I see a "try our demo" link. If the demo takes only 10 seconds, that's highlighting a major selling point of your platform, so make that explicit.

kentonv · on Jan 30, 2015

(Note that rpdillon is just a commenter, not a Sandstorm dev. But, yes, this is a good idea.)

lurcio · on Jan 31, 2015

This exchange made me smile. It got to the benefits in the end though.

How about 'Run X in Y minutes' (somewhat after the Learn X in Y series).

kentonv · on Jan 30, 2015

Admittedly people don't find that out until they've already decided to click, which they might not if they expect it will take 10 minutes. :)

Maybe we should change the button text to "60-second demo" or something...

kentonv · on Jan 30, 2015

I've added a brief subtext based in part on your suggestions. Let me know what you think.

themoonbus · on Jan 30, 2015

Looks good! I would think about replacing your main tagline with that sentence, or something like it. Not to be overly harsh, but your main tagline doesn't say anything.

Edit: Actually, I think if you said something like "Sandstorm is an open source app platform for personal servers" that would be a major improvement. The whole "app" part is missing from the main tagline. Then, your sub-tagline goes into more detail about what apps.

Edit 2: Actually, I would remove the open source part altogether. It's redundant if you have a github link somewhere on your page, which you do, and I think the developer community you are targeting would assume that it's open source. Or, just keep "open".

kentonv · on Jan 30, 2015

Actually, the words "open source" are a recent addition to our header, whereas we've always had the github link. We discovered from feedback that many people who visited our page had no idea that it was open source, since most people don't look at nav bars, and this of course completely changed their perception of the project (for the worse, obviously). When we put "open source" into the header, we saw a marked increase in interest.

Thanks for the feedback, though! We'll think about inserting "app" in there.

themoonbus · on Jan 30, 2015

Ah, gotcha. Makes sense!

bootload · on Jan 31, 2015

"... Usually the demo is what makes people "get it"...."

Kenton, I'd agree with this approach.

The killer approach is for newly created applications, ported to sandstorm to take advantage of the isolation, security and scalability.

So one area to look at, might be extra tools/paths to port, maintain and expand development.

For cough, Microsoft (platform), it was VB, the killer app. For Linux (platform) it was Apache (killer app). So "the path" to get applications on Sandstorm (platform) to create a killer app, might be the answer.

==== background ====

In fact, one way would be to ask what apps people (HN for example) already use and what problems they have. You need a feel for the numbers of applications companies/startups use. Is it technical? Is it business related? Is it cost?

Install it (if it's ported) and work a discussion around it. For example the reader who chimed in on creating a page on Wordpress - show the path to do that.

Another one I'd suggest as a side-business/demo is a collaborative editor (hello etherpad). [0] I know for a fact google, for example, use some crappy Doc editor (sans the nice editor features) to screen candidates. So there's a demand there.

For the technical minded, poking around https://capnproto.org/ really explains what sandstorm servers can do.

[0] https://sandstorm.io/apps/?host=https://demo.sandstorm.io

syllogism · on Jan 31, 2015

So write more than two sentences? Tell the story.

There's this idea around that people don't read text. What if it's just that most text sucks? Web designers end up writing web pages with text that isn't really designed to be read, so people don't read it, and then it gets optimized away. The result is often really weird. It's this blank pastel page with some vague promises and a SIGN UP NOW button. Zombo.com all over again.

I thought your Cap'n Proto page did a good job of this actually! It tells the story.

username223 · on Jan 31, 2015

> What if it's just that most text sucks? Web designers end up writing web pages with text that isn't really designed to be read,...

Pretty much. "Web designers" are mainly focused on extracting money from credit card numbers. Humans are an annoying intermediary.

akanet · on Jan 30, 2015

Sure, but I think almost anything is better here than "Sandstorm is an open source platform for personal servers."

sinatra · on Jan 30, 2015

Great suggestions! I'd say "Sandstorm lets you install server apps as easily as you install mobile apps!"

themoonbus · on Jan 30, 2015

Yes, I definitely think something along those lines, especially if you're targeting a group of people who are somewhat tech savvy, but not tech savvy enough to run their own server.

taivare · on Jan 31, 2015

and for the ad campaign.." your Server will be with you "

resu · on Jan 31, 2015

If anything, the call for action ("Try the demo" button) should be made MUCH more noticeable.

I was thinking the same thoughts as you because who wants to try the half-assed barely functional demos that most sites put up?

Well, I decided to give the demo a try anyway, and damn am I impressed.

Almost didn't click it though. So hey, make it pop!

coldpie · on Jan 30, 2015

Good ideas, but make sure you hire a copy editor! "Let's" is short for "let us", like "let's go to the store." "Lets" is a conjugation of "to let" (i.e. to enable) like "Sandstorm lets you do XYZ."

themoonbus · on Jan 30, 2015

Oops, fixed. Yes, don't take copy directly from a hastily written Hacker News post :)

qiqing · on Jan 30, 2015

Thanks for your feedback!

ozh · on Jan 30, 2015

Sounds cool but I think the WordPress implementation is TERRIBLE: it depends on a WordPress fork that is completely outdated, instead of downloading an up-to-date fresh archive.

paulproteus · on Jan 30, 2015

I agree -- the current WordPress package needs work. Thank you for trying it and looking into it!

Community-wise, one thing we're going to need, as Sandstorm grows, is an ecosystem of app package maintainers. Part of what we're hoping is that more developers of the apps themselves will maintain the Sandstorm ports, like Audrey Tang is maintaining the EtherCalc port.

Tech-wise, one thing we're going to need is a solid story for how Sandstorm packages will easily stay up to date with the latest changes as the upstream author releases new updates.

I work on+for Sandstorm, and I'm also a Debian developer. Debian is not a shining example with regard to either of the above, and I'm sure we can do even better at Sandstorm.

tokenizerrr · on Jan 30, 2015

Is it possible to run arbitrary Docker containers? If so, that could be a solution.

paulproteus · on Jan 30, 2015

It's not currently possible to run arbitrary Docker containers through Sandstorm, since we prefer app packages (we call them SPKs) to be:

* Self-contained -- if the app needs MySQL, bundle it;

* Able to run with external network access unavailable -- this improves security, since even if an app gets compromised, it's not a big deal since it can't leak any data out to the world;

and a few other constraints that are more technical than philosophical.

https://github.com/sandstorm-io/sandstorm/wiki/Porting-Guide hints at them, but I don't quickly find a reference for all these constraints. I'm likely to write such a reference in the next few days/weeks, though.

ohyeshedid · on Jan 30, 2015

These preferences of yours appear to be a step back from installer scripts like Softaculous, Fantastico, etc.

It's a nifty/fun project, but why in the world would I bundle MySQL with a simple blog platform?

It seems like you're trying to reinvent the wheel, and making it less functional in the process.

kentonv · on Jan 30, 2015

We actually don't want apps to bundle MySQL -- we'd prefer they use sqlite. :) But the point is, it's up to the app. The app gets a slice of filesystem and they can use whatever infrastructure they want to store stuff to it.

We want the experience for users to be install app, use app, without worrying about setting up databases and such. We also want to enforce isolation between apps so one app cannot access another's data, and that's a lot easier to do if they aren't sharing a database. Considering these desires, it makes sense to say that apps should simply bundle their database of choice.
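
As a rough illustration of the "slice of filesystem plus a bundled database" idea, here is a minimal Python sketch using the standard `sqlite3` module. The directory layout, file name and `posts` table are assumptions made up for the example; the thread does not say where Sandstorm mounts an app's storage.

```python
import sqlite3
from pathlib import Path
from typing import List


def open_store(data_dir: Path) -> sqlite3.Connection:
    """Open (or create) the app's private database inside its own data dir."""
    data_dir.mkdir(parents=True, exist_ok=True)
    conn = sqlite3.connect(data_dir / "blog.sqlite3")  # hypothetical file name
    conn.execute(
        "CREATE TABLE IF NOT EXISTS posts ("
        "id INTEGER PRIMARY KEY, title TEXT NOT NULL, body TEXT NOT NULL)"
    )
    conn.commit()
    return conn


def add_post(conn: sqlite3.Connection, title: str, body: str) -> int:
    """Insert one post and return its row id."""
    cur = conn.execute(
        "INSERT INTO posts (title, body) VALUES (?, ?)", (title, body)
    )
    conn.commit()
    return cur.lastrowid


def titles(conn: sqlite3.Connection) -> List[str]:
    """List post titles in insertion order."""
    return [row[0] for row in conn.execute("SELECT title FROM posts ORDER BY id")]
```

Because the database is a single file under the app's own directory, there is no shared server process for another app to reach into, and nothing for the user to set up before first use.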

wmf · on Jan 30, 2015

> why in the world would I bundle MySQL with a simple blog platform?

To make sure it actually works. (Although as others have said in this thread, it should probably use SQLite instead of MySQL.)

ocdtrekkie · on Jan 30, 2015

Sarien: It's not allowed to talk to the outside world except through Sandstorm via specific APIs.

anonymousDan · on Jan 30, 2015

Even so, I don't see how you could distinguish between a compromised app sending data to the outside world and an uncompromised app doing the same as part of its normal operation.

Sarien · on Jan 30, 2015

That's just marketing bullshit. Unless the API is magic (and I don't mean advanced technology "magic" but Harry Potter "magic") it has no way of knowing what the application is allowed to send or not and therefore cannot filter. It's like saying it cannot leak data because it has to use HTTP.

kentonv · on Jan 30, 2015

Hi, lead developer of Sandstorm here.

> That's just marketing bullshit.

No, it isn't.

> Unless the API is magic (and I don't mean advanced technology "magic" but Harry Potter "magic") it has no way of knowing what the application is allowed to send or not and therefore cannot filter.

You're assuming that Sandstorm apps have arbitrary IP network access. They do not.

Sandstorm is based on capability-based security. Any outgoing request has to be addressed to a capability representing some specific permission that the user has granted to the app. A capability might point to another app, or it might point to a specific external host that the user has designated.

More specifically, a Sandstorm app's only connection to the outside world is through Cap'n Proto RPC, which is an object-capability protocol, meaning that an app can only send requests to objects to which it has explicitly received a reference.

https://blog.sandstorm.io/news/2014-12-15-capnproto-0.5.html

https://capnproto.org/cxxrpc.html

Incoming HTTP to a Sandstorm app actually happens through this Cap'n Proto protocol:

https://github.com/sandstorm-io/sandstorm/blob/master/src/sa...

Of course, for backwards-compatibility, we have translation layers so that apps written to use regular old HTTP need not be entirely rewritten. You just have to tweak it to make the correct permissions request first, which has proven not very hard in practice.
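
The object-capability idea described above can be modelled in a few lines of Python. This is a toy model, not Cap'n Proto and not Sandstorm's API: `HostCapability`, `App` and the `transport` callable are all invented for illustration, and Python itself cannot enforce the isolation (a real system relies on the sandbox so that the app has no other route to the network).

```python
from typing import Callable, Dict

# A transport takes (host, path) and returns a response body.
Transport = Callable[[str, str], str]


class HostCapability:
    """A reference that permits requests to one designated host only."""

    def __init__(self, host: str, transport: Transport) -> None:
        self._host = host
        self._transport = transport

    def request(self, path: str) -> str:
        # The holder chooses the path but never the host.
        return self._transport(self._host, path)


class App:
    """An app that can only talk through capabilities it has been handed."""

    def __init__(self) -> None:
        self._caps: Dict[str, HostCapability] = {}

    def grant(self, name: str, cap: HostCapability) -> None:
        # Called by the platform after the user approves a permission.
        self._caps[name] = cap

    def fetch(self, name: str, path: str) -> str:
        cap = self._caps.get(name)
        if cap is None:
            raise PermissionError(f"no capability named {name!r}")
        return cap.request(path)
```

The point of the model is that there is no "connect to arbitrary address" operation at all: every outgoing request is addressed to a reference, so the question of filtering traffic never arises.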

tomjen3 · on Jan 31, 2015

So can I use sandstorm to run a personal RSS reader? It seems like one of the things it would be well suited for.

kentonv · on Jan 31, 2015

Yes. We have TinyTinyRSS on there now.

Note that Sandstorm is still in development and for the moment we've created a hack to allow ttrss to make arbitrary HTTP requests in order to update feeds.

However, in a few more months this won't be necessary. Instead, when you click "subscribe to feed", the app will call a method on the Sandstorm API saying "Prompt the user for a URL and then give me permission to access it". So, you'll get a dialog box to enter the URL rendered by Sandstorm itself. If you enter a URL, it's plainly obvious that you want the app to have permission to fetch it, so Sandstorm grants said permission. We call this UI the "powerbox".

Notice how the UX here is equivalent to what we have today, where the app renders its own prompt. This technique of inferring security decisions from actions the user was doing anyway is the core of how we plan to implement tight security without inconveniencing the user.
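
The powerbox flow described above can be sketched as follows. Everything here is hypothetical: `Powerbox`, `UrlCapability`, `request_url` and the callables passed in are stand-ins invented for the example, since the feature was still being designed at the time of this thread.

```python
from typing import Callable, Optional


class UrlCapability:
    """Permission to fetch exactly one URL that the user typed in."""

    def __init__(self, url: str, fetch: Callable[[str], str]) -> None:
        self._url = url
        self._fetch = fetch

    def get(self) -> str:
        return self._fetch(self._url)


class Powerbox:
    """Platform-side UI: prompts the user and turns the answer into a grant."""

    def __init__(
        self,
        prompt_user: Callable[[str], Optional[str]],
        fetch: Callable[[str], str],
    ) -> None:
        self._prompt_user = prompt_user  # rendered by the platform, not the app
        self._fetch = fetch

    def request_url(self, reason: str) -> Optional[UrlCapability]:
        url = self._prompt_user(reason)
        if not url:
            return None  # user cancelled, so nothing is granted
        # Typing the URL is itself the security decision.
        return UrlCapability(url, self._fetch)


def subscribe(powerbox: Powerbox) -> Optional[str]:
    """App-side code: ask for a feed and read it through the capability."""
    cap = powerbox.request_url("Subscribe to feed")
    return cap.get() if cap is not None else None
```

From the user's side this is the same single dialog as before; the difference is only in who draws it and what the app ends up holding.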

ocdtrekkie · on Feb 2, 2015

I've been using TinyTinyRSS on Sandstorm for a while. It even has a mobile app that works with Sandstorm's API. (Though it's a fork, not the official Play Store version.)

vertex-four · on Jan 30, 2015

Sandboxed applications literally cannot send any data by default. They can't open a connection to <whatever server>, no matter what protocol.

The goal, once they've built their Powerbox, is to then implement a set of protocol drivers which the application can use. So it still can't connect to arbitrary servers, but it can ask the user for permission to, say, connect via SMTP to <wherever>, and the user has control over that.

Yes, they could leak anything that you put in them if you allow them to connect to someone you don't trust. However, even if you do so once, most applications will be per-document - you have an instance of your document editor for each document, and they don't know anything about any other documents you have.

In short: applications can only leak what you give them, and only to people you say to give them to. They can't call back to home base without your permission or the permission of someone you've given the app permission to contact. So for all reasonable definitions of "cannot leak data", applications cannot leak data without your permission.

abecedarius · on Jan 30, 2015

It's worth keeping covert and side channels in mind, though: e.g. an instance can leak bits by timing variations. Capability security is a big big deal, a qualitative change in the game, but I think this comment is over-promising things.

kentonv · on Jan 30, 2015

Yes, covert side channels should always be assumed to be possible.

However, there are two reasons I think you don't need to worry about them too much:

1) They'll typically be fairly expensive and low-bandwidth.

2) They're unambiguously malicious. This is not a technical barrier to using them, but it's a huge political barrier. Today, major developers will happily stick covert statistics gathering into their code, and then when called out on it, will make some contrived argument about how it benefits users (if that's true, why don't you ask them first?) and how it's mentioned in the privacy policy so therefore it's legit. OTOH, you can't exploit a covert channel in Sandstorm and then plausibly claim you haven't done anything wrong.

Some hardcore security nerds will of course scoff at this argument, and to them I can only say: "OK, yes, there are possibly covert channels, sorry. Please don't put sensitive data into an app you don't trust."

A theoretical long-term solution is deterministic computing, but that probably requires apps to be written in a different language or be run in a heavy-handed VM. Not practical at the moment.

It's also worth noting that Sandstorm is designed to make it impossible for an app to leak capabilities via covert channels. They can only leak bits, and a capability is not just bits.

abecedarius · on Jan 31, 2015

Yep, good points; I just think the GP was too absolute. It's good to hear Sandstorm's built on object capabilities instead of password capabilities; since I wasn't sure I didn't get into that, or deafening (determinism to eliminate side channels into a process; I gather that outward is much harder to control).

Sarien · on Jan 30, 2015

How can a COMPROMISED WEBapp ever not be able to leak data while being usable?

paulproteus · on Jan 30, 2015

Here's how:

* Backend: Due to Linux network namespaces, the app can't communicate with the network (except over "sandstorm-http-bridge" which allows it to respond to inbound HTTP requests).

* Frontend: Due to Content-Security-Policy, the client part of the app can't communicate with any hostname other than the one the app runs on. The CSP header is set by Sandstorm, not the app.

So then it has no network access, and therefore even if it is compromised, can't leak anything.
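
To illustrate the frontend half ("the CSP header is set by Sandstorm, not the app"), here is a small WSGI middleware sketch in Python. It is an assumption-laden stand-in: Sandstorm is not a WSGI stack, and the exact policy string it sends is not given in this thread; `default-src 'self'` is simply the standard way to restrict a page to its own origin.

```python
# Policy chosen for illustration; 'self' limits loads to the page's own origin.
POLICY = "default-src 'self'"


def enforce_csp(app):
    """Wrap a WSGI app so the platform's CSP always wins over the app's."""

    def wrapped(environ, start_response):
        def patched_start(status, headers, exc_info=None):
            # Drop any policy the (possibly compromised) app tried to set.
            kept = [
                (name, value)
                for name, value in headers
                if name.lower() != "content-security-policy"
            ]
            kept.append(("Content-Security-Policy", POLICY))
            return start_response(status, kept, exc_info)

        return app(environ, patched_start)

    return wrapped
```

Since the header is rewritten outside the sandbox, a compromised app cannot loosen it by emitting a more permissive policy of its own.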

This does hinge on the app's dynamic code only being run for logged-in users. For many apps -- imagine a Google Docs spreadsheet only accessible to people within your domain -- this is a pretty straightforwardly reasonable model. Sandstorm handles authentication for apps, so it can enforce this even if the app is 0wned.

Sarien · on Jan 30, 2015

I compromise an app, add myself an admin account, log in, download everything. What's stopping me?

kentonv · on Jan 30, 2015

An app does not have the ability to edit who has permissions to itself. In order to add yourself as an admin of some app, you'd have to compromise Sandstorm, not the app.

Sarien · on Jan 30, 2015

I'm not sure how that is supposed to work. You would have to rewrite every webapp so that its data can be protected by Sandstorm, which seems hugely impractical. And as long as the webapp has access to the data, compromising the app will compromise the data.

kentonv · on Jan 30, 2015

Not "rewrite". You do have to tweak apps to be Sandstorm-appropriate, but it's usually somewhere between five minutes and a couple days of work. Namely:

* Delete the login system and use Sandstorm's. If you build on Meteor, for instance, this is a simple matter of swapping dependencies.

* Delete your sharing system. If the app hosts multiple things that can be independently shared, change it to host only one such thing. The user can create multiple instances of the app and use Sandstorm's sharing. This is probably the hardest part, but we've done it for several apps now without too much trouble. Since it's largely deleting code, it's not very difficult.

* Find the places where your app connects to the outside world and insert a bit of code to make a Sandstorm powerbox request to get permission first, then address the requests to that permission.

None of this involves "rewriting". We have 20+ apps on the Sandstorm app list, most of which were ported by two people who certainly didn't have time to rewrite each one.
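To give a feel for the first bullet, here is a minimal sketch that assumes the platform's HTTP bridge has already authenticated the user and forwards the identity in request headers. The header names below (`X-Sandstorm-User-Id`, `X-Sandstorm-Username`, `X-Sandstorm-Permissions`) and the `CurrentUser` type are assumptions for illustration; check the bridge documentation before relying on them.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import unquote


@dataclass(frozen=True)
class CurrentUser:
    user_id: Optional[str]   # None when the visitor is anonymous
    name: str
    permissions: frozenset


def user_from_headers(headers: dict) -> CurrentUser:
    """Trust identity headers set by the platform instead of running a login system."""
    h = {k.lower(): v for k, v in headers.items()}
    # Assumed format: a comma-separated list of permission names.
    perms = frozenset(p for p in h.get("x-sandstorm-permissions", "").split(",") if p)
    return CurrentUser(
        user_id=h.get("x-sandstorm-user-id"),
        # Assumed to be percent-encoded by the bridge.
        name=unquote(h.get("x-sandstorm-username", "Anonymous")),
        permissions=perms,
    )


user = user_from_headers({
    "X-Sandstorm-User-Id": "abc123",
    "X-Sandstorm-Username": "Alice%20B",
    "X-Sandstorm-Permissions": "read,write",
})
```

The app's own password tables, session handling, and signup pages are what get deleted; a lookup like this is roughly all that replaces them.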

ocdtrekkie · on Jan 30, 2015

I've ported apps to Sandstorm with literally no prior experience with the languages those apps were written in. Porting to Sandstorm involves more deleting stuff you don't need than actually writing code yourself. :D

jerf · on Jan 30, 2015

The only "everything" you should be able to get, if the security is correct, is for the app you compromised, not the other ones running on Sandstorm. No, it does not magically secure applications put behind it (though IIRC it does put a couple of useful tweaks in place, but that's all it can do), but it can prevent "I compromised your WordPress and stole your entire machine's contents."

ohyeshedid · on Jan 30, 2015

Is that because of the security of Sandstorm as a platform or because each app would need its own DB engine bundled with it?

kentonv · on Jan 30, 2015

It's because of Sandstorm's security as a platform. Apps cannot see each other's files on disk, because each one runs in a container with only its own subdirectory mapped in.

saraid216 · on Jan 30, 2015

Should this vulnerability be hard to demonstrate with a proof of concept? It sounds pretty straightforward.

kentonv · on Jan 30, 2015

In addition to what Asheesh (paulproteus) said, see:

https://blog.sandstorm.io/news/2014-08-19-why-not-run-docker...

mikewhy · on Jan 30, 2015

Disclaimer: I couldn't get this thing to work reliably at all

Panamax[0] might be something to look into. It has app templates that are composed of Docker images.

For example, getting Gitlab up and running is as simple as finding the Gitlab template and pressing "Install".

You can also save templates locally, think "Python Web App (db, cache, app server)".

[0]: http://panamax.io/

rdtsc · on Jan 30, 2015

Btw the main developer, Kenton Varda, is also the author of Cap'n Proto -- a pretty neat serialization & RPC format.

https://capnproto.org/

I was playing with that and stumbled on Sandstorm a while back.

swah · on Jan 30, 2015

He likes LAN parties: http://kentonsprojects.blogspot.com/2011/12/lan-party-optimi...

vtempest · on Jan 30, 2015

I was there! Small world. I would definitely use sandstorm if I wasn't an IT guy.

green7ea · on Jan 31, 2015

Cap'n Proto is awesome in so many ways. I'm trying Sandstorm just for that.

SimplyUseless · on Jan 30, 2015

On their github page, it still shows a warning about not using it for mission critical applications.

https://github.com/sandstorm-io/sandstorm

ocdtrekkie · on Jan 30, 2015

Well, it's kinda an Alpha. Goes without saying, doesn't it?

leonardinius · on Jan 30, 2015

It's the case of me scrolling the site and reading most of the GH readme - and still getting almost no idea what its status is, what the goal/vision is, and how I might use it.

"We do LXC stuff in secure and user friendly way" is the message?

paulproteus · on Jan 30, 2015

Here's my own summary, which if you like it, I can try harder to make sure becomes more prominent somewhere:

Sandstorm is a way to run web apps as containers, gloriously sandboxed from each other, and moreover a web interface to install them easily and allow the user to create multiple instances of a web app easily.

It intends to grow features relating to sharing instances -- so that an instance of a web app is as easy to share with someone else as a Google Docs link -- and grow features relating to supporting more network protocols -- so that apps can safely communicate with the outside world, mediated by the person using Sandstorm.

Right now, the target audience is people who like running web apps like WordPress or Ethercalc on their own server. In the future, the target audience will grow to include companies whose IT departments want to enable users to install web apps safely without asking IT first -- they'll know it's safe due to the glorious sandboxing.

jorjordandan · on Jan 30, 2015

How is glorious sandboxing different from regular sandboxing? Is there a 'glorious' certification?

Just kidding - looks pretty awesome!!

kentonv · on Jan 30, 2015

Did you get a chance to try the demo?

It was pretty overloaded earlier, but is in better shape now.

https://demo.sandstorm.io

debacle · on Jan 30, 2015

I don't understand what this is. Can someone explain? What the hell does "This is the only way to make Open Source web apps viable." mean?

ocdtrekkie · on Jan 30, 2015

The primary idea being that open source web apps can be used without having to be a server admin. Non-technical users can install apps on a Sandstorm server as easily as installing apps on their phone.

cben · on Feb 11, 2015

More details: https://blog.sandstorm.io/news/2014-07-21-open-source-web-ap... The central point (IMO) was economic: self-hosting frees the developer from expenses. Trivial installation is just a prerequisite for self-hosting to be acceptable.

e12e · on Jan 31, 2015

First(?) discussion on hn: https://news.ycombinator.com/item?id=7460828

After about a year -- how tied up is sandstorm to meteor? I confess I have issues with the our-way-or-the-highway nature of meteor (our js, our db, our app-server) -- even if I can see that it does appear to give some pretty nice benefits for rapid prototyping.

I'd love to see sandstorm as a handful of small tools with various uis on top: command line, web, etc. Seems like it should be possible to do with (on the extreme end) Go and a Berkeley DB/SQLite db + file system for images?

kentonv · on Jan 31, 2015

Sandstorm's front-end UI is built on Meteor, but this doesn't affect apps -- they can be written using any stack. We have apps written in Meteor, Express, Rails, PHP, Python, C++, and Rust.

Meteor is actually amazingly modular if you look under the hood. We use it in a fairly default configuration, but it's easy for me to see how I would go about using a different database or a different templating language. Those people write high-quality code.

Eventually I would like to ditch Mongo and instead have Meteor speaking Cap'n Proto RPC to a Cap'n Proto database. I don't expect that I'll have much trouble getting Meteor to do this.

> I'd love to see sandstorm as a handful of small tools with various uis on top: command line, web, etc.

Hmm, not sure I understand what you're suggesting. Sandstorm is all about UI and running web apps, so it seems to me that a "command-line interface to Sandstorm" would really be a whole different project. :)

jorjordandan · on Jan 30, 2015

Would a Sandstorm app prevent you from using, say, Google Analytics or New Relic without getting a user's permission?

rdrey · on Jan 30, 2015

I don't think Sandstorm wants to be a platform on which you host your large user-facing app. Instead it wants to be the platform on which you can install your personal wiki, ipython notebooks, your streaming media library, etc.

You as the owner of the Sandstorm instance will control whether the apps on your instance can send data to Google for Google Analytics, for example.

If you invite someone else to use an app on your Sandstorm instance, they will trust _you_ with their data and you can decide whether the apps on your instance share the data with Google or not.

jorjordandan · on Jan 30, 2015

Ah, that makes much more sense now! I think they need to figure out a better way to spell out their value proposition.

seagreen · on Jan 30, 2015

Does anybody have any thoughts on the differences between Sandstorm, Camlistore, and Tent?

It seems there are a number of problems here. We need:

1. Better data stores
2. Better server environments
3. Better ways to share data with others

I wonder if Camlistore's approach might not be the cleanest, since it doesn't try to bundle (1) and (2) together.

EDIT: Not to get too sappy, but any of these would be _fantastic_ compared to our current Web 2.0 disaster, and I'm glad Sandstorm is picking up steam.

kentonv · on Jan 30, 2015

Camlistore and Tent are both complementary to Sandstorm. Sandstorm gives you a way to run apps easily, Camlistore is a structured storage system which other apps could connect to, and Tent is a federation protocol that apps could use to talk to each other. I'd like to see this all converge at some point. :)

seagreen · on Jan 30, 2015

That's awesome news.

I'm suspicious there's too much overlap between Camlistore and Tent for them to both be useful, since they each do data storage and sharing, but that's not your problem:)

CalRobert · on Jan 30, 2015

This is exactly what needs to exist. I recently set up Ghost, Owncloud, and Gitlab on a personal server (odroid U3) that sits under my couch at home, and it's really rewarding to own the hardware which is my "cloud". However, it should be easier, and possible for anyone. Good for you guys.

hbbio · on Jan 30, 2015

Did you use the Docker images?

I set up a Gitlab using Docker recently, and it was super easy to deploy (using https://github.com/sameersbn/docker-gitlab). On a side note, we also package our own app as Docker containers (https://github.com/MLstate/PEPS).

CalRobert · on Jan 30, 2015

I didn't, embarrassingly because I've never used Docker and didn't feel like learning another tool while I set all this up. I need to though.

hiou · on Jan 31, 2015

I've been using dokku[0] for a while now and love how easy it is to just push random stuff up to a new subdomain. The other day I pushed up a doxygen html of a code base I was working with. I have my blog, portfolio site, random apps I use for myself, a cloud storage app etc.

It is definitely one more tool to learn, but it is pretty much a light wrapper around docker so it ended up being a great gradual introduction to the concepts and configurations of working with docker as well.

Be sure to install either the persistent storage[1] plugin or the docker options plugin[2] so that your apps can just use the file system on the server to make things a lot simpler.

[0] https://github.com/progrium/dokku
[1] https://github.com/dyson/dokku-persistent-storage
[2] https://github.com/dyson/dokku-docker-options

CalRobert · on Jan 31, 2015

Thanks! I'm checking it out now.

maninalift · on Jan 30, 2015

ghost what?

e12e · on Jan 31, 2015

Incidentally, running containers is probably a great way to "install" the ghost libc vulnerability[1] (assuming you're basing off of base-images made before the bug was patched, and you haven't updated your containers/images).

I'm not sure either vagrant or docker has this really fixed -- that is: easily patching the base system/image (while still being confident that the app keeps running).

Is there an easy way to update a container based off of a (possibly few generations remote) base-image? Eg: You've pulled down a bare-bones, official CoreOS/Ubuntu/Debian/RedHat image from docker -- set it up for your use-case (say made a base image with your own CA-cert bundled, wired it up for kerberos/ldap/AD, maybe set up a trusted ssh-server ca-cert) -- then made a handful of images based off that: db, cache, and web-app.

Is there an easy way to patch the base image and all descendants? I assume all state should be in other volumes, so maybe this is easier than I think?
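One way to frame the question, as a sketch only: if you keep a record of which image each image was built from, the rebuild order after patching a base falls out of a simple graph walk. The image names and the `parents` mapping here are hypothetical, and nothing below talks to Docker itself.

```python
from collections import defaultdict, deque
from typing import Optional


def rebuild_order(parents: dict, patched: str) -> list:
    """Return the patched image followed by all of its descendants, parents first."""
    children = defaultdict(list)
    for image, parent in parents.items():
        if parent is not None:
            children[parent].append(image)

    order, queue = [], deque([patched])
    while queue:  # breadth-first, so a parent always precedes its children
        image = queue.popleft()
        order.append(image)
        queue.extend(sorted(children[image]))
    return order


# Hypothetical lineage: official base -> customised company base -> three app images.
parents = {
    "debian": None,
    "company-base": "debian",
    "db": "company-base",
    "cache": "company-base",
    "web-app": "company-base",
}
order = rebuild_order(parents, "debian")
print(order)  # ['debian', 'company-base', 'cache', 'db', 'web-app']
```

This only answers "what needs rebuilding, and in what order"; whether each rebuilt image still works is a separate testing problem.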

At any rate, it is something to keep in mind -- grabbing images is great, but updates are still needed!

As others mention, the Ghost referred to by GP is a blogging platform.

[1] https://news.ycombinator.com/item?id=8953545

kentonv · on Jan 31, 2015

Well, bad news, good news, and curious news:

Bad news: Sandstorm packages do not have any particular separation between "base system" and "app"; your app package is simply one big archive of the entire userspace filesystem needed. This is something we might conceivably do in the future, but for now we like the simplicity.

Good news: Once the app maintainer publishes an updated package, it is trivial to update your local app instances in-place. Much like installing apps on Android, the system just swaps out the old package for the new one without touching the user data. We are confident enough in the robustness of this that we plan to implement auto-updating of apps, again like Android (though you'll be able to turn it off if you prefer).

Curious news: With Sandstorm, it often (not always, but often) doesn't matter if an app has vulnerabilities. Each app instance is initially only accessible by its owner, and only accessible to others if the owner explicitly shares with them. Often, the people you are collaborating with aren't threats -- they're your friends.

Apps that publish a public web site -- like Ghost (the blog platform, not the glibc vulnerability :) ) -- actually do so strictly as static content. Sandstorm serves the content for them, without executing any of the app's code.

Admittedly, this starts to break down if you want to have a public web site in which users can make persistent changes -- say, post comments.

Of course, if someone does compromise one of your app instances, it's only that instance. The rest of your server remains safe, since each app is in an isolated container.

None of this is to say that patching exploits doesn't matter, but security is not about absolutes, it's about risk management. It's significantly less likely that a bug in a Sandstorm app will lead to real damage.

ocdtrekkie · on Jan 30, 2015

Ghost is a blogging platform! https://ghost.org/

iamdave · on Jan 30, 2015

http://ghost.org - it's a markdown based blog engine that gets out of your way and lets you focus on writing.

I run it for my blog and love it. There are a few features that I'd really like to have, but I get around them by editing locally.

CalRobert · on Jan 30, 2015

Below commenters are correct - the blog platform.

teh_klev · on Jan 30, 2015

This probably:

https://ghost.org/

davidjgraph · on Jan 31, 2015

There's a lot of focus on how Sandstorm lets you run web applications easily without having to set up the back-end they would otherwise need.

There is an additional edge case: web applications that don't have a back-end at all. Ours falls into that category. Our web app is a by-product of two of our commercial products, but we don't actually have user management, storage, etc.

Online we integrate with Google Drive and Dropbox, but you can't create an account with us and store your data with us. Sandstorm allows people to deploy our web app, whereas previously they couldn't at all. It saves us months of work creating and maintaining the functionality it provides.

69_years_and · on Jan 31, 2015

Sandstorm - this is wickedly cool, tried the demo and it worked great, can't understand all the 'do this, do that' comments. As someone who is just learning to play with docker - just finished dockerizing all my vps apps so the first thing I think of is there a Dockerfile to build this or a docker image - off to have a look for one. Awesome stuff - like the collection of apps you have ready to go. Maybe you have fixed the landing web page in the meantime but I had no trouble understanding what you are about. 100.times upvote.

kentonv · on Jan 31, 2015

Thanks!

Hello71 · on Jan 31, 2015

> No protection from getting your job done: Security can often be a hassle, getting in the way of your work. Sandstorm is different. When you tell a Sandstorm app to talk to some other app, or to talk to the internet, Sandstorm sees your intent and automatically grants it access. So, you are never interrupted by a prompt asking "Do you want to allow this app to the thing you just told it to do?" And yet, the apps only get the permissions you actually wanted them to have.

So... the program is psychic?

RaleyField · on Jan 31, 2015

> So... the program is psychic?

No, it's just based on DOS.

sarciszewski · on Jan 30, 2015

> Contributors: Jasvir Nagra <img src=x onerror=alert()>

Nice try :)

kentonv · on Jan 31, 2015

I love that Jas (being on the Google security team) just instinctively XSS's any form he fills out, and I love that our code just leaves his tag there, properly escaped, for all to see.
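For anyone wondering what "properly escaped" means in practice, here is a tiny illustration using Python's standard library. It is not the code the Sandstorm site uses, just the same idea: the markup characters are turned into entities so the browser shows the tag as text instead of running it.

```python
import html

name = "Jasvir Nagra <img src=x onerror=alert()>"

# html.escape replaces &, <, > (and quotes) with HTML entities.
escaped = html.escape(name)
print(escaped)  # Jasvir Nagra &lt;img src=x onerror=alert()&gt;
```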

MayanAstronaut · on Jan 30, 2015

Looks great, but memory usage is ambiguous. What if I load a data structure that goes over the 1GB allocation?

If I need to load a 1.2GB dictionary, does the whole thing topple?

kentonv · on Jan 30, 2015

There is no 1GB limit. I think you might be confused about compute units? Compute units are just a measure of RAM usage over time -- a compute unit is 1GB of RAM used for one hour. An app can use more than 1GB of RAM; it will just consume compute units faster. E.g. an app using 2GB will consume a compute unit in 30 minutes.
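The arithmetic, as a small sketch (the function names are made up for illustration):

```python
def compute_units(ram_gb: float, hours: float) -> float:
    """Compute units consumed: 1 unit = 1GB of RAM held for one hour."""
    return ram_gb * hours


def hours_per_unit(ram_gb: float) -> float:
    """How long an app using `ram_gb` of RAM takes to consume one unit."""
    return 1.0 / ram_gb


one_gb_for_an_hour = compute_units(1, 1)    # 1.0 unit
two_gb_burn_time = hours_per_unit(2)        # 0.5 hours, i.e. 30 minutes
```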

This all relates to our upcoming managed hosting. Self-hosted installs are not metered since it's your own hardware.

https://blog.sandstorm.io/news/2015-01-14-compute-units.html

polynomial · on Jan 30, 2015

This looks incredibly useful for things we're currently trying to make Docker do. Huge potential here.

qiqing · on Jan 30, 2015

Hey polynomial, I'd love to learn more about your use case. Drop me a line at jade at sandstorm?

S4M · on Jan 30, 2015

Can you give examples?

bmoresbest55 · on Jan 31, 2015

I am not really sure why this is so exciting to everyone. I have seen a couple of comments asking what this software does, explicitly. I will admit I like the idea of personal servers, but I am not sure how to apply this potentially amazing software to my life. Suggestions?

alfg · on Jan 30, 2015

Very cool project. I've played around with this same idea, but as a CLI package manager, rather than a webapp.

I think it would be cool to have custom VPS image, where you can install webapps to it from the CLI out of the box easily. Sort of like homebrew, but for your personal servers.

Congrats!

api · on Jan 30, 2015

I've been following this for a while and really like it. Have yet to try an install but probably will once it's more mature.

Dumb question: why not build on and leverage Docker? I'm sure you have an answer. I'm just curious.

kentonv · on Jan 30, 2015

Indeed, we have a whole blog post about that. :)

https://blog.sandstorm.io/news/2014-08-19-why-not-run-docker...

kornakiewicz · on Jan 30, 2015

What makes me curious: do you plan to enable an option for selling apps? As a developer I would be more than happy to let users install my app, but since I am not a charity I would like to earn some money from it as well.

kentonv · on Jan 30, 2015

Yes, we'll have an app store much like iPhone/Android. Open source apps will have the option of using a "pay what you want" model, but we'll also allow proprietary apps with fixed prices and maybe even subscription-based.

kornakiewicz · on Jan 30, 2015

Great to read. Fingers crossed for the project.

bovermyer · on Jan 30, 2015

This is actually pretty fantastic. I loved how quickly I could get an app up and running.

I assume that the paid version has a better app search/browse experience? The demo list, while cool, was pretty long and uncategorized.

kentonv · on Jan 30, 2015

Yeah the demo "app list" is just a placeholder. We're working on an "app store" with self-service uploads, searchability, paid apps, pay-what-you-want for open source apps, etc. Once it is ready it will be available to everyone, whether you use self-hosting or managed.

bovermyer · on Jan 31, 2015

I appreciate this very, very much. You have my attention.

j_s · on Jan 30, 2015

The easiest way to get your own LibreBoard while they straighten out their DMCA from Trello! :)

https://news.ycombinator.com/item?id=8936701

JimXugle · on Jan 31, 2015

There's also YunoHost, a project with similar goals. I've played around with it, but it seems to be beta quality. https://yunohost.org

cyberrodent · on Jan 31, 2015

From https://demo.sandstorm.io/demo : "We only have one machine. It may get slow or crashy under excessive load."

kentonv · on Jan 31, 2015

And indeed, when we were #1 on HN yesterday morning with hundreds of apps running concurrently, it got a bit slow and crashy. :) But things cleared up after the traffic died down a bit.

Our upcoming managed hosting service will, of course, use multiple machines with automagic scaling.

iamwil · on Jan 30, 2015

How do you get individually installed apps to selectively share data with other installations? I imagine something like diaspora tried solving this problem. Is there a general solution for any app?

kentonv · on Jan 30, 2015

This is something we're still building (Sandstorm is still in alpha), but what we have in mind we call the Powerbox.

The idea is that one app can say to the system "I implement such and such protocol at such and such endpoint". Later on, some other app can say "I need something implementing such and such protocol". The system itself displays a picker, showing the user all of their other apps that may satisfy the requirement. When the user makes a choice, the requesting app is told how to contact the providing app and is implicitly given permission to do so, whereas prior to the exchange the apps had no ability to talk to each other. The user can inspect these connections later and potentially revoke them.
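A toy model of that flow, to make the steps concrete. This is only a sketch: the `Powerbox` class, its method names, and the app names are invented for illustration, and the grant table stands in for what Sandstorm would do with Cap'n Proto capabilities rather than a lookup list.

```python
from dataclasses import dataclass, field


@dataclass
class Powerbox:
    # protocol name -> {providing app: endpoint}
    offers: dict = field(default_factory=dict)
    # (requesting app, providing app, protocol) triples the user has approved
    grants: set = field(default_factory=set)

    def offer(self, app: str, protocol: str, endpoint: str) -> None:
        """An app announces: 'I implement `protocol` at `endpoint`'."""
        self.offers.setdefault(protocol, {})[app] = endpoint

    def candidates(self, protocol: str) -> list:
        """The apps the picker would show the user for this protocol."""
        return sorted(self.offers.get(protocol, {}))

    def grant(self, requester: str, provider: str, protocol: str) -> str:
        """Record the user's pick and tell the requester where to connect."""
        endpoint = self.offers[protocol][provider]
        self.grants.add((requester, provider, protocol))
        return endpoint

    def allowed(self, requester: str, provider: str, protocol: str) -> bool:
        return (requester, provider, protocol) in self.grants

    def revoke(self, requester: str, provider: str, protocol: str) -> None:
        """The user later inspects the connection and removes it."""
        self.grants.discard((requester, provider, protocol))


pb = Powerbox()
pb.offer("music-library", "audio-stream", "/stream")
pb.offer("podcasts", "audio-stream", "/feed")

choices = pb.candidates("audio-stream")                  # shown in the picker
before = pb.allowed("player", "podcasts", "audio-stream")
endpoint = pb.grant("player", "podcasts", "audio-stream")  # user picks "podcasts"
after = pb.allowed("player", "podcasts", "audio-stream")
pb.revoke("player", "podcasts", "audio-stream")
revoked = pb.allowed("player", "podcasts", "audio-stream")
```

The key property is that nothing is reachable until the user picks it: the requester never sees the full list of apps, only the one endpoint it was granted.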

This is a whole lot easier for the user than going to the providing app and editing an ACL, then going to the requesting app and giving it an endpoint address, etc.

The way this is implemented under the hood is in terms of Cap'n Proto capability-based RPC. Blog post on that:

https://blog.sandstorm.io/news/2014-12-15-capnproto-0.5.html

iamwil · on Jan 30, 2015

That's pretty neat; it's like the Android intent system.

How does the system get the list of all other apps that satisfy the requirement? I'm guessing all apps register with sandstorm server somewhere that has a centralized list of other servers?

It'd be neat to have standard protocols, in the same way we have standard media types.

kentonv · on Jan 30, 2015

The system knows about other apps installed on your server, but not necessarily apps on other servers. To connect to something on another server you will usually want to obtain a Cap'n Proto capability to it. You might do this through, say, a messaging app that has the ability to embed capabilities. Your friend sends you a message with a capability to some object on their server, and then your messaging app publishes that capability on the receiving server, such that it will now appear in the powerbox for other apps on that server.

Alternatively (less cool, but more practical), you might just drop a URL into the Powerbox and Sandstorm will connect to it and turn it into a capability.

3zzy · on Jan 30, 2015

As a designer, the website gave me an 'aha' moment, nicely done! :)

idiotclock · on Jan 30, 2015

Would this also work for a GNU Social or pump.io service? Or diaspora?

ocdtrekkie · on Jan 30, 2015

One of the Sandstorm devs is already working on porting Diaspora. pump.io has been mentioned as a possible thing to port.

falcolas · on Jan 30, 2015

I'm a DevOps developer, so keep that in mind as you read below.

Sandstorm looks insecure and inefficient, but that probably won't matter.

The ease of use for the end user trumps all. Users will love this, but I'm not looking forward to having to administer boxes running Sandstorm.

I imagine there will be a fair bit of work from my end to re-building apps so they can take advantage of tuned settings, shared services, caches, and the like. Plus figuring out a way to automate the usual securing, managing, monitoring, and cleanup around the Sandstorm environment.

Identifying bottlenecks will be fun too, though my first instinct will probably be to look at the Cap'n'Protocol bridge which Sandstorm runs everything through.

kentonv · on Jan 30, 2015

> Sandstorm looks insecure

As the lead developer, I emphatically disagree with this. :)

If you'd like to state why you think it's insecure, I'd love to hear it, but security is incredibly important to us and something we've put lots of effort into. I don't deny that there may be bugs (it's an alpha), but by design Sandstorm is a highly secure way to run other people's code.

> and inefficient

While it's true that running lots of small per-user (or per-document) instances of apps is necessarily less efficient than running one large multi-tenant server, it's not nearly as bad as it sounds. Instances of the same app share their code and assets (read-only) and are aggressively shut down when not in use, which makes up for the vast majority of the inefficiency. Meanwhile, infrastructure continues to get cheaper...

falcolas · on Jan 30, 2015

> If you'd like to state why you think it's insecure, I'd love to hear it

The long-term security of Linux containers has not been well explored yet. There have been exploits against the Kernel found, and there are likely to be more.

Plus, how much effort has gone into hardening the Cap'n'Protocol bridge? Do you have a security expert reviewing the code and looking for vulnerabilities? If so, great! I take that part back wholeheartedly.

I appreciate that you've worked past the Docker failing of not signing and validating files; this is a huge step in the right direction.

> Instances of the same app share their code and assets (read-only)

But not their in-memory caches. Code and shared make up a fraction of their presence in memory.

> infrastructure continues to get cheaper

This has always sounded like a lazy cop-out to me. Yes, infrastructure is getting cheaper, but our applications are getting bloated at the same rate. And if we're running potentially dozens of PostgreSQL instances on a single machine, your infrastructure costs to make all of the apps performant are not going to be cheap.

kentonv · on Jan 30, 2015

> The long-term security of Linux containers has not been well explored yet. There have been exploits against the Kernel found, and there are likely to be more.

Good answer, but I counter with this:

https://blog.sandstorm.io/news/2014-08-13-sandbox-security.h...

> Plus, how much effort has gone into hardening the Cap'n'Protocol bridge? Do you have a security expert reviewing the code and looking for vulnerabilities?

Among our advisors are Mark Miller and Jas Nagra, both members of the Google security team (though advising us in their free time, not on behalf of Google). Cap'n Proto is based heavily on Mark Miller's previous work in capability-based security.

Also among our advisors is Andy Lutomirski, a kernel developer who specializes in security and sandboxing. He has been cranking out CVEs against the kernel lately. Most of them haven't affected Sandstorm, due largely to our seccomp filter which Andy himself wrote and continues to work on (see link above).

My own background is diverse but includes a few years working on security at Google.

That said, we have not yet commissioned a thorough security review of Cap'n Proto's own implementation. That is something we plan to do before any 1.0 release (of Cap'n Proto or Sandstorm).

> But not their in-memory caches. Code and shared make up a fraction of their presence in memory.

Depends. If the app is written in C++ or Rust, then the code is mmap'd in (with that memory being shared across instances). The runtime memory overhead tends to be very low if the app is single-user.

For apps written in dynamic languages that parse their code at startup, yes, memory usage is a lot larger -- as a rule of thumb, many apps use around 100MB. One idea we have for fixing this is to checkpoint an app at the point when it first tries to read its per-instance data and restore from that checkpoint on future runs. This checkpoint could theoretically be shared between all instances and mmap'd copy-on-write.

That said, we don't feel this trick is immediately needed. For our upcoming managed hosting service, we've run the numbers and are confident that the vast majority of users will not come anywhere near hitting the resource limits we've set even for the "standard" service level, and we aren't losing money if they do.

> And if we're running potentially dozens of PostgreSQL instances on a single machine

For the scale of a Sandstorm app, it makes tons of sense to switch to sqlite, which mostly solves this problem. :)
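As a sketch of what that looks like for a single-user instance: the database is just a file in the instance's own data directory, so there is no server process to keep resident. The table, file name, and helper below are made up for illustration.

```python
import sqlite3
import tempfile
from pathlib import Path


def open_instance_db(data_dir: Path) -> sqlite3.Connection:
    """Open (or create) the per-instance database file inside the instance's data dir."""
    data_dir.mkdir(parents=True, exist_ok=True)
    conn = sqlite3.connect(data_dir / "app.sqlite3")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS notes (id INTEGER PRIMARY KEY, body TEXT NOT NULL)"
    )
    conn.commit()
    return conn


# Demo in a throwaway directory standing in for one app instance's storage.
with tempfile.TemporaryDirectory() as tmp:
    conn = open_instance_db(Path(tmp) / "instance-1")
    conn.execute("INSERT INTO notes (body) VALUES (?)", ("hello",))
    conn.commit()
    rows = conn.execute("SELECT body FROM notes").fetchall()
    conn.close()
```

When the instance is shut down, nothing stays in memory; when it starts again, it reopens the same file.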

falcolas · on Jan 30, 2015

> [ security reassurances ]

That's pretty freaking awesome - thanks for taking the time to point all this out. Might I request that you make some of this information more prominent on your site?

> [memory]

You're still talking about program code memory, not the allocated stacks and heaps. The heaps are the important part to me, because they represent db buffer pools, Redis queues, and cached responses - data which will be duplicated if multiple instances of the same command are run.

> For the scale of a Sandstorm app, it makes tons of sense to switch to sqlite, which mostly solves this problem. :)

Which unfortunately references back to my comment about re-writing apps which come in, in an effort to increase performance.

kentonv · on Jan 30, 2015

> Might I request that you make some of this information more prominent on your site?

Yes, we should do that. (Tricky, though -- there's so much information we want people to know, but most people will only read two lines. :) )

> Which unfortunately references back to my comment about re-writing apps which come in, in an effort to increase performance.

We've found that a lot of SQL-based apps support sqlite already. For those that don't, adding support may be some work but it's not a rewrite.

For Mongo-based apps, we actually have a patched version of Mongo that reduces the resource usage pretty well. (Basically we just reduced all their hard-coded "pre-allocate at least this much space" constants.) At some point we'll try to do the same for some SQL database...

audreyt · on Jan 30, 2015

> For the scale of a Sandstorm app, it makes tons of sense to switch to sqlite, which mostly solves this problem. :)

Case in point: EtherCalc, which usually runs with Redis storage, deliberately uses the fallback "toy" JSON file storage with Sandstorm, which saves 1MB RAM per document instance and makes migration easier.

This works because there's only a few concurrent writers per document at most, instead of the multi-tenant scenario where there's thousands of concurrent writers at any given time.
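A minimal sketch of what per-document JSON file storage can look like. This is not EtherCalc's actual storage code; the function names and file layout are assumptions. The whole document is rewritten on each save, which is only reasonable because there are so few writers.

```python
import json
import os
import tempfile
from pathlib import Path


def save_document(path: Path, doc: dict) -> None:
    """Write the whole document atomically: temp file first, then rename over the old one."""
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        json.dump(doc, f)
    os.replace(tmp_name, path)  # readers see either the old file or the new one


def load_document(path: Path) -> dict:
    """Return the stored document, or an empty one if nothing was saved yet."""
    if not path.exists():
        return {}
    with path.open(encoding="utf-8") as f:
        return json.load(f)


# Demo in a throwaway directory standing in for one document instance.
with tempfile.TemporaryDirectory() as tmp:
    doc_path = Path(tmp) / "sheet.json"
    empty = load_document(doc_path)
    save_document(doc_path, {"A1": "42"})
    loaded = load_document(doc_path)
```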

jerf · on Jan 30, 2015

"Sandstorm looks insecure and inefficient, but that probably won't matter."

You may need to expand on that. It is not clear that you know what they're doing with sandboxing, etc. If you do, then I'm definitely interested in your further criticism; if this was a knee-jerk response, I think it's unjustified.

"I imagine there will be a fair bit of work from my end to re-building apps so they can take advantage of tuned settings, shared services, caches, and the like."

Are you building apps that you expect people to deploy to their personal servers on a routine basis? For normal DevOps folks working inside of a corporation, Sandstorm is a complete non-event. It's not targeted at any part of your pipeline. The fact you're asking about bottlenecks makes me wonder a bit if you understand where this is targeted, too; frankly Sandstorm could slow everything it runs down by a factor of 10 and I wouldn't notice. My VPS that I would run sandstorm on is 99.9% idle anyhow, if not 99.99%.

falcolas · on Jan 30, 2015

I addressed the security concern in a response to kentonv, please feel free to comment more on that one.

> Sandstorm is a complete non-event.

I am speaking from the point of view of someone who may be asked to run and manage Sandstorm servers. Either as a service to sell, or as a service for internal customers. This is a very different use case than using it for just myself.

And if I'm honest, this isn't something I'd need to or want to run for myself. I'm comfortable with managing shared services and propping up web frontends. So no, I am not the targeted user of this software; however, I am the one who gets to write tools and processes to support those targeted users at some point in the future.

kentonv · on Jan 30, 2015

You might also be interested to know that for large-scale users we're developing tools to manage Sandstorm clusters, with the goal of making your life really easy. :) The idea was introduced as part of this blog post:

https://blog.sandstorm.io/news/2015-01-15-sandstorm-1.3M-see...

ohyeshedid · on Jan 31, 2015

You really should follow what your fellows are saying. Your claim that this is a complete non-event for DevOps contradicts their claims of building out cluster management to help DevOps, or of the future target audience being IT departments.

"The fact you're asking about bottlenecks makes me wonder a bit if you understand where this is targeted, too"

If your own team can't seem to understand where this is targeted, how would anyone else?

kentonv · on Jan 31, 2015

It sounds like you might be assuming jerf is a Sandstorm core developer. Just to clarify, he is not. (No offense, jerf.)

FWIW, the Sandstorm team is: me, qiqing (Jade), jparyani (Jason), dwrensha (David), paulproteus (Asheesh).

ohyeshedid · on Jan 31, 2015

Thank you for clarifying, it definitely helps clear up some of the confusion.

relaxitup · on Feb 1, 2015

kentonv, any chance of getting a nice extended (eg, support for more than just text; file formats like images etc as well) (or even just regular) pastebin app added to Sandstorm? Perhaps something like bepasty ( https://github.com/bepasty/bepasty-server ) or Hosty ( https://bitbucket.org/xrstf/hosty ).

kentonv · on Feb 2, 2015

Absolutely, if you port it! ;) https://github.com/sandstorm-io/sandstorm/wiki/Porting-Guide

Or nominate it to our app committee, who votes on apps to port: https://sandstorm.io/vote

(The app committee is 32 people who purchased seats during our Indiegogo campaign.)

ocdtrekkie · on Feb 2, 2015

That sort of thing is definitely well in the scope of apps that'd be good to port to Sandstorm. There's a lot of basic apps the platform still needs.

Bepasty in particular sounds like a really neat potential app that can just provide "file containers".

mmccaff · on Jan 30, 2015

The web page is beautiful.

It might help to emphasize where this sits between installing Docker images from the repo, and using something like Webuzo or Softaculous to install popular web apps.

jmsdnns · on Jan 31, 2015

I really appreciate the humor in Cap'n Proto. That makes the dev team seem like a fun group to hang with.

jorjordandan · on Jan 30, 2015

The web site says apps can't perform psych experiments, and links to the contentious Facebook emotion split test. I'm pretty sure you'll be able to do split testing on these apps...

That small criticism aside, I'm very curious how developing for Sandstorm would be different from developing for a typical host. Anyone know the major differences?

kentonv · on Jan 30, 2015

Once our sandbox is complete, apps will not be able to "phone home" unless the user grants permission, so users will need to opt into any experiments.

Currently there are two reasons this isn't true yet:

* We haven't implemented client-side sandboxing yet. It's not incredibly hard (content-security-policy header, some tweaking of apps), but hasn't hit the top of the priority queue.

* The server sandbox currently has some intentionally-poked holes allowing apps to do things like pull RSS feeds from the internet. We plan to close these once the Powerbox UI is implemented, which is the main permission-granting interface, but that's a major project and we wanted to get some useful apps working in the meantime.

tomjen3 · on Jan 31, 2015

Couldn't you do an AB test by having the code choose a random number on install (or first write to the datastore) and have it leak the results by loading an image or iframe with a special url/query parameter?

kentonv · on Jan 31, 2015

With Content-Security-Policy we can prohibit an app from using images or other assets from third-party servers.
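
As a rough sketch of the header approach (not Sandstorm's actual implementation; the policy contents and the helper names `build_csp_header` and `is_image_allowed` are assumptions for illustration), a server could attach a policy that limits every asset type to the app's own origin. The toy checker below only models the `'self'` source for `img-src`/`default-src`, which is enough to show why a tracking-pixel URL on a third-party host would be refused:

```python
from urllib.parse import urlsplit

# Hypothetical policy: every asset must come from the app's own origin.
CSP_DIRECTIVES = {
    "default-src": ["'self'"],
    "img-src": ["'self'"],
    "frame-src": ["'self'"],
}


def build_csp_header(directives):
    """Serialize the directives into a (name, value) HTTP header pair."""
    value = "; ".join(
        f"{name} {' '.join(sources)}" for name, sources in directives.items()
    )
    return ("Content-Security-Policy", value)


def origin(url):
    """Return the (scheme, host, port) triple that defines a URL's origin."""
    parts = urlsplit(url)
    return (parts.scheme, parts.hostname, parts.port)


def is_image_allowed(directives, page_url, image_url):
    """Toy model of a browser's img-src check; it only understands 'self'."""
    sources = directives.get("img-src", directives.get("default-src", []))
    return "'self'" in sources and origin(page_url) == origin(image_url)
```

Under this policy, an image such as `https://tracker.example.net/pixel.gif?variant=B` requested from a page on `https://app.example.com` fails the check, while `https://app.example.com/logo.png` passes. A real browser's matching rules are considerably richer than this model.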

rdrey · on Jan 30, 2015

Sure, you could write an app that does A/B testing, but the whole idea is that people will use Sandstorm to run _personal_ servers/apps. And A/B testing your own reaction to an app doesn't make much sense.

Runtime permissions of these apps will be easy to control, so the Sandstorm platform will prevent apps from phoning home without your permission.

jorjordandan · on Jan 30, 2015

Again thanks! But using Facebook was still a poorly chosen example of what their apps won't do, since they aren't aiming for hosting large customer facing apps like a social network. Perhaps that is what contributed to my confusion in the first place. Now I have a better understanding of what it's for.

swsieber · on Jan 30, 2015

Ah, but you could write an A/B testing page and then share it easily with your friends - part of the value proposition is 1) hosting your personal apps and 2) fine-grained sharing of it with others. So it wouldn't be a huge sampling, just your friends.

pmontra · on Jan 30, 2015

I see that many old and new web applications run inside Sandstorm, so it's a framework to manage apps. They probably have to be adapted a little but I don't think that WordPress has been rewritten to fit into Sandstorm. Anything will do, probably.

Furthermore you can download Sandstorm and install it on your server.

swsieber · on Jan 30, 2015

One major thing that I can think of off the top of my head is that they provide the login and authentication for you and you just plug into it. They also have a sharing system in place that I think you can easily plug into.

nodejsisbest · on Jan 31, 2015

Darude Sandstorm is going to conflict with this project's name a bit?

dominotw · on Jan 30, 2015

The logo is very cute. I love it.

qiqing · on Jan 30, 2015

Thanks! h/t to Néna Nguyễn :)

alinspired · on Jan 30, 2015

At first glance it's similar to cPanel or Plesk, but open source. Will take a look!

daemonk · on Jan 30, 2015

Is this like a cloud docker?

qiqing · on Jan 30, 2015

We use the same Linux kernel features that Docker uses, but we use them directly. Here's a deeper explanation:

https://blog.sandstorm.io/news/2014-08-19-why-not-run-docker...

netsurfer912 · on Feb 1, 2015

Darude - Sandstorm

(sorry)

shric · on Jan 31, 2015

What's the name of the song?

iagooar · on Jan 31, 2015

I thought that too, but was afraid to post it fearing the negative votes ;)

gregimba · on Jan 31, 2015

This isn't really anything to write home about.