Scientific Breakthrough Lets SnappyCam App Take 20 Full-Res Photos Per Second (techcrunch.com)
460 points by Osiris on Aug 1, 2013 | 253 comments



DCT is already lossy [1], so the statements around 8 megapixels are completely pointless, and worst of all, it's 1990s lossy technology. Wavelet transformations completely destroy any DCT.

That said, if their emphasis is on producing pictures with minimal time delta at highest resolution, algorithms used for still pictures are out of place. Video compression algorithms still use DCT and wavelets, but they do so only after they have reduced redundancies between series of pictures, a process that tends to work significantly better than anything you can get out of these lossy transformations when you want to preserve quality.

Of course, eliminating redundancy in a series of pictures might have tipped them off to the fact that the image sensor isn't actually producing fresh pictures at the rate they want.

1: as used in JPEG. The transformation itself is perfectly invertible, assuming infinite precision arithmetic.


You are right on the loss: it's purposefully introduced as a quantization step after performing the DCT, and before losslessly compressing the resulting coefficients with Huffman coding and writing out the final JPEG bitstream.
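If it helps make that concrete, here's a minimal sketch (not SnappyCam's actual code) of that quantize-and-round step for one 8x8 block; qtable stands in for whichever 64-entry table the quality setting selects:

    #include <math.h>
    #include <stdint.h>

    /* Quantize one 8x8 block of DCT coefficients.
     * coeffs: output of the forward DCT for this block.
     * qtable: 64-entry quantization table chosen by the quality setting.
     * This rounding is where JPEG's loss is actually introduced; the
     * zig-zag scan and Huffman coding that follow are lossless. */
    static void quantize_block(const float coeffs[64], const uint16_t qtable[64],
                               int16_t out[64])
    {
        for (int i = 0; i < 64; i++)
            out[i] = (int16_t)lrintf(coeffs[i] / qtable[i]);
    }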

Despite all of that, JPEG has now become computationally tractable. I remember the days when it took tens of seconds to encode a JPEG on a commodity machine. Now, with the help of SIMD, we can encode a high-quality image in milliseconds on a mobile device.

Fortunately you can choose the quantization matrix that determines the amount of loss. Even if you were to choose an all-ones matrix, no human, not even Superman with his laser eyes, can "detect" the quantization noise.

For SnappyCam, I chose to invest in JPEG a little more because it's a ubiquitous standard for still image compression.... and with the right hardware and algorithms, quite tractable.

I'll consider adding a JPEG "quality setting" so you can choose the amount of loss introduced... sounds like a great idea to me.

The idea behind SnappyCam was also to code each picture independently, and not rely on motion prediction or video codecs. If you try and pull a single frame from an HD video you might be disappointed: they compress the YUV dynamic range (studio swing) and it looks washed out, even if you land on an I-frame.

Lastly, as far as I can tell, the image sensor is yielding complete scans with each frame. I'd hazard a guess that any motion prediction or frame deltas might actually slow the whole chain down.


Man, I hope I'll know as much as you do one day... Congrats on the app! I just bought it and I'm really liking it! One suggestion: would outputting to a GIF file be too difficult? I could really see people using this to create GIFs of their lives that they could share on Tumblr or Facebook.


Thanks! :-) To be honest, none of my formal training prepared me for what went into SnappyCam---only the "how" to go about learning to do so. I'm sure you could pick up a few tricks in the same way by working on a cool pet project or two.

With so many recent requests for AGIF, I'm absolutely going to add it to the app. (It's been on my list for a while, but lower priority than getting up a solid core product and the social sharing that is in development at the moment.)


Out of curiosity, what kind of educational background do you have, if you don't mind the question? As a high school dropout it interests me. :)


I'm Aussie, so I did my schooling in Melbourne, Australia.

I don't recommend the same path for everyone, but my ugrad was at RMIT University, a dual bachelor's in EE and CS. I then went on to do a PhD in EE at the University of Melbourne. My dissertation was on mathematical optimization of wireless and wireline DSL. (Prof Jamie Evans is an awesome guy if you're on the lookout for an advisor!)

I've been in SFO for just over 5.5 years, and started SnappyLabs after winning the "greencard lottery".

SnappyCam, you might say, is the embodiment of both a very practical ugrad and a somewhat applied but very theoretical pgrad.


Congrats fellow Australian. Two of us here in an office in Adelaide just bought the app.

Guy behind me said "Hey, this is pretty cool. Have you seen... oh, you're looking at it already."


Hi, what is Adelaide like to work in? I assume you guys are in IT? From what I hear, it is mostly government work.


I run a two-person web development business and have worked out of Adelaide for myself for 15 years. Always seems to be enough work around and a decent amount of variety.

The other SnappyCam purchaser I mentioned is an iOS/Android developer who is a sub-tenant in my office. We actually have a cheap, spare desk in this room at the moment if you wanted to visit and work for a while. Email me if you want to ask any questions.


Adelaide is good. Plenty of gov & defence work if that is what you are after, but I've worked here nearly 15 years and never done either (ok, my current place is quasi-government, but still..)


I think all us Australians are coming out right now to congratulate him, haha. /Brisbane


Onya mate! That's bloody awesome. :D


Impressive stuff! And just in time to capture some action shots of my puppy :-) -Appreciative fellow Australian, in Switzerland :)


Oi oi oi! :D


high five

Another fellow Australian and Melburnian.

Well done on the app John!


Hah, I told you getting your PhD was a waste of time ;-)


LOL! Thanks @danpat. :-) How are you going these days? I heard you might be moving to the Big Apple?


Motion estimation/prediction is used in video coding because it minimises the compressed size. However, it is incredibly expensive to perform the motion search. A typical video encoder spends well over half its CPU time in this stage. After motion estimation, the residual image is still encoded in the usual way, so speed-wise the motion search is pure overhead.
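For a sense of where that time goes, a bare-bones full-search block matcher (purely illustrative, not any particular encoder) looks like this; real encoders use smarter diamond/hexagon searches, but the SAD inner loop still dominates:

    #include <limits.h>
    #include <stdint.h>
    #include <stdlib.h>

    /* Sum of absolute differences between a 16x16 block in the current
     * frame and a candidate block in the reference frame. */
    static unsigned sad16(const uint8_t *cur, const uint8_t *ref, int stride)
    {
        unsigned sad = 0;
        for (int y = 0; y < 16; y++)
            for (int x = 0; x < 16; x++)
                sad += abs(cur[y * stride + x] - ref[y * stride + x]);
        return sad;
    }

    /* Exhaustive +/-range search around the co-located block; 'ref' must
     * have a valid margin of at least 'range' pixels around it. */
    static void full_search(const uint8_t *cur, const uint8_t *ref, int stride,
                            int range, int *best_dx, int *best_dy)
    {
        unsigned best = UINT_MAX;
        for (int dy = -range; dy <= range; dy++)
            for (int dx = -range; dx <= range; dx++) {
                unsigned s = sad16(cur, ref + dy * stride + dx, stride);
                if (s < best) { best = s; *best_dx = dx; *best_dy = dy; }
            }
    }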


One thing I've gotten very into recently is multishot techniques, where I take multiple shots in burst mode, align them, and then average them to reduce sensor noise. It's similar to what http://www.photoacute.com/ does, though they do more advanced superresolution stuff that your invisible noise might preclude. Even a simple average or median in areas where there isn't too much motion improves the quality quite dramatically in some cases, particularly in low-light situations. If your frame rate is that high it probably wouldn't be hard to get a really good alignment between frames, so I thought I'd bring this up as a thought for a future feature (as in: you select the frame you like, then the program grabs a few frames immediately before and after and uses them to increase the image quality of the final output).
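The averaging step itself is nearly free once the frames are registered; a toy sketch (assuming n already-aligned 8-bit frames of the same size) is just a rounded per-pixel mean:

    #include <stddef.h>
    #include <stdint.h>

    /* Average n aligned 8-bit frames; noise drops roughly as sqrt(n). */
    static void average_frames(const uint8_t *const *frames, int n,
                               size_t npixels, uint8_t *out)
    {
        for (size_t i = 0; i < npixels; i++) {
            unsigned sum = 0;
            for (int f = 0; f < n; f++)
                sum += frames[f][i];
            out[i] = (uint8_t)((sum + n / 2) / n);   /* rounded mean */
        }
    }

The hard part is the alignment and the motion masking, not the averaging.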


SIMD, you say? Are you relying mainly on NEON optimizations or are you also doing encoding stuff on the GPU? Very impressive performance I must say!


Thanks! :-)

I first tried the GPU, using old-school GPGPU textures and OpenGL ES 2.0 shaders, but unfortunately the performance wasn't there, for a variety of reasons given in [1].

SnappyCam has since been making extensive use of ARM NEON for the JPEG codec and a bunch of other image signal processing operations, like digital zoom. It's a great instruction set!

[1] http://www.snappylabs.com/blog/snappycam/2013/07/31/iphone-k...


Just curious, I know nothing about low-level ARM stuff. I was wondering: is this iPhone/Apple-specific tech, or is the work you've done portable to other mobile platforms? Congrats on what you've done. I couldn't quite work out whether you've optimized the hell out of the standard DCT algorithms or whether you've come up with new algorithms. If it's the latter, would you be able to publish them or would that give away too much secret sauce? ;-)


No, NEON is an ARM-specific tech (not Apple-specific), and is widely available and used e.g. on Android smartphones.

It's like the MMX / SSE of the x86 world: a set of extra instructions to process many small integers in parallel in one instruction. Since image data is usually independent 3-byte pixels (or 3 planes of 1-byte subpixels, one per color channel), NEON is great for many image-processing tasks.

See e.g. https://en.wikipedia.org/wiki/ARM_architecture#Advanced_SIMD...
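As a toy illustration of the style (not SnappyCam's code), here's a NEON intrinsics loop that brightens an 8-bit channel 16 subpixels per instruction, with saturation:

    #include <arm_neon.h>
    #include <stddef.h>
    #include <stdint.h>

    /* Add 'delta' to each byte, 16 at a time; vqaddq_u8 saturates at 255. */
    void brighten(uint8_t *px, size_t n, uint8_t delta)
    {
        uint8x16_t d = vdupq_n_u8(delta);
        size_t i = 0;
        for (; i + 16 <= n; i += 16)
            vst1q_u8(px + i, vqaddq_u8(vld1q_u8(px + i), d));
        for (; i < n; i++) {                       /* scalar tail */
            unsigned s = px[i] + delta;
            px[i] = s > 255 ? 255 : (uint8_t)s;
        }
    }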


Re video's "studio swing" dynamic range, the YUV components do have a different encoding range to those in JPEG, but if you expand them back out to 0-255 the image is in fact the same - you lose a lil' fraction of your bit-depth but no dynamic range.

I think you definitely made the right choice though - it's interesting that the obvious delta-coding and motion compensation tricks to reduce bandwidth are rarely used for video acquisition apart from the most limited devices like phones, stills cameras and the GoPro. Everything that can afford to uses per-frame coding like ProRes, REDCODE, AVC-Intra, DNxHD, Cineform... being able to seek quickly is important!

In fact Canon's 1DC 4k camera uses (dun dun duhhhhh...) motion JPEG :)


It's just bizarre that you would be doing the complete JPEG process at the instant you get the image from the sensor. As you note, there is a plethora of steps that JPEG performs, from color space conversion, to the DCT (essentially a gigantic matrix multiplication), quantization, entropy coding (Huffman or arithmetic), and encoding as a JPEG bitstream.

The only reason would be that you are pressed for memory or bandwidth, but certainly you have the resources to store one full frame and produce deltas, or just apply part of the JPEG chain, enough to remedy memory pressure. You can always encode it to an actual JPEG after the process.

And yes, pulling single frames from a completely encoded video isn't helpful, because they can get away with more compression. But there are very sophisticated algorithms for eliminating the redundancy between frames, which would have been my first avenue in attempting to do something like this.


> which would have been my first avenue in attempting to do something like this.

Have you attempted to do something like this? Because he not only has attempted, he's done it. Therefore I think you should stop talking down to him ("completely pointless", "it's just bizarre", "my first avenue"). It comes across as wanting to prove how smart you are instead of seeking to learn from someone who has done incredible work and—lucky for us—is bursting at the seams with enthusiasm to share it.

Oh and congratulations jpap on what's looking like the most successful and technically solidest HN launch in quite some time! I hope your hard work pays off.


Thank you for saying what I'm sure a lot of us were thinking.


wild applause


This whole thread should be framed and hung in the HN lobby.


Oh, I actually do both---see the other thread on the topic.

I buffer as much as possible, while also encoding on any other cores that are available.

There's a reason why a "simple camera app" can total some 80 kLOC. :)


Why is this comment getting downvoted? I see that it adds insight to the subject and makes some good points, which OP even acknowledges. The written complaint is that revelation is not being deferential enough, which is bullshit in an in-depth technical discussion.


There's a difference between "not being deferential" (which I don't think the complaint is) and talking down to someone who's clearly built something very cool. Put another way, there's a difference between dismissing someone's approach and asking questions to try to understand it better.


"Deferential" has nothing to do with it. In a technical discussion, the focus should be purely on the content. Inserting one's self into it (such as by being supercilious) detracts from that.


> Wavelet transformations completely destroy any DCT.

It's not quite that simple. They have different strengths and weaknesses, so you can't say one is categorically better than the other. DCT has uniform frequency resolution, which sounds desirable but isn't once you start quantizing, due to ringing, etc. See slide 25 or so of [1]. Wavelets overcome this problem by adapting the resolution in opposite ways at the extremes. This works fantastically at medium signal rates, but can have severe low-passing at low rates. For low-rate coding, DCT with the bells and whistles (deblocking, etc.) will typically win.

[1]: http://people.xiph.org/~tterribe/pubs/lca2012/auckland/intro...



Wow, I'm usually pretty skeptical about these new niche social networks, but I've gotta say that is really really cool.

Great work on the "living photo" idea -- that kind of adds this magical sort of feeling to it, which, I've gotta say, I felt.

I hope your hard work pays off. You've built something special here :)


Thanks mate---comments like yours make my day. :-)

I'm not actually planning on creating a new niche social network. I think, given the vastness of some of the ones that've emerged of late, it would be more fruitful to leverage the ones that exist today.

Can't wait for you to see it!


Given that niche social/mobile networks for short videos are so hot right now (ie. Vine, Instagram Video), it might be more fruitful for you to plan on creating a new one.


+1 I think this could actually take off as a niche social/mobile network. (Despite how ridiculous that sentence sounds.)


The Cinemagram guys started off down that path, an Instagram clone for animated GIF cinemagraphs, then quickly pivoted toward short videos to compete with Vine and Instagram on similar territory.

I'm still not convinced. :-)


Get convinced. These could be a lot of fun. You just need some help with the design/interface, but the tech is great.


Yeah, I think there is a niche for this. One of the main criticisms of Vine is that there's a certain magic to pictures compared with video, which comes off as more real. We don't want to save the reality of our lives, just the filtered moments.

I really think this could fill a space in between the perfect single shot and the realness of video.


Glad to hear it. I feel the same on the format: it's closer to a photo on the left of the spectrum between photo and video, where Vine/Instagram/Cinemagram are on the far right.

You've put it very nicely--there's definitely some magic about a "silent moving picture".

It requires and sparks the imagination in the viewer, evoking emotion perhaps as easily as a carefully crafted cinematic short.


It's also a photo with more context. If the presentation allowed the owner to pick the starting point, and users to slide forward and backwards, commenters could suggest other funny/interesting frames.

The work required to set up a basic social network around this would surely be insignificant when compared with what you've done to date! Yell out if you need help on the design side of things.


It honestly looks like the photos from the Harry Potter movies. Love it! Great work!


Nice find! It's going to form part of the Facebook Timeline integration that's coming soon... as well as embeds, though Josh already linked to the "press samples" page in the article. :-)


Just for your info, the top link actually crashed my Windows Phone 8's Internet Explorer. Clearly that's an IE bug and nothing else, but I thought you should know :-)

Keep up the good work!


That's unfortunate. I suspect there's a JS memory leak in the viewer, as it will cause a crash on iOS Safari after prolonged use. It's on the list.

Thanks for letting me know!


Do these work for you? When I load the site, I see the photo scrub on the right hand side of the screen - and the slider moves up and down...but I don't see the big image loaded in the center of the screen where I expect the image to be.

Unless it takes a while to load - in which case I was just being uber impatient.


I saw this a couple of days ago. Are you using Chrome?

It might be yet another Chrome canvas bug. :-(

Try Safari and let me know if you can? :-)


Does not work in FF 22.0, nor in Chrome 28.0.1500.71 or Opera. (OS: Linux Mint)


Ouch. Looks like some work for me ahead.

The problem with Chrome 28.0.1500.x (.95 here) is troubling me. It seems to be a more recent problem that I'm convinced is another browser bug.

Thanks for the detailed version report, that's really going to help. :-)


Hm. Works on Chrome 30.0.1582.0 (Canary) - after repeated page refreshes. Was stuck on 98% several times. Disabling the cache doesn't allow me to repro it, emptying the cache doesn't either.

Works with 28.0.1500.95, too.

However, looking at the console, I see occasional instances of Resource interpreted as <blah> but transferred as MIME type <foo>. Not a big deal, but maybe a pointer.


Works fine in Firefox, FWIW.


OMG, it works on IE 10! Congrats!

But not working on Chrome or FF :|


What version(s) are you on? I spent a lot of time checking in more recent versions of FF after developing the site.

I do pretty much everything on OS X these days, but I did fire up a couple of VMs with Win XP, Win 7, and Win 8 to test.

I did my entire dev on Chrome, and apart from what appears to be a new canvas issue at present, I'm not having any major issues. (I'm looking into the canvas problem.)


It also doesn't work for me in Chrome on Win8. It does work in IE10, though.


I shudder to think that IE won over Chrome. :(

I saw an issue also reported on here earlier today, and a few days ago. Sounds to me like a (new) canvas bug in Chrome.

There are unfortunately several workarounds for browser bugs in the HTML5 viewer. The AS3 Flash port was surprisingly very solid. I can't wait to share more info on that when it's time for another major release.


Chrome on Mac here. I was also worried it might not work, as after 100% it still took a long time (didn't measure, but maybe 10 seconds?) to actually show anything. Almost closed the tab; lucky I did not.

Cool app + site, congrats! Would love to use this to analyze my disc golf throws, and share with my fellow disc golfers.


Chrome 28 on Win8 here - works great, thank you for making it! Even though I personally can't use it since I'm on Android.


Impressive job! But I confirm the webapp does not work for me either, it stays blocked at 0%. I'm using FF (Nightly) 25.0 on Ubuntu 13.04 x86_64.


This is neat tech and works pretty much as advertised, but man, this UI is pretty rough. The blue background and curvy borders are strangely superfluous; tapping the left-bottom corner controls pops up an intermediate selector but the right-bottom controls work in-place; taking a shot produces a big "infinity" symbol that fades in and out of view -- I don't know what it means.

Good work on tech, please hire a UX specialist :-)


Fair comments, and much appreciated.

I did all of the graphics design myself, in the app and on the web. :-)

The infinity sign you see does require an explanation. I'll take your advice and think about how it can be done more simply.

It's basically telling you, the user, that the capture buffer has filled, and you're now dropping (some) shots.


Have you thought about displaying a semi-transparent bar to show the buffer? Or maybe a one or 2 px white mark creeping up the side of the screen (turning to red as it gets towards the top)?

Just some thoughts. If you have a buffer, and I'm gonna get fubarr'd if I hit the limit, you should probably show me the buffer (not just a warning that it's too late).


Versions 1.x.x of SnappyCam had a linear buffer [1] but I felt it was distracting.

I generally can see the "end" of the circular buffer around the shutter button, so it doesn't seem to be an issue for me. Perhaps I tend to touch it on the lower-right instead of dead-center.

I made an effort to support lefties in the UI (see Advanced Settings), but the buffer doesn't spin the other way just yet. (To be honest, I've had to deprioritise that in favour of other features.)

Are you left handed?

[1] Yes, that's me jumping near the GG bridge. I'm quite good at it now, as you can imagine: http://a3.mzstatic.com/us/r1000/085/Purple/v4/c5/06/d5/c506d...


No, not left handed, but I didn't get the infinity visual cue (or the border) until you explained it here.

The red bar on the bottom would probably be fine if you could make it like 70% transparent until it gets toward the end, then vacillate it between 0% and 50% so it looks like it's flashing. Some visual indicator that I should be paying more attention to it.


I hadn't realized the line around the shutter button was supposed to change at all. That's definitely not where I'd put it.


Nice crotch shot jpap :-)


A red thing might look too much like a "Recording" dot, though.


I think it's fantastic that you've managed to turn a long, hard optimisation slog into a real product win. Add me to the list of Australians willing to buy you a beer - but not back home, I live in SF at the moment :)

I'm curious about the low-quality preview you get when scrolling through all the shots. Are you storing low-quality data separately or do you also have a fast, low-qual JPEG decoder? (Is the Huffman encoding between blocks independent?)


Hey Andrew, would love to catch up over a beer. :-) Drop me a note via email: jpap {at} snappylabs.com

You've got a good eye: as part of the JPEG image compression, I also generate a low-resolution thumbnail that's embedded into each file as Exif metadata (along with geotagging, and other camera settings that define the shot, like exposure).

They are used as a "first-in" placeholder for an image.

The full image is then downsampled and decompressed simultaneously [1], exploiting the fact that the (Retina) screen resolution is often much lower than the full JPEG resolution.

As soon as you start zooming, the image is decompressed yet again at the full resolution and replaced in-place as quickly as possible so hopefully you won't see it. :-)

[1] As outlined in http://jpegclub.org/djpeg/ the technique relies on the fact that the top-left NxN corner of an MxM block of DCT coefficients, N < M, can be inverted to form an NxN-pixel lower-resolution image of the original MxM block. When N is {1, 2, 4} a fast inverse DCT algorithm can be used with great success.

In fact, N == 1 is a trivial inversion and it might be tempting to use it as the low-resolution image instead of a thumbnail, but you still have to unpack all of the DCT coefficients to get to it, which can be expensive (Huffman).
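To put a number on the N == 1 case: with the standard JPEG DCT normalization, the dequantized DC coefficient is 8x the mean of the level-shifted samples, so the 1/8-scale pixel is simply (a sketch, ignoring rounding details):

    #include <stdint.h>

    /* One pixel of the 1/8-scale image from a block's dequantized DC term.
     * You still pay the Huffman decode of the whole block to reach the
     * next block's DC, which is the expensive part. */
    static uint8_t dc_to_pixel(int dc)
    {
        int v = dc / 8 + 128;            /* undo the 8x scale and -128 level shift */
        return (uint8_t)(v < 0 ? 0 : v > 255 ? 255 : v);
    }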


I have a feeling that soon SnappyLabs is going to have Apple knocking on their door with a very nice offer.

Kudos to them, sounds like they deserve it.


Thanks!

I just hope Apple's engineers don't get pissed off by the press. SnappyCam is built on their hardware, which can do remarkable things.

Though we as app developers don't get access to a lot of their smarts, e.g. hardware JPEG codecs, I'm sure there's even more innovation in their work that often goes unacknowledged.


You'd think so, but they take 2-10s to recover from a photo and you ... don't. Clearly there's some room for optimization there. :)


Nice to see Jpap continuing to push the boundaries of what's possible.

Aussie maths whiz supercharges net http://www.smh.com.au/articles/2007/11/05/1194117915862.html


@jpap, are the results shown in the article being used today?


You'd have to ask Ericsson. ;-) I certainly hope so!


I vaguely remember reading that they (the Ericsson patents) were included in the latest VDSL2 specs.

Could be rolled out as part of the NBN pending Australia's election result.


Thanks Raphael! :D

It's been an awesome, surreal experience, very much reminiscent of 2007. Fun times, relived. :-)


Wow, amazing performance tuning, so rare these days!

However, you should be careful with this online ARM simulator. It simulates Cortex-A8 while iPhone 5 runs on Apple Swift, two generations ahead. It very likely has different instruction timings compared to Cortex-A8. I didn't have a chance to test Swift, but here is a list of what might be different, judging by Qualcomm Krait and ARM Cortex-A15, which are in the same generation:

- Instead of the 2-cycle latency on Cortex-A8, simple ALU instructions might have 3-cycle latency on Swift (this is the case on Krait and Cortex-A15).

- Cortex-A8 can issue only one 64-bit SIMD multiplication per cycle; Swift can probably do a 128-bit VMUL.Ix each cycle (Krait does).

- Cortex-A8 can issue only one SIMD ALU instruction per cycle, Swift probably can do more (Cortex-A15 can issue 3 128-bit VADD/VAND/etc in 2 cycles).

- Cortex-A8 could issue one SIMD ALU + one SIMD LOAD/SHUFFLE per cycle, Swift could be less restrictive (and probably even can issue 3 NEON instructions per cycle, like Cortex-A15).


That's really cool, Marat. Thanks for the additional info on the A15 and Swift.

It's a lot of work to optimize the assembly code for each ARM variant, but I'm glad to know that Swift will generally run the same code at the same speed as, or faster than, the Cortex-A8.

The 3-cycle latency on simple ALU instructions is a bummer, but fortunately I use them sparingly for computation as compared to NEON. (They're great for pointer arithmetic and computing image row strides.)

The multiple issue of an ALU + LOAD is awesome. That would definitely help some of my routines.


The 3-cycle latency refers to simple NEON ALU instructions (VADD.Ix, VORR, VAND, etc). Scalar ALU instructions are still single-cycle. Note that these numbers are from Cortex-A15 and Krait which are expected to be similar to Swift, but I didn't measure Swift itself to know for sure.


This looks fantastic. Watching people's reactions in that example image was really interesting, and it occupied me for a good few minutes. "Why can't you do the same thing with video?" Because rewinding video is really painful, especially online video.

Criticism:

I use my thinkpad's pointer stick to move the mouse cursor. It's impossible to keep the cursor inside the "control strip" while moving it up and down and also looking away from the strip (and at the image). Too much accidental x motion is introduced.

It would be better for me if you were to enable the scroll wheel (which I can simulate on my pointer) as an alternative time control, or perhaps let me click on the control strip and then hold down mouse1 for as long as I want my y motion to control the position in time.


@gosu, despite what Josh wrote, you can traverse your pointer across any part of the living photo online. :-)

Love that you picked up on the expressions! It wasn't until I got the photos out of the app that I was fascinated by the same thing. I really can't wait to enable this functionality for everyone soon. :-)

More elaborate mouse movements are possible, but only in HTML5 full-screen mode, which is required to "capture" the mouse (think of a game).

The problem with that, too, is that an instruction or a tutorial is required. (I'd try to make things as intuitive as possible, despite the failure in the other thread re: UX and the infinite shutter.)


facepalm

What was happening is that I was trying to keep the mouse in the control strip, and it would go off the right side of the image.

Thanks a lot, Josh.

Edit: By the way, the fullscreen functionality isn't launching. But I do have a weird browser (conkeror on xulrunner 22.0).


haha, no worries.

In the app, you need to start with your finger near the thumbnail strip. (But you can move it away for fine-grained scrubbing if you wish.)

It's no surprise that the learned behavior is transferring to the web viewer.


jpap, I don't quite fully understand the implementation (though I'd love to one day be proficient enough to). But maybe you can explain how the format compares to motion JPEG. Or maybe it's very similar? About 15 years ago I dabbled in live video recording on old Pentium II hardware with an old BT878 video input card. Motion JPEG was the only feasible option to obtain relatively high quality (for the time) results albeit at the cost of disk space.


There are a lot of similarities actually.

In SnappyCam, each photo is compressed to a separate JPEG file. There's no inter-frame compression, no motion vectors, etc. The same as mJPEG.

The main differences are:

* Each photo is stored in an individual file. This makes seeking through the living photo blindingly fast. (I guess you could do this with motion JPEG by utilizing an index.)

* Each photo also has full metadata. Try rotating the camera as you shoot. It will follow you. :-) Same goes for the geo-tagging: included are a bunch of timings that aren't normally included, so you can know the "precise" usec when you took the photo.

* Each photo has its own thumbnail. That allows me to cheat a little bit in the photo viewer: you will see a flash from blurry to clear as you scroll around.

(There are more cheats in the viewer for decoding and downsampling at the same time before you zoom, to make the photo load faster as well. One of the handful of reasons why I rolled my own decoder as well.)


This is amazing work. Could you explain why you decided to go with many individual stills rather than filling in the gaps in a video codec? It's a really counterintuitive approach.


Good question!

Several reasons:

* Video codecs are much more complex.

* Random access seek is a lot slower, unless you're using all I-frames. (That's now a codec option on iOS, but not when I started.)

* "Studio swing" reduces the dynamic range of the YCbCr components so the quality suffers.

* Each frame lacks its own thumbnail, unless you maintain an adjunct "thumbnail video".

* Each frame might(?) not be able to have attached separate metadata, like geotagging, sensor settings at time of capture, etc.

* Deleting one frame causes a "hole" and headache.

* Standards compliant JPEG means export is super easy.

* Anything above full HD video is difficult to deal with in 3rd party software.


Wow, I never thought I'd see a software optimization be talked about in such breathless amazement.


"discrete cosine transform JPG science"

Here's a more interesting link directly from the app developers: http://www.snappylabs.com/blog/snappycam/2013/07/31/iphone-k...


Thanks, that's a much better article.

> extended some of that research to create a new algorithm

> 10,000 lines of hand-tuned assembly code

> optimized out pipeline bubbles using a cycle counter tool

Color me impressed. It sounds like they really pulled out all the stops.


The Android bashing in that article is unfounded: http://www.eggwall.com/2011/09/android-arm-assembly-calling-...


It's not Android bashing - he's managed to make the older, slower iPhone hardware perform better than the current high-performance kings (the S3 & S4) through smart software optimisation.

There's no reason he couldn't do the same on Android and see similar gains. It would just be a lot of work..


He only implemented his optimized software on one platform and somehow starts to compare it with different software on another platform. How is that relevant to that other platform's performance?


Umm.. it shows the power of software optimisation?


I assume you never went to an Apple conference.


Yeah, I'm kinda skeptical of the "science" here.

Edit: A new algorithm counts as science, but the TechCrunch article really gave no justification for the claim.


I've given a bit more background to the fast JPEG codec on my engineering blog: http://www.snappylabs.com/blog/snappycam/2013/07/31/iphone-k...

If you like signal processing, fixed point arithmetic, SIMD cores, and assembly, then this is for you. :-)


So the summary is "JPEG encoder written in assembly with NEON instructions saves images faster than Apple's encoder."

That's a cool feat and is a little damning for Accelerate.framework, although the way TechCrunch writes it I expected a new kind of fast cosine transform.


Don't forget that SnappyCam pumps both CPU cores when available.

The actual DCT algorithm created and used in the app is different to the typical AAN (Arai, Agui, Nakajima) DCT algorithm that's used in JPEG codecs, at least all the ones I've seen.

It's all about doing as little work as possible to achieve the end result. That's why there's so much asm implementation, with carefully chosen NEON instructions for each step.

Think of it as a cross-layer optimization between algorithm and implementation... done by hand. :-)


Really interested in the nuts and bolts - are you optimizing specifically for one quality setting (in which case I'm guessing you could probably do the quantization as part of the DCT and throw away some calculations)? I played with a realtime JPEG compression implementation back in college on transputers (yes, I'm that old). Fun stuff; nice to see there are still places where going right down to the metal can make a real impact on a product...


Oh that's awesome and a lot of fun!

While SnappyCam has been the most difficult, complex, piece of software I've written since I started coding in my early teens, it's also been one of the most satisfying technically.

I'd love to disclose the many, many optimizations baked in, but as this is a commercial app I must keep much of it as a trade secret.

I will say though that a lot of precomputation was involved, both for the encoder and decoder. I jumped at the chance to avoid computation, memory reads, etc., as much as possible. :-)


One of my colleagues at work (Bart Smaalders) is known for the saying (paraphrasing?): "The easiest way to go faster is to do less work."

Well done on realising something that seems obvious in retrospect, but most people still miss.


haha, very cool. Smart man! :D


I find it amazing how you share your know-how so freely. This is the first app I ever saw that made me think of an iPhone as a potentially desirable thing... not enough to make me get one, but a big compliment to you. Never change (unless it's for the even more generous and clever of course :P)


Having shared a few beers with John Papandriopoulos (at an AusCTW workshop), I can vouch that he is capable of doing great things in signal processing. He's a smart guy [1].

G'day from across the ditch John and I'm glad to see things are going well! (from John D. in Sydney)

[1] http://www.rmit.edu.au/browse/Current%20students%2FAdmin%20e...


Hey, thanks!

The Australian Communications Theory Workshop was such a long time ago--what great memories. :-)

What are you up to these days?


Turning another incarnation of CSIRO's wireless research into a product, and still dreaming of the Free Space Optics stuff. Happy to buy you a beer if you are passing though SYD! Keep well.


Oh wow, that sounds really cool! :D

Will definitely look you up when I'm down next. Trying to drop by more consistently during the summer these days. I spent five weeks in Melbourne last Jan and loved every minute of warmth. :)


How come it doesn't let me downvote you?


This looks similar to what Microsoft Research's BLINK [42] does on Windows Phone. Alas I wasn't able to find any publications on what they are doing (which is strange for MSR). As I don't have my phone currently I can't even look whether they are doing full resolution too or whether they are dropping down to smaller sizes.

[42] http://research.microsoft.com/en-us/um/redmond/projects/blin...


Any chance of this coming to Android soonish? This is seriously cool!


The fast JPEG codec was written for the ARM NEON SIMD coprocessor found in the iPhone. Most Android devices also sport the same architecture, so it is indeed possible.

The code for the codec is written in mixed C and assembly, so it can be "easily" ported to Android by making use of the JNI.
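For the curious, the JNI glue itself is thin; a hypothetical bridge (every name here is illustrative, not SnappyCam's actual API) might look like:

    #include <jni.h>
    #include <stdint.h>

    /* Hypothetical entry point into the native C/NEON codec. */
    extern int encode_jpeg_nv21(const uint8_t *nv21, int w, int h);

    /* Hand a camera frame (NV21 byte[]) from Java to the native encoder. */
    JNIEXPORT jint JNICALL
    Java_com_example_fastcam_Encoder_encodeFrame(JNIEnv *env, jobject thiz,
                                                 jbyteArray frame, jint w, jint h)
    {
        jbyte *data = (*env)->GetByteArrayElements(env, frame, NULL);
        jint bytes = encode_jpeg_nv21((const uint8_t *)data, w, h);
        (*env)->ReleaseByteArrayElements(env, frame, data, JNI_ABORT); /* read-only */
        return bytes;
    }

The real work is everything around it: the camera pipeline, UI, buffering and sharing.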

While the R&D for the fast JPEG codec took about a year to perfect, the iOS app took just about the same time to get polished (including the NodeJS backend work, the HTML5 website and embeddable widgets in AngularJS).

Writing the rest of the app would take a few months of full time work, and it's not yet clear if that might pay off at this stage.

We'll see... and glad to hear there's interest! :D


Don't overlook the fact that the source for the stock Android camera is available under a commercial-use-friendly open source license and has a quite nice native Android UI. You don't have to reinvent all the wheels unless you're stubborn.

https://android.googlesource.com/platform/packages/apps/Came...

I would buy that in a heartbeat.

Unrelated, how quickly can you alter exposure settings? Can you get 30 pictures per second with three interleaved exposure brackets? (i.e. burst of 10 HDR photos / second) That would be very, very, very, very cool.


That's really interesting. I wasn't aware of that. I'll have a look at it once social sharing is out the door.

I did consider getting into other aspects of iPhoneography, like HDR, etc. The trouble with HDR in particular is that there's no API access to direct the sensor into each of the bracketing modes.

In the case of HDR, it might be more fruitful to attempt some kind of image signal processing, similar to "Clarity" on Camera+.

I looked into that for a while, and I figured that Camera+ might be using some version of the Contrast Limited Adaptive Histogram Equalization (CLAHE) algorithm. In any case, what they've done is really neat from a DSP perspective. :D


Hi,

There's also a cool technology that allows you to save nearly the same JPEG with a much, much smaller file size.

https://news.ycombinator.com/item?id=2940505


If I were you, and you're unsure it might pay off, I'd go to the phone makers and offer them licenses for just the encoder.


There's been a similar app on Android for years:

https://play.google.com/store/apps/details?id=com.spritefish...

The claimed speed is 30 fps, the moar RAM the better I think.


I thought the app was pretty cool, just super slow to save out the JPEGs.

That's one of the reasons I spent a lot of time to make sure SnappyCam could compress these images, thumbnails and Exif metadata included, at a ridiculous speed.


Yes, SnappyCam has the advantage of a faster algorithm.

Android, on the other hand, has more open and better hardware, e.g. a larger full-res camera, much larger RAM, faster external storage or even OTG, more CPU/GPU cores. In theory, if SnappyCam were fully ported to Android you could make it faster than 8M pixels @ 20fps.


The possibilities are exciting for sure! :-)


I might point out that, depending on the device, you could have many more cores to work with in Android-land.


We should create a "marketing core" term the same way we have marketing HDD size. I have seen big.little advertised as 8 cores.

I think the main advantage of Android devices will be the fact that high-end devices generally have more RAM than their iOS counterparts. So even if the codec cannot be pushed as far as on iOS, the bigger possible buffers can help.


Yes, more RAM definitely helps.

On SnappyCam, I had to arbitrarily limit the size of the buffer to a fraction of the system memory because there's no way to know "how much" RAM can be allocated to avoid the dreaded memory warnings until you hit one; and then it's a three-strikes-out policy: you get two, and the third kills the app.

The first two are "soft" warnings, but I suspect have a lower threshold than the "hard" one that sends SIGKILL.

In setting the limit arbitrarily, I unfortunately have no choice but to select it rather conservatively: it might otherwise be (a lot?) higher.
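In code, the cap ends up being little more than this kind of guess (a sketch only; the fraction is a placeholder, not a documented limit):

    #include <stddef.h>
    #include <stdint.h>
    #include <sys/types.h>
    #include <sys/sysctl.h>

    /* Cap the capture buffer at a conservative fraction of physical RAM,
     * since iOS never tells you how much you may allocate before memory
     * warnings (and eventually SIGKILL) arrive. */
    static size_t capture_buffer_cap(void)
    {
        uint64_t physmem = 0;
        size_t len = sizeof(physmem);
        sysctlbyname("hw.memsize", &physmem, &len, NULL, 0);
        return (size_t)(physmem / 4);    /* 1/4 is an arbitrary, conservative guess */
    }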


Is there any value in having the buffer size selectable for advanced users, so they can play with it and see where the sweet spot is on their hardware?


Purely wishful thinking: how about this and the latest Lumia camera?


I don't know much about their device (maybe Nokia might send me one?!? hehe)...

... but they must have an awesome JPEG encoder. I'd assume 41Mpx stills would need to be compressed in no more than 1 second for a reasonable UX. That there is a 41+Mpx/sec encoder.

I've also noticed they are using higher quality chroma sampling (4:2:2) so their encoder is actually doing a lot more work than say SnappyCam.

But I bet they're not doing it in software, either.


They will send you one if you send them 670 USD back...


I might try to (naively) implement something similar for Android just to see how fast it goes without too much low-level fiddling. It really is pretty darn cool.


Awesome! Drop me a line, would love to see how you go. :-)


Why not have a deferred compressor? I assume that just straight-up saving the raw data in memory would be much faster than compressing every frame as you get it.

Couldn't you get significant FPS increases (given that you still had free space/memory available)?


Actually, I do both on dual core devices.

One core is dedicated to host the capture/buffer, the other will encode shots in the background.

When you see the big circle percent animation, both cores are dedicated to compression to clear the encoder queue so you can take back-to-back living photos quickly.
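The shape of that split, very roughly (a generic pthreads sketch, not the app's actual code): the capture thread pushes raw frames into a ring buffer and the encoder thread drains it, with a full ring being exactly the "dropping shots" case.

    #include <pthread.h>

    #define RING 32
    typedef struct { int seq; /* ...plus the raw sensor data... */ } frame_t;

    static frame_t ring[RING];
    static int head, count;
    static pthread_mutex_t mu = PTHREAD_MUTEX_INITIALIZER;
    static pthread_cond_t  not_empty = PTHREAD_COND_INITIALIZER;

    static void push_frame(frame_t f)            /* capture thread */
    {
        pthread_mutex_lock(&mu);
        if (count < RING) {                      /* else: drop the shot */
            ring[(head + count) % RING] = f;
            count++;
            pthread_cond_signal(&not_empty);
        }
        pthread_mutex_unlock(&mu);
    }

    static frame_t pop_frame(void)               /* encoder thread */
    {
        pthread_mutex_lock(&mu);
        while (count == 0)
            pthread_cond_wait(&not_empty, &mu);
        frame_t f = ring[head];
        head = (head + 1) % RING;
        count--;
        pthread_mutex_unlock(&mu);
        return f;
    }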


I've just gotta say:

This is one of the main reasons I keep coming back to HN. A story gets posted about some cool new tech, and the creator is in the comments answering questions. Simply awesome.


haha, cool. :-)

To be honest, I don't often post here because I'm busy working, but am enjoying the discussion on a baby I've nursed for two years now. :-) Thanks for your post!


I read the TC article, bought the app, and came here in the hope that the developer would be here. I was not disappointed!


Very cool :) Thanks for the quick reply.


Assuming approximately 8MB worth of uncompressed raw data from the image sensor, 20 frames per second would require writing images to flash storage at 160MB/sec, which no iPhone can do.


But writing to RAM (at 3200+MB/s) is certainly possible. You could cache around 2-3 seconds (on lower-end iPhones) as you compress. Again, the idea wouldn't be that you could indefinitely do this, but merely that the compressor would be deferred (and would lag at a ratio that would still yield x FPS).

That's how I would do it. But apparently they already do this :)


That's how it's done: on dual-core devices, a JPEG encode can run in parallel with capture.

Try it on an iPhone 5 with "infinite shutter" disabled and you will see the dropped frames in the filenames once you import them to your machine from iTunes App File Sharing (or over SSH if you're jailbroken).


Science vs engineering distinctions aside, it is pretty cool to see the attention to detail + effort put into solving this problem.


This is quite remarkable. I just tested it and it works even better than advertised. I hope you become rich and famous for this. And I really hope there's not a hidden gotcha I haven't seen yet.


Thanks! :D I'm just very happy to have more people try the app.

It's been a hard slog working 7-day weeks for just over two years now. Feels great to receive some kind of recognition for the work.


I too have purchased the app, and initial testing on a 4S seems to show it works exactly as advertised. This is a really great app, and at an astoundingly low price. You should be very proud, I think; I'll definitely be using the app as my go-to in the future.

(Hmm, after writing that, I somehow feel it sounds like it should have a reference to my rural, folksy, respected job that clearly makes me qualified to discuss such things. Unlike those Amazon reviews I'm referring to though, I mean every word.)


That's abs awesome to hear! Thanks for the wonderful compliment.

I played with price. Until now, most of my sales were word of mouth, and the $1.99 price hindered "growth".

It's been an interesting game. Many people whom I demo the app to in person love it; then, when they reach into their pocket to download it and realize it's a paid app, they place their phone back into their pocket.

Still have lessons to learn in sales and marketing... but am enjoying the schooling.


Well as a point of reference, I will never pay more than $0.99 for a camera app unless a friend has specifically shown me how it works. I have been burnt on too many photography style apps that end up either not doing what I expected from the pics and description, or just sucking in general.

For me $0.99 just breaks that psychological barrier into 'who cares if it sucks'.

I gotta say though, after playing with SnappyCam it's definitely worth it. I bet being cheaper will end up with easily more than twice the sales.


I understand where you're coming from; social proof removes a massive barrier to conversion, even in my own experience.

I found that having it at $1.99 most definitely improved sales after it had been at $0.99 for about a week; after another week, it started to degrade again.

You're spot on in saying that $0.99 is a good price to get "disconnected" users who might experiment. If they like the app, they might make the personal recommendation to their "connections" where price is less sensitive.

After a while, that social proof and networking effect wears off and it's time to reset the price down to the "discovery amount" of $0.99.

To be honest, I'd love to flip SnappyCam over to freemium; but I feel that can't happen until the social sharing is bolted in and the app has a chance to sell itself organically.


The TechCrunch title sounds like it was taken from an infomercial. Or "one weird trick..."


To be clear, using SIMD for JPEG encoding is not new. I'd be curious how this JPEG encoder compares to libjpeg-turbo's NEON encoder.

http://libjpeg-turbo.virtualgl.org/


Hey @jlebar, you're right--it's existed on the desktop for some time (MMX, SSE). When I first started, libjpeg-turbo didn't yet have an ARM port, which was part of the motivation to do it myself.

See my post in another thread here on the same topic.


I take a fair number of casual action shots – mostly of the kids. To get something to come out I often take a handful of pictures in a row; even that's often not enough, or the "right" scene happens in between these slowish frames. This could be cool for those cases.

Except... I also get annoyed sorting through those pictures afterwards. It would be interesting if with some post-processing it could sort through the pictures some for me, identifying distinct pictures, or filtering out ones that are clearly bad (mostly too blurry), or if fancier maybe doing eye or smile detection. I want to capture the moment a person looks up, before they think about the camera.

Another cool case would be taking photos of movement. If I can track the movement with the camera the picture can come out surprisingly well. But tracking movement is hard. If I had several seconds of pictures, over the course of that time probably I'd track the movement well enough for a few of the photos to come out.


If I remember correctly, automatically sorting through your pictures and picking the best is exactly what Google announced for Google+ at their last I/O keynote.


That's a cool feature, and not easy to implement. It generally ends up being a machine vision problem. (Google has both great talent and a lot more resources than a single-founder, self-funded engineer like me.)


Hi Guys. This is Fast Camera. I'm callin' out SnappyCam!

Are you up for an old fashioned DUEL to see which app can shoot the most "native camera quality" 8MP images per second in 60 seconds without crashing?

On an iPhone 5 with all apps closed, SnappyCam manages to save only about eight 8MP images per second over 10 seconds on average, and loses the other 12 per second. And these are not 8MP-quality images, at least as far as comparing resolution against the native camera app or Fast Camera. All of this technical discussion sounds great, but is anyone actually testing this like I am? Just download a stopwatch app with hundredths of seconds and burst for 10 seconds. You'll see. Then shoot something with a LOT of detail at 8MP in both SnappyCam and Fast Camera.

Fast Camera is capable of 10-12 native-quality 8MP images per second (more than SnappyCam). We throttle it back on purpose.

And what's with camera-shutter.caf John? ;)

Michael Zaletel, Founder, i4software (Fast Camera, Vizzywig, Video Filters)


Michael, thanks for making contact by e-mail, outside of these public forums.

As discussed over e-mail, I've created an in-depth report showing that SnappyCam indeed takes full-quality 8 Mpx shots on the iPhone 5.

With the amazing discussion and interest here on HN, I thought to share it with the community here as well:

http://www.snappylabs.com/blog/snappycam/2013/08/03/snappyca...

I'm off the grid on a hiking vacation for the next 2.5 weeks, back in late August and look forward to the discussion then.

jpap


Bug report -

On the first launch, if I quickly press the Settings button (bottom-right) it starts the flip animation and still shows the handwritten overlay explaining where to tap for manual focus and whatnot. After the animation is complete, the overlay is still shown, so it looks like a mess. And it's also not obvious how to get the overlay back, because I haven't seen what it actually said.

Congrats on the TC cover and a very nice app. Get rich! :)

(edit) A nitpick - "Warm-up", not "Warmup"

(edit) Report Usage = On. Seriously? Who on Earth in their sane mind would actually want this, except for you? Next thing you tell me is that you have some "app analytics" library linked in and it's always on. Please don't be evil.

(edit) The same goes for "Send Crash Reports = Always". It should be "Ask". Respect your users and they will help.


Thanks for the suggestions:

1. Looks like a race condition for the settings button tap. Does it happen if you wait a second before pressing the settings button?

2. You can re-enable the tutorial (overlay) screens from the bottom of the settings menu.

3. On the usage/reports, I hear you. I won't give you bullshit on "standard industry practices" here, but I will say that I had to hack a well-known closed-source library to give you that opt-out from usage reporting. I really do value your privacy. (I've already requested the library developer fix it, and will try and write a blog post on how other developers can provide a kill switch, too.)

4. The default is there because many people don't like to configure apps, they just use them as-is. In that light, the default configuration is the one I felt was best for general use.


"Ask" should be the default, I have to insist.

Just tried the actual functionality and it gives the machine gun sound effect, showing a counter going up to 50-60; then I release the button, the blue stripe around the button shrinks back, and it adds a photo to the bottom-left area, but when I tap it, there are just 3 frames. What am I missing? Is it adaptively trimming bad frames (I am shooting in low-light conditions)?

(edit) Just tried again, and this time after I released the shot button it showed a big circle overlay with "JPEG" in the middle that counted up to 100%, and the resulting photo had the right number of frames. It didn't do that on the first try. It's either a bug ... or you are missing a helpful hint that explains what's going on :)


Yes, that does require some explanation:

1. The receding circle is the capture buffer being processed. When you're tapping on the thumbnail, SnappyCam sees the start of the living photo being available and shows it. It does not, unfortunately, refresh the thumbnail list as more shots complete processing.

This is a (feature) bug and I'll work to address it.

2. The circle with percent progress is what I call "turbo rewind", where the camera is shut down so that all CPU cores can be applied to compression so that you can take back-to-back living photos quickly.

You can select the buffer "threshold" for when this kicks in under the advanced settings: look for Turbo Rewind.


It takes time and lots of effort, and I'll argue it's easier on a quasi-standard platform (processor-wise), but apps like this show how much juice can be squeezed out of the existing hardware by handcrafting the code.

Kudos, I just bought the app!


Thanks for the download! Let me know if you've got any feedback, I can be easily contacted through the app. :D

I'd just like to add that in addition to handcrafted code, choosing the right algorithm and always trying to "do less work" (fewer cycles, less data IO, better use of registers) makes a big difference.


I always say a good programmer has to be "lazy"!

Some feedback: the default exposure settings showed my room as pitch black (I have it very dim right now), while the native iPhone 5 camera adjusted automatically. I was able to snap a shot by pointing at the light. Personally, I prefer not to crank the gain on the sensor.


haha, yes, if only it pays to be lazy. :) Sometimes doing "less work" means more up-front planning and thinking. Not a bad thing necessarily.

Interesting on the native camera adjustment. SnappyCam will use the "low light boost" high ISO capabilities of the camera. I'll have a play around with it.

Otherwise, does the continuous flash help you much?


The continuous flash didn't fire. I'm running the stock settings.


Oh, it's a manual flash.

Enabling that automatically is an interesting problem in itself: I'd have to estimate the light level based on the camera preview... or perhaps from the preview metadata.

Will think about how that might be done. Thanks for the thought. :-)


I'm impressed as hell by all of this, the fast DCTs and the crafting of the entire process to build something so far beyond anything else on the market is great.

Bought this!


I'd love to make the jump from "just" web development to some proper embedded development. Any pointers?


Practice. A lot.

A good start is actually the ARM processor; since it's a RISC instruction set, it's quite simple.

I've done lots of assembly in my ugrad days, even writing a Motorola HC11 micro-controller emulator, but ARM would be a much better choice right now.

I found the "Tonic: Whirlwind Tour of Assembly" [1] site invaluable to get me started for SnappyCam, as it covers a lot of the ARM ISA.

For iOS-related assembly, I'd recommend [2].

And for a taste of ARM NEON SIMD, have a look at [3]. The one thing that "clicked" for me on SIMD is that you should look at each register "lane" as trying to unroll a loop. I initially dived in thinking I'd just make a sequential algorithm parallel, which is often too difficult to arrange.

[1] http://www.coranac.com/tonc/text/asm.htm [2] http://www.shervinemami.info/armAssembly.html [3] http://hilbert-space.de/?p=22
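To make the lane-as-unrolled-loop idea concrete, here's a toy contrast (not from the app): the scalar loop and its 16-lane NEON equivalent for averaging two rows of pixels.

    #include <arm_neon.h>
    #include <stddef.h>
    #include <stdint.h>

    /* Scalar: one pixel per iteration. */
    void avg_rows_scalar(const uint8_t *a, const uint8_t *b, uint8_t *out, size_t n)
    {
        for (size_t i = 0; i < n; i++)
            out[i] = (uint8_t)((a[i] + b[i] + 1) >> 1);
    }

    /* NEON: the same loop "unrolled" 16x, one pixel per lane.
     * vrhaddq_u8 is a rounding halving add: (a + b + 1) >> 1 per lane. */
    void avg_rows_neon(const uint8_t *a, const uint8_t *b, uint8_t *out, size_t n)
    {
        size_t i = 0;
        for (; i + 16 <= n; i += 16)
            vst1q_u8(out + i, vrhaddq_u8(vld1q_u8(a + i), vld1q_u8(b + i)));
        for (; i < n; i++)                         /* scalar tail */
            out[i] = (uint8_t)((a[i] + b[i] + 1) >> 1);
    }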


Just adding my vote for an Android version! Great job!


It looks great but I have a few questions/comments.

1. What is the difference in quality between using this and the video capture mode? I.e. if what I really want is a high quality video, would this get me a better result than the built in programs?

2. Seeing as how you've done all this work (and how Android apps can be compiled from C) how difficult is it to port this to Android so that the rest of us can get in on it?

3. Is it just me, or can anyone else not change the settings / look at the other demos on the samples page?


1. It really depends on what you're after: are you looking for a video sequence that plays back, or an individual still? Video is better for the former, SnappyCam for the latter.

2. It's a lot of work, hinted at in another thread here on HN. The entire "app" built on top of the JPEG codec needs to be rebuilt from scratch; new artwork is required, etc.

3. I just tried it from another machine and works for me.

My backend API is being hammered at the moment, which is awesome, but it doesn't appear to be overloaded. (Gotta love NodeJS!)


I have the same issue (3), on Safari and Chrome: mouse clicks in the menus after they're opened are ignored, but the keyboard works to select a video.


Weird. Could be a bug in the dropdown component I wrote in AngularJS. :( Glad the keyboard still works.

Will look into it...


> To put the speed in perspective, SnappyCam is about 4X faster than the normal iPhone 5 Camera app, and more than twice as quick as the Samsung Galaxy S4′s 7.5 shots per second.

Does it mean that the S4's hardware is faster than the iPhone 5's, given they're using similar algorithms? And if you made the same app for Android, could it get even better results?


It's unclear to me, as there's a lot more going on when taking a photo than you might think. :) (I originally thought I could knock together a basic SnappyCam app on top of the JPEG codec within a week or two, it took months.)

If SnappyCam can do it on hardware that is older than the S4, then I can't see why technically Samsung can't lift their game.

And judging by how quickly they've been chasing Apple, and sometimes stepping ahead, I wouldn't be surprised to see a bit of leap-frogging for some time to come.

Let's see what the 5S/C brings in a few months! I'm excited.


Looks like the most interesting part here is the "living photo" that instantly responds to interactions. Can this be standardized as a new video format? It would be very cool to have all cameras be able to save video in this format. @jpap should consider formalizing this format, producing viewers on different platforms, and licensing this tech to manufacturers of point-and-shoot cameras, GoPro, webcams, camcorders, etc. This feature could make a camera an instant hit. It is a real value add for customers. I can also envision movies getting recorded in this format and made available on Blu-ray so people can instantly interact with the cool fast-action videos in HD. I think the great insight here is the awesome coolness of instantly interactive video that is ready to be unlocked inside current camera hardware.


I'm really glad to read this! :-)

I had similar thoughts myself, and they form part of what I have in mind for the next major SnappyCam release (a taste is what you see on SnappyCam.com today). My thoughts are perhaps more web-focussed than what you describe, but the sentiment is really encouraging!


Some questions:

Instead of doing full resolution at 20 fps, can you do a smaller resolution at, say, 160 fps?

If the next generation iPhone processor is faster (a safe bet), do you think your software would allow at least 24 fps, and you could use the iPhone to shoot a 10+ megapixel movie?

Shouldn't Apple have hired you already?


It all comes down to what the hardware supports, ultimately.

I'm not performing any true miracles here: I'm just making best use of the hardware resources available, with some clever software tricks and algorithms.

The iPhone 5 actually supports 60 pictures/sec capture, for example, but Apple has decided, for whatever reason, to disable it on iOS 6. If the iPhone 5 ran iOS 5 (surprise?!) then it would likely run at 60 pictures/sec.

On iOS 7 that all changes: so you'll soon be able to capture at 60 pictures/sec, which is rad.

The rollerblader shown on the TC article was shot at Sunday Streets in the SF Mission District on my iPhone 4S at 60 pictures/sec. The photo quality is somewhat degraded for the web, but it still looks awesome full screen (from the SnappyCam website; the TC embed is in a restricted iframe and can't go full-screen).

I know a couple of great engineers that work at Apple, but haven't spoken with them for one or more years. Sounds like a cool place to work, but so can be working for yourself.

It's been a hard slog--I quit my last full-time job in March 2011--but I'd love to see SnappyCam through and bring to life another startup idea I have in mind. (Some of the YC partners have already seen me pitch it; SnappyCam has been a rather good distraction of late.)


At 20fps, could you make a 3D camera app? The user would move their camera through space, you'd correct for stabilization with the accelerometer etc. to know each point in space, and then treat the multiple viewpoints as individual cameras.


That's a really interesting machine vision problem and a lot more complex than a JPEG codec. :-)

I wonder how long before we start to see Kinect-like infrared cameras mounted on phones to make the depth problem easier to solve. That would be cool!


I wonder if you could use all the rapid frames, plus a variant of that cool Adobe image de-blurring tech [1] that was shown a while ago, to produce a clearer, sharper image during motion?

[1]: http://prodesigntools.com/photoshop-cs7-image-deblurring.htm...


There's still much innovation left in image signal processing... and fortunately much interest in taking good photos!

This reminds me of research into superresolution, an area that's "super interesting" :-) as well.

The guys who started Occipital (360 Panorama), I believe, tried to dabble in that with ClearCam many years ago... but I honestly don't know much about it. Anyone from Occipital here on HN?


Or you could just use individual frames and bundle adjustment.... even via free online software like 123D Catch. ;) http://www.123dapp.com/catch


You should port it to several platforms and license it as a library. I would think there are many companies interested in a fast JPEG encoder that is not embedded in an iPhone app :)


This is pretty cool. You got my buck!

I was kinda hoping I could also turn the speed down to multiple seconds per photo, since it talks about doing time-lapse shots. One of my major uses for my phone's camera is selfies for art reference, currently done with Genius - which annoyingly won't do repeated shots at anything less than 10 seconds. Being able to take one shot every 1-3 seconds would be pretty damn cool for me.


Thanks!

You can reduce the capture rate in the app settings, down to 1 photo per {1, 5, 10, 30, ... } seconds.

Move the slider toward the turtle under "Camera Lens".

jpap


Oh durf, I fail at exploring UIs. Thanks!


I must say that this is one of the most interesting apps I have found in the last few weeks. You should get yourself a beer, as this is a neat feat to accomplish :)

Also, some people were saying that the web app wasn't working for them on some Chrome versions. As for me - I've got 28.0.1500.95 - the culprit was the Disconnect extension, which, when disabled, allowed the whole application to behave as expected.


That really helps, thanks for letting me know about the Disconnect extension. I've never used it; will check it out.


That's fantastic, and a very cool demo.

How does the encoder performance compare to libjpeg-turbo? That also has some SIMD work for NEON.


Yes, Nokia contributed the NEON code for the DCT in libjpeg-turbo.

I haven't had a chance to do a side-by-side comparison as yet, but I suspect the SnappyCam encoder is faster for many reasons, including the choice of algorithm, the way they sometimes use two multiplies (low and high halves), and their row-by-row image processing with function call overhead in favour of code maintainability.


I was involved in some NEON work on libjpeg-turbo, and I can confirm that the image buffer management there is hell, as are some other aspects of the design. A from-scratch implementation with performance in mind should easily be quite a bit faster.


Looking forward to taking these pics and testing out http://research.microsoft.com/en-us/downloads/69699e5a-5c91-... Image Composite Editor with things like Photosynth.


[deleted]


1. Instagram is only shown if you actually have Instagram installed on your device. ;-) As you might know, Instagram guards their API carefully: we don't yet have general access to it.

2. E-Mail is also only shown if your device has built-in e-mail accounts set up.

3. iTunes App File Sharing is accessible by connecting your device to your Mac/PC via USB and using the iTunes app.

Drop me a line jpap {at} snappylabs.com if you're still having issues. I'd love to help! :D


It crashes for me every time I take somewhere between 60 and 75 frames with the main camera. With the front-facing camera, I can shoot forever. In the iPhone Settings (under Diagnostics & Usage), I have a bunch of LowMemory warnings. I'm using an iPhone 4S.


Thanks for reporting it in!

It seems I enthusiastically chose a large buffer size that appears to be causing issues on some devices under a lot of memory pressure.

If you reboot your phone, as awful as that sounds, it will likely fix the issue.

EDIT: I've just submitted an update to Apple that uses a more conservative buffer size.

This aspect is hard to get right: I once used an adaptive buffer size that heeded memory warnings, but that meant the buffer filled to lower levels than a conservatively sized buffer.

If only iOS had an opt-in for an *alloc returning 0 instead of these warnings, or at least told us how much space is left before we're SIGKILL'ed.
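
The conservative approach boils down to something like this fixed-capacity pool of frame slots (a toy sketch with made-up sizes, not the app's buffer): allocate everything up front and never grow, so a memory warning can't catch you mid-expansion.

    /* Toy sketch, made-up sizes: a fixed pool of compressed-frame slots,
     * allocated once before capture starts. */
    #include <stdlib.h>
    #include <string.h>

    #define SLOT_BYTES  (3u * 1024 * 1024)  /* generous bound for one 8Mpx JPEG */
    #define SLOT_COUNT  32                  /* conservative, fixed capacity     */

    typedef struct {
        unsigned char *data[SLOT_COUNT];
        size_t         len[SLOT_COUNT];
        int            count;               /* slots currently holding a frame */
    } frame_pool;

    int frame_pool_init(frame_pool *p)
    {
        memset(p, 0, sizeof *p);
        for (int i = 0; i < SLOT_COUNT; i++)
            if (!(p->data[i] = malloc(SLOT_BYTES)))
                return -1;                   /* fail early, before capture starts */
        return 0;
    }

    int frame_pool_push(frame_pool *p, const unsigned char *jpeg, size_t n)
    {
        if (p->count == SLOT_COUNT || n > SLOT_BYTES)
            return -1;                       /* full, or frame too big: drop it */
        memcpy(p->data[p->count], jpeg, n);
        p->len[p->count] = n;
        p->count++;
        return 0;
    }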


Thank you, I'll try rebooting the phone.

I recently read an article here on HN that briefly touched on memory management under iOS and especially the problem of apps getting killed, maybe it is interesting for you: http://sealedabstract.com/rants/why-mobile-web-apps-are-slow... (scroll down to "How much memory is available on iOS?")


Beautiful app, jpap. Well-done! I can't wait to do some side-by-side comparisons between this and video stills, and see what kind of image quality differences there are. My overall impression of the app itself is that it's incredibly solid. Keep building apps!


Thanks!! :D

Would love to see some real world comparison examples. Drop me a line when you've got something, would love to check it out. :)


I'm interested to know how their method compares to how dedicated digital cameras and DSLRs do it. Are cameras running dedicated hardware/firmware to achieve the same result? Or have they optimised their software in the same way that SnappyCam has done it?


I can't say, as SnappyCam is my first foray into image signal processing. (Though DSP isn't new to me.)

I'd guess that DSLRs use a combination of hardware acceleration on the "tricky" bits (like DCT) with firmware to control the compute hardware.

Huffman is a particularly difficult beast, as it can't be parallelized. The JPEG bitstream is inherently serial, though there have been some proposals to improve that.

If you run a SnappyCam JPEG that you pluck from iTunes File Sharing through djpeg (from libjpeg), you will notice that the YCbCr planes are not interleaved.

I once experimented with a parallel JPEG encoder, encoding the Y, Cb, and Cr planes in parallel, but the threading overhead cost more than just queuing up each JPEG encode separately on a multithreaded queue.

Bonus points if you notice another marker in the JPEG. That's intended for parallel JPEG decoding, but hasn't been implemented in SnappyCam as yet. (The existing decoder is fast enough for 8Mpx shots.)
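
If you want to poke at the bitstream yourself, here's a throwaway C scanner (not from the app, and it makes no guess about which marker is meant) that just lists every JPEG marker and its offset. In a typical baseline file you'd expect SOI (FFD8), APPn, DQT, SOFn, DHT, SOS, optionally RSTn inside the scan, then EOI (FFD9).

    /* List JPEG markers: a marker is 0xFF followed by a byte that is
     * neither 0x00 (stuffing) nor 0xFF (fill). */
    #include <stdio.h>

    int main(int argc, char **argv)
    {
        if (argc < 2) { fprintf(stderr, "usage: %s file.jpg\n", argv[0]); return 1; }

        FILE *f = fopen(argv[1], "rb");
        if (!f) { perror("fopen"); return 1; }

        int prev = 0, c;
        long off = 0;
        while ((c = fgetc(f)) != EOF) {
            if (prev == 0xFF && c != 0x00 && c != 0xFF)
                printf("marker 0xFF%02X at offset %ld\n", c, off - 1);
            prev = c;
            off++;
        }
        fclose(f);
        return 0;
    }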


Interesting stuff, thanks for the info. Much respect for doing so much optimisation in assembly (my limit is C, even for embedded work).


When you read about 'DIGIC', 'BIONZ', or 'EXPEED' in dedicated digital cameras, you're on the money: those are custom processors (in some cases, more than one). They're often multi-chip modules that can be scaled depending on the level of camera and can be based on standard SoCs or embedded macros.

On top of that, you'll have the 'firmware', which is where projects such as Magic Lantern/CHDK make things fun.

Some reading on the Canon gear:

http://cpn.canon-europe.com/content/education/infobank/captu...

http://magiclantern.wikia.com/wiki/Datasheets


Love the "we'll iMessage you a download link" feature on the web page. Are you using a service for this? Note it doesn't seem to work for me in Germany, it doesn't change the country code, it leaves it at +1 (instead of +49)...


It's a webservice I hacked together that sends iMessages from my old MacBook Pro. :-)

It was my understanding that German mobile numbers are written locally starting with 01? [1]

e.g. in Australia, my mobile number would be 040x-xxx-xxx. The international version is +61 40x-xxx-xxx. When you select Australia, it will show 04.

(OK, I now see how this could be confusing; my apologies.)

[1] http://en.wikipedia.org/wiki/Telephone_numbers_in_Germany#No...


Phone numbers in Germany work the same way as you describe for Australia. Internationally you'd have +49 (area code without leading zero) (number), and within Germany you can use (area code with leading zero) (number).


Awesome. If you type in your number as if you were local, do you get the iMessage?

(Internally I add the international prefix. As you can imagine it took a while to find all of the local prefixes and create number masks!)
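
For what it's worth, the local-to-international mapping is the easy half; the per-country prefixes and masks are the tedious part. A tiny hedged sketch in C (not the actual backend; the function name and parameters are made up): strip non-digits, drop the trunk prefix ("0" in both AU and DE), and prepend the country calling code.

    /* Illustrative only: local mobile number -> E.164-style string. */
    #include <stdio.h>
    #include <string.h>
    #include <ctype.h>

    void to_e164(const char *local, const char *cc, const char *trunk,
                 char *out, size_t outlen)
    {
        char digits[32];
        size_t n = 0;

        /* keep digits only: "0401-234-567" -> "0401234567" */
        for (const char *p = local; *p && n + 1 < sizeof digits; p++)
            if (isdigit((unsigned char)*p))
                digits[n++] = *p;
        digits[n] = '\0';

        /* drop the trunk prefix if present, then prepend +<country code> */
        const char *rest = digits;
        size_t tlen = strlen(trunk);
        if (strncmp(digits, trunk, tlen) == 0)
            rest = digits + tlen;

        snprintf(out, outlen, "+%s%s", cc, rest);
    }

    int main(void)
    {
        char e164[32];
        to_e164("0401-234-567", "61", "0", e164, sizeof e164);
        printf("%s\n", e164);   /* +61401234567 */
        return 0;
    }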


More technical details on SnappyLabs blog: http://www.snappylabs.com/blog/snappycam/2013/07/31/iphone-k...


There is a lot said about the assembly code. I wonder whether it makes sense to code it in LLVM IR?


There are definitely improvements being made to LLVM to automatically parallelize code (especially unrolling loops) to SIMD.

I haven't personally tried it, but would love for it to match the code quality of hand-cranked assembly... writing it is tedious and error prone, but you do get control over when you preload the cache and the stack, and you can do really cool things with the CPP and macros to "manually inline" things. :-)

And who doesn't like writing a good ol' fashioned jump table?!
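
In C the spirit of a jump table survives as an array of function pointers (a toy sketch, nothing from SnappyCam); in asm you'd index into a table of branch targets instead.

    /* Dispatch on an opcode via a table of function pointers. */
    #include <stdio.h>

    static void op_add(int *acc, int v) { *acc += v; }
    static void op_sub(int *acc, int v) { *acc -= v; }
    static void op_shl(int *acc, int v) { *acc <<= v; }

    typedef void (*op_fn)(int *, int);

    static const op_fn jump_table[] = { op_add, op_sub, op_shl };

    int main(void)
    {
        int acc = 1;
        int program[][2] = { {0, 4}, {2, 3}, {1, 5} };  /* add 4, shl 3, sub 5 */

        for (size_t i = 0; i < sizeof program / sizeof program[0]; i++)
            jump_table[program[i][0]](&acc, program[i][1]);

        printf("%d\n", acc);   /* ((1+4)<<3) - 5 = 35 */
        return 0;
    }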


Thanks, btw there was a thread on this topic recently: https://news.ycombinator.com/item?id=6096743


Amazing work, and the living photo thing could be a hit.

Out of curiosity, and a bit unrelated: I've been craving real raw capture on the iPhone (before Bayer interpolation, white balance, noise removal). Is it possible?


It may not be possible, even for Apple.

If you have a look at the data sheets, for example [1], you'll see YCbCr or RGB output formats being listed.

I guess it makes sense for as much signal processing as possible to be done as early in the chain as possible, in the interests of lowering power consumption. (Less data to transfer over a serial bus into more circuitry, at the very least.)

[1] http://www.ovt.com/products/sensor.php?id=134

(Sony also make the sensors for Apple, apparently.)


In general sensors output raw and the ISP does the image pipeline including demosaicing, so a phone could support raw with different ISP firmware. (Reportedly Nokia wrote custom ISP firmware for their fancy camera, for example.) I wouldn't be surprised if one of the upcoming Samsung or Sony phone-cameras supports raw.


Wait, how does this work with the Apple frameworks? I assume you can't go faster than what Apple gives you. If you were to discard every photo, how fast could you theoretically go?


They simply are not using what Apple gives them. They wrote their own JPEG encoder which side-steps any limitations that Apple's own implementation has.


jpap, this is the best HN thread I've seen in a while. I never comment, but I'm compelled to now, because it's not often I see a hack this mesmerizing and exciting. For a moment, I almost wanted to drop everything and dive into JPEG myself, something I don't think I've felt since reading about John Carmack and his game engine hacks. Even though I understand <10% of the details being discussed, I'm compelled to learn more. Thanks, jpap. :)


I'm so very happy to read your post! :-)

I'm not usually one to post publicly either, but with practice I'm finding it comes more naturally.

I do hope you dive in---the devilish details of image signal processing are really interesting.


Please tell me there's an iPad version in the pipeline. This is the second non-free app I have on my iPad - it doesn't disappoint at all. Great work!


I have it in mind at a lower priority. To be honest, my main impetus is for better discoverability on the App Store from an iPad device.

They really like to hide those iPhone apps! ;-)

It will be nice to play with the interactive living photos full-screen, though iOS 7 makes iPhone apps look amazing on the iPads in any case.


I do not agree with the use of the word "scientific" in this context. Especially since it appears to be a shameless plug for a product.


This app is a lot of fun! Thanks for making it.


Any papers on the subject? I'd love to dig deep into some of the technical details behind this.


I'd love to disclose the implementation details, and even release it on GitHub, but unfortunately I have to keep it as a trade secret for obvious reasons.

Perhaps one day! :D It was a tonne of work that I'd love for fellow engineers to take a look at. I learned a massive amount from reading the likes of libjpeg-turbo and other OSS implementations, though none of them use the same DCT as the one I developed for SnappyCam. (They weren't a good fit for the ARM NEON ISA.)


WHY DONT YOU CURE CANCER INSTEAD


Actually, cancer research can benefit (admittedly by a long shot, but still) from these kinds of improvements in image processing. Don't forget that medical imaging - and thereby, indirectly, the recognition/detection of cancer - is one of the first steps in curing said disease.


You are disrupting my narrative with facts.


Very well said.

I suspect Clarity on Camera+ is some form of Contrast Limited Adaptive Histogram Equalization (CLAHE).

CLAHE came about from digital image cleanup of medical scans for human analysis.


Now, a CLAHE implementation on top of SnappyCam .....


I considered doing this over a year ago. I was really intrigued by the algorithm, but decided it was a distraction from getting the basic app solid first.

I'd love to revisit it. It really works wonders. I've played with the CLAHE implementation in Fiji/ImageJ and while it can produce really good results, it does require some tuning.

I really admire the Camera+ guys for creating an auto-tuning algorithm that is quite similar. (I'm unsure if they're using CLAHE or a variation.)
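
For anyone who hasn't met it: CLAHE is plain histogram equalization done per tile, with the histogram clipped to limit contrast amplification, and interpolation between tiles to hide the seams. A minimal sketch of the plain, global version on an 8-bit grayscale buffer (illustrative only, and certainly not what Camera+ ships) looks like this:

    /* Global histogram equalization: remap levels so the cumulative
     * distribution becomes (roughly) linear. CLAHE applies the same idea
     * per tile with a clipped histogram. */
    #include <stdint.h>
    #include <stddef.h>

    void hist_equalize(uint8_t *px, size_t n)
    {
        if (n == 0) return;

        size_t hist[256] = {0};
        for (size_t i = 0; i < n; i++)
            hist[px[i]]++;

        /* cumulative distribution -> lookup table from old to new level */
        uint8_t lut[256];
        size_t cdf = 0;
        for (int v = 0; v < 256; v++) {
            cdf += hist[v];
            lut[v] = (uint8_t)((cdf * 255) / n);
        }

        for (size_t i = 0; i < n; i++)
            px[i] = lut[px[i]];
    }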


What's the diff between this and video shooting? Isn't that 25fps?


Video gives you still images with one quarter the pixels (1920 x 1080 = 2.07 megapixels vs. 8 megapixels) and presumably more heavily compressed.


Size and quality. We've touched on it in a few other threads here if you'd like some further background.


This may be a really cool technical achievement, but the title is misleading. It is not scientific - that is, the scientific method was not applied to increase our understanding of the Universe. No, it is just a really cool testament to how cool engineering really is.


This is a great app. What you need to do is market it to skateboarders.


Agreed! I've got a good friend who is a skater and he reminds me of this often.

Any good skating sites I might want to contact?


Good work, jpap. I wish this were posted during the day though ;)


This kind of functionality is standard on OMAP4 devices.


ARM actually publishes reference code under the library name "OpenMAX" for their mobile processors.

I've read ARM's source code and found that they too use the AAN algorithm for the DCT. (They provide a tonne of code for other multimedia-related stuff too.)

I learned a lot from their code, even though my implementation is completely different and original.

I would also dare to say that my asm source is maintainable. I had a very hard time understanding their code, as it wasn't very well documented or laid out... but it was nevertheless a valuable learning tool.


jpap, if you ever find yourself stranded in Canberra, I'll buy you a beer (a proper Australian one).


Cheers mate, I'm from Melbourne and might just take you up on that! (I live here in SFO at the moment, but try to get back as much as I can.)


I'm over in the States in a month-ish but not over your way (Seattle, NYC).


jpap,

Could you take the 'trimmed' section and create a looping GIF from that? (Can I do that already?)


The biggest problem with GIFs would be the colorspace: GIFs are usually limited to 256 colors. Color quantization from a JPEG to GIF would kill the photo.

The whole "living image" sounds nice. If it was possible to create a format, say "AJPEG", out of it, it would be awesome.


It's now on the list. Had a few requests for it, and agreed, it'd be cool. :)


Great. I'm not actually sure how the iPhone photo library handles GIFs... but I'd much rather be able to choose 'export as GIF' & 'export this frame' than save all 100 photos.

great app man, thanks


The Camera Roll will apparently host them, but not show them as animated.

Unless you roll your own viewer, many devs suggest using an embedded UIWebView to animate it.

Otherwise it kinda gets treated as pass-through for most of iOS. The Messages app apparently animates them nicely, and for some people that's a real motivation to include the feature. (So they can send animated GIFs to their friends.)

Glad you like the app! :D


What's the breakthrough? My GoPro can take 120 photos per second.

Furthermore, what happens if you point this at a device that can affect each pixel on the phone's image sensor 20 times a second? Is all the information preserved? If so, this is an interesting hardware hack. If not, this is an interesting shell game. But I don't see how it's a scientific breakthrough.

(It sure is good for sales when TechCrunch prints your press release verbatim, though!)


A GoPro can do 720p (1 megapixel) at 120 FPS; this app can do 8 MP at 20 FPS. Also, it's using JPEG instead of a video codec, so when you want to pick out a single frame it's already in the format you want. Likewise there are no motion artifacts because it doesn't use a video codec, so every frame should be of equal quality.
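
Rough pixel-throughput arithmetic, for what it's worth: the GoPro is moving about 1280 x 720 x 120 ≈ 111 megapixels/sec, while 8 megapixels x 20 shots/sec is about 160 megapixels/sec, with each of those frames being a standalone JPEG rather than a predicted video frame.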


The breakthrough is in some very fast signal processing to allow the images to be taken on an iPhone, not a GoPro. Both are impressive.


I agree that this is likely not a "scientific breakthrough". Every time I write some really cool software, I don't claim a new scientific breakthrough. I guess I could, but I'm pretty sure my coworkers would grow tired of my antics. Perhaps some graduates with CS degrees actually believe they are scientists.



