Thanks! I find it weird that performance-focused JavaScript projects don't have live benchmarks up so you can judge for yourself. Personally, I've found that a semi-clever matrix multiplier in C++ compiled with Emscripten can reach about 1.5 GFLOPS (single-threaded on a 2.5 GHz i7 MacBook Pro), and existing JS libs like sushi are a lot slower.
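For anyone who wants to judge for themselves, here is a rough sketch of how a GFLOPS figure like that can be measured in plain JavaScript. It assumes a naive triple loop over flat row-major `Float64Array`s and counts 2n³ floating-point operations per multiply. The names `matmul` and `measureGflops` are made up for this example and don't come from any of the libraries mentioned. It is not the Emscripten setup described above, and I would expect it to come in well below that number.

```javascript
// Naive n x n matrix multiply over flat row-major Float64Arrays.
// (Hypothetical helper for illustration, not from any library.)
function matmul(a, b, n) {
  const c = new Float64Array(n * n);
  for (let i = 0; i < n; i++) {
    for (let k = 0; k < n; k++) {
      const aik = a[i * n + k];
      // Inner loop walks rows of b and c contiguously.
      for (let j = 0; j < n; j++) {
        c[i * n + j] += aik * b[k * n + j];
      }
    }
  }
  return c;
}

// Times `reps` multiplications and converts the result to GFLOPS,
// counting 2 * n^3 floating-point operations per multiply.
function measureGflops(n, reps) {
  const a = new Float64Array(n * n).map(() => Math.random());
  const b = new Float64Array(n * n).map(() => Math.random());
  matmul(a, b, n); // warm-up run so the JIT has a chance to optimize
  const start = performance.now();
  for (let r = 0; r < reps; r++) matmul(a, b, n);
  const seconds = (performance.now() - start) / 1000;
  return (2 * n * n * n * reps) / seconds / 1e9;
}

console.log(measureGflops(256, 5).toFixed(2), "GFLOPS");
```

The number will vary a lot with the engine, matrix size, and machine, so treat a single run as a ballpark figure only.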
On the other hand, optimal speed and JavaScript are often in different ballparks.
I threw together this thing for when I need some matrix stuff in a little script. Not fast, but flexible.
I look forward to the day when the sufficiently smart JIT implements the optimal matrix multiplication from that. :-)