Hacker News
Firefox will consider a Rust implementation of JPEG-XL (github.com/mozilla)
154 points by mariuz 4 months ago | 67 comments



> the team at Google has agreed to apply their subject matter expertise to build a safe, performant, compact, and compatible JPEG-XL decoder in Rust

There's already a JPEG-XL decoder written in Rust: https://crates.io/crates/jxl-oxide

It would be nice to hear why it's not good enough.


This is the same Google that wrote the libgav1 decoder instead of using dav1d, they don't have a strong history of starting from code that was invented elsewhere.

(In the case of AV1 I think they eventually gave up and started offering dav1d on Android, but not before shipping their in-house implementation that was much less efficient and no safer than state of the art, perplexing everyone in the process.)


It should be noted that all known independent JPEG XL decoders (especially my J40 [1] and jxl-oxide) were primarily written by a single individual for each library. So the request is that some library, either the future version of jxl-oxide or Rust rewrite of libjxl or something else, should be written in a provably memory-safe manner, actively maintained by JPEG XL developers, and successfully reproducing the original libjxl's decoding performance. I think it is indeed a fair deal to both Mozilla and JPEG XL developers.

[1] The main blocker for J40 was that I don't really want to keep it less safe than I hope to achieve, but I also want to keep it in C for practical reasons. This and my daily job prevented any significant improvements so far.


Well, Blink did come out of WebKit...


They are relatively neutral about the format, so I'm guessing they want a commitment from Google to maintain it.

As good as jxl-oxide might be, it looks like a one-developer project.


For one thing, I believe jxl-oxide is not yet performant enough to replace libjxl's decoder (not necessarily because of a difference between C++ and Rust, however).


I think it would make more sense to support the existing implementation than to start yet another one.

As for why jxl-oxide can't be used yet — it just isn't mature enough yet. They're still finding unimplemented or incompatible features, and have some unoptimized code.

JPEG XL is big and complex – it is designed to have every feature of every competing codec (from vector graphics to video), and to beat them all on compression in every case, so it's half a dozen different codecs in a trench coat.


I believe that analysis is unfair or imprecise.

The libjxl decoder is 3-5x smaller in binary size, and 10x smaller in specification text, than an AVIF decoder.


But it has an insane API. I wanted to support jxl encoding, but eventually gave up. Too complicated for average users


Could it be improved by having a wrapper for simple uses?


Why start from zero, then, if JPEG XL is big and complex?


Maybe deep down somewhere, a large motivation for coding in Rust is to invent something here, even if that includes reinvention. The memory safety is a by-product.


Why just consider? Hell yes.

Anything that has anything remotely to do with user-supplied data should be done in Rust, especially for such a high-profile piece of software.


They are not on the fence about Rust. They are on the fence about JPEG-XL when it entails a major increase in attack surface due to the current implementation being in C++.


Or Ada / SPARK, which is widely used in production for critical systems.


Rust adds a lot of complexity (a new language) and does not even offer a general approach to automated correctness verification (not saying that C++ is better in this regard). Maybe it is just better to wait for Rust++.


Rust is much better suited to static analysis and formal verification than C++, primarily thanks to stricter pointer aliasing, immutability without loopholes of const, and thread safety annotations in the type system.

Most things that are advanced correctness verification in C++ are regular compiler errors in Rust.
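As a minimal sketch of the thread-safety point (a toy counter, not Firefox or libjxl code): sharing mutable state across threads without a lock simply does not compile in Rust, so the locked version below is the only shape the compiler accepts. The equivalent unsynchronized C++ compiles silently and races at runtime.

```rust
use std::sync::{Arc, Mutex};
use std::thread;

// Sum 4 threads x 1000 increments into one shared counter.
// The type system forces the Arc<Mutex<..>> wrapper: handing a bare
// `&mut u32` to multiple threads is a compile error, not a data race
// discovered later by a sanitizer.
fn parallel_count() -> u32 {
    let count = Arc::new(Mutex::new(0u32));
    let handles: Vec<_> = (0..4)
        .map(|_| {
            let count = Arc::clone(&count);
            thread::spawn(move || {
                for _ in 0..1000 {
                    *count.lock().unwrap() += 1;
                }
            })
        })
        .collect();
    for h in handles {
        h.join().unwrap();
    }
    let total = *count.lock().unwrap();
    total
}

fn main() {
    println!("total = {}", parallel_count());
}
```

This is what "regular compiler errors" means in practice: delete the `Mutex` and the program is rejected at compile time, not debugged in production.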


Rust is already in the Firefox codebase (hell, it was basically invented to be used in Firefox), so it doesn't add any complexity.


Rust is not a new language for Firefox, some of Firefox is already written in Rust.


This is great news for JPEG XL. Web standards need two independent implementations of every feature – this proves that the spec is actually possible to implement, and makes it possible to verify that the implementations are interoperable. It makes uses in the wild much less likely to depend on bugs in a particular implementation, which makes it possible to upgrade or replace implementations without creating bug-compatibility problems.


I believe that there are already several decoder implementations that pass the ISO conformance test for JPEG XL. I don't remember which ones exactly, but I believe at least both jxl-oxide and libjxl do.


I’m not sure this holds as true if it’s the same team creating both implementations.


It's sufficient.

Another implementation is still unlikely to have the exact same bugs. Especially a rewrite in Rust, which will force the code to be structured differently (Rust is very opinionated about that).

The spec is big enough that the team won't be able to just write the exact same implementation from memory.


I don't disagree that it's sufficient, but also, ideally different people would implement the spec. If you have a particular mental model or understanding of a part of the spec that doesn't match what the spec actually says, that is likely to translate identically when writing a second implementation.


> 100k lines of multithreaded C++

Just for a JPEG library. Damn!


I don't know where that line count comes from. It looks like libjxl has 38314 lines (https://app.codecov.io/gh/libjxl/libjxl/tree/main/lib%2Fjxl) at the moment.


And that 38 kloc includes the encoder, psychovisual model, etc. that are not needed for the decoder.

The current decoder is around 20 kloc.


I think Highway is a critical component for libjxl and has to be counted towards the line count. (CMS can be ignored here, and any Rust implementation should ideally use qcms [1] which is already in use in Firefox anyway.)

[1] https://github.com/FirefoxGraphics/qcms


Personally, I'm on the edge on this one. I think both viewpoints are valid.

One way to think of Highway is that it is portable multi-platform SIMD intrinsics for C++. While we developed it originally as a part of JPEG XL, it has long ago graduated into a general-purpose library that has various uses, including the recent Gemma.cpp ML launch. Other modern Highway uses include: Audio, Browsers, Computational biology, Computer graphics, Cryptography, Grok JPEG 2000, JPEGenc, Jpegli, OpenHTJ2K, Image processing, Image viewers, Information retrieval, Machine learning, Numpy, and Robotics... (copied from: https://github.com/google/highway)

I derived the name from the CityHash, FarmHash, and then HighwayHash series, and considered that Highway would link this library to its roots in the HighwayHash (of course much is also based on Jan's previous work with SIMD). Notably, I resisted using the -li naming here :-D


Indeed, though Highway is approaching a system library at this point. It is also a thin wrapper over the compiler's intrinsics headers: e.g. ~16KLOC for arm_neon.h, times the number of targets. That's a lot of code, but it is not comparable with actual codec logic lines.


I'm also surprised by this number. Granted it's not just old JPEG standard, still 100k sounds like a lot for a decoder.


It's the encoder as well, plus converter code for things like JPEG reading, and JPEG XL supports animation...


I guess the original comment in TFA is a little bit misleading, then.

> of the reference decoder (currently behind a pref in Firefox Nightly), which weighs in at more than 100,000 lines of multithreaded C++


That number includes all the different extra tools in the repo, and all tests. And as mentioned, the encoder.


There must be a lot of edge cases that were discovered over the years.


Can someone summarize why JPEG-XL is so controversial to browser devs? Why not just allow as many image standards as possible?


Because of the increased attack surface and maintenance burden. Which is totally understandable. But JPEG-XL should really be supported...


What does Google get out of this?


A more efficient image format helps everyone on the web. We’re looking at 30-50% fewer bits sent over the wire for the same image quality. Massive bandwidth savings for everyone. JPEG-XL is basically a better version of what Google was already trying to do with WebP, and a suitable long-term replacement for all image formats currently in use.
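A back-of-the-envelope sketch, with all numbers assumed for illustration (1 TB/day of image traffic, and 35% as the midpoint of the 30-50% claim above):

```rust
// Illustrative only: neither the traffic volume nor the reduction
// percentage is measured data. Integer math avoids float rounding.
fn bytes_saved(daily_image_bytes: u64, reduction_percent: u64) -> u64 {
    daily_image_bytes * reduction_percent / 100
}

fn main() {
    let daily = 1_000_000_000_000; // assume 1 TB of images served per day
    let saved = bytes_saved(daily, 35);
    println!("saved ~{} GB/day", saved / 1_000_000_000); // ~350 GB/day
}
```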


Isn’t video the lion’s share of the internet traffic?


* much faster decoder/encoder than avif

* Much better quality at high bpp (avif performs better at high compression levels, meaning low-quality images can be much smaller, while high-quality images either don't compress well or lose a lot of quality)

* Best in class lossless compression, to replace png

* Progressive encoding

* Ability to losslessly reconvert existing jpegs for 20% free gains


great analysis and summary of the JPEG XL benefits!


Google Research (which, if I remember correctly, had members working on the JXL spec and wrote the reference implementation) has much more freedom than the Chrome team, and has already been cleared to work on JXL. They also don't have the same goals. Google Research is also responsible for Brotli and Snappy, I believe.

For all we know, JXL might be super useful, and actively in use, for some project at Google for X, Y, and Z reasons, while the Chrome team doesn't want to add it to the browser for A, B, and C reasons. Google does many things.


They have a general interest in the quality of the web. More web use is good for Google Ads. This sometimes comes into tension with Google's attempts to control and monetize the web.


And at "Google scale" shaving off a few bytes on every request means saving real money.


Does bandwidth actually cost Google anything?


> Does bandwidth actually cost Google anything?

Latency can cost monetization. Fewer bits at X b/s is fewer seconds.


At the datacenter scale, bandwidth is expensive, yes.


It's not free, no. And servers aren't free either.

At "Google scale" 1% or 2% savings can add up to hundreds of thousands or millions of dollars per year. 1% of 100,000 servers is 1,000 servers.


My take on this is that junior engineers optimize for the CPU, as it is easy to measure, while senior engineers optimize for user-experienced latency, as it matters more.

Users know that their time is valuable. If your website is fast, they will explore it and come back soon.


Reminder that https://github.com/google/wuffs exists too.


It will be fairly challenging to rewrite libjxl or any other independent JPEG XL decoder into wuffs, due to its inherent (and for now necessary) limitations. Even its PNG decoder is not very traditional, and JPEG XL has way more bookkeeping in its format.


Yeah, probably, just thought it worth mentioning since it was literally made by Google for use within Chromium.


I'm already covered by Rust, but thanks for the link, it is interesting. I was not aware of Wuffs.


Might be too much effort if the original is 100 kilolines.


Rust is the savior of Firefox.


If only it had no telemetry. Try opening Wireshark with Firefox running and you will see how much of a noisyboi it is.


Telemetry can be disabled by a couple of config entries.
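For example, a user.js along these lines. The pref names below are the commonly documented telemetry/data-reporting entries, but treat them as assumptions and verify against about:config on your Firefox version:

```javascript
// user.js — disable Firefox telemetry and health-report uploads
// (pref names may vary across versions; check about:config)
user_pref("toolkit.telemetry.enabled", false);
user_pref("datareporting.healthreport.uploadEnabled", false);
user_pref("datareporting.policy.dataSubmissionEnabled", false);
```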


You're making parent commenter's point stronger.

So not only is it on by default, but it also requires a couple of config entries to disable.


Don’t see how. If there is too much telemetry, there are only two options: it can or it cannot be disabled via config. It's very unlikely that someone complaining about telemetry opted into it on purpose… hence, the fact that it can be disabled is the good news in the bad news.


Still, if you are unhappy with it, you have a way to opt out. Unlike in many other cases.


That sentiment is just so unnecessary, but oh well, Rust is literally their brainchild, so.


Libpng and libjpeg have been huge sources of high-severity security vulnerabilities. It is not unnecessary at all.


> mozilla locked as too heated and limited conversation to collaborators.

I wonder which part was objected to: that the implementation being considered is in Rust, or that a rewrite might not actually make sense for a dying browser (2.75% share) at this stage?


Firefox is at around 8% market share in the US on non-mobile devices, up from around 4.5% this time last year. It's downright wrong to call an 80% increase in users over 12 months 'dying', and in general 1 in 12 users is significant enough that you should care about it.


Commenters seem to love declaring things to be dying, and market-share trends are only tangentially related (see e.g. the "Netcraft confirms it" meme).

I personally feel like software is dead once it is no longer useful and there is no likelihood of that changing in the foreseeable future. That's obviously not the case with Firefox.

On the other hand I've seen exaggerated reports of the demise of a project that hasn't had a git commit in the last week, or isn't commercially relevant for whatever the commenter is interested in. Even this doesn't seem to me to be true for Firefox, but I don't think the accusations will stop, or the signal-to-noise will increase.


This is basic trolling, you’re making Mozilla’s point here…



