Optimized Go gzip/zip packages, 30-50% faster

andrewstuart2 · on July 28, 2015

Is there any reason you're not approaching the team for inclusion into stdlib (for more than just the crc32 bit)? If it's really drop-in and passes all tests then I can imagine it would be an easy sell.

https://groups.google.com/forum/#!forum/golang-dev

on July 28, 2015

[deleted]

danieldk · on July 28, 2015

Does that really matter? It uses conditional compilation (only on amd64 CPUs) and even if it does not have optimized assembly for other CPUs, just having the benefit on amd64 alone is useful.

jasode · on July 28, 2015

I'm not criticizing the programmer's effort but whenever I see a new project as a "drop in replacement" and/or "50% to 3x faster", the first thing I always look for is the test data to verify that new bugs weren't introduced.

I went to the test subfolder:

https://github.com/klauspost/compress/tree/master/testdata

I saw 3 files.

The kind of test data I'd like to see for something like a compression library would be:

0 byte files

1 byte files

2,147,483,647 byte file

2,147,483,648 byte file

2,147,483,649 byte file

(to check for bugs around signed 32-bit integers)

4,294,967,295 byte file

4,294,967,296 ...

4,294,967,297 ...

(to check for bugs around unsigned 32-bit integers and see if 64bit was used when necessary for correctness)

Also add files of sizes that straddle the boundary of algorithm's internal block buffers (e.g. 64kb or whatever). Add permutations of the above files filled with all zeros 0x00 and all ones 0xFF. I'm sure I'm forgetting a bunch of other edge cases.

The programmer may have done a wonderful job and all the code may be 100% correct. Unfortunately, I can't trust a library replacement unless I also trust the test bed of data that was used to check for defects. It's very common for performance optimizations to introduce new bugs so there has to be an extensive suite of regression tests to help detect them. Test data for bug detection has a different purpose than benchmark data showing speed improvements.

Those multi-gigabyte files are not git repository friendly so perhaps a compromise would be a small utility program to generate the test files as necessary.

pdq · on July 28, 2015

I'm in agreement he should use more test cases, but it looks like he used the same as Golang's compression tests [1]. Note he has other test cases in the subdirectories (also the same as Go) [2].

However I believe even better testing would be to fuzz it for a few hours [3].

[1] http://golang.org/src/compress/testdata/

[2] https://github.com/klauspost/compress/tree/master/zip/testda...

[3] https://github.com/dvyukov/go-fuzz

on July 28, 2015

[deleted]

jasode · on July 28, 2015

>It's a stream-oriented format. There are no file-global offsets or lengths. There are some relative lengths and offsets, but they're going to be 8 or 16 bit values. Most of the tests you're suggesting don't look particularly useful.

I don't think that streaming vs file is relevant. The deflate algorithm (stream) may have no global knowledge of the eventual file size but the gzip file format does.

Anyways, even with a cursory google search, we can find a compression bug related to file sizes:

https://bugs.openjdk.java.net/browse/JDK-6419239

As for an example of streaming bugs, zlib's creator Mark Adler reported that Microsoft coded its deflate implementation incorrectly.

http://stackoverflow.com/a/11435898

The lesson is that re-implementing even well-known algorithms such as zlib/gzip/pkzip is not trivial.

quotedmycode · on July 28, 2015

It was also changed to only look for 4-byte or more minimum matches. "Better compression" is claimed, but I don't see any benchmarks to indicate the size of compressed files before and after.

0xdeadbeefbabe · on July 28, 2015

> I'm not criticizing the programmer's effort

Yes, you are not criticizing the programmer's effort. That's what a pull request is for. A pull request with a failing test in it would be some very concise criticism.

tosseraccount · on July 28, 2015

Don't they really need to compare this with the baseline C implementation and Intel-optimized and CloudFlare-optimized zlib ?

mc_hammer · on July 28, 2015

curious what flags was the c version compiled with? optimized or non? i guess x86 or 64 bit also?

oofabz · on July 28, 2015

He didn't benchmark any C implementations, just Go.

mc_hammer · on July 29, 2015

oh i thought it was versus the c implementation

thanks