Here are some of my models for data compression research compared against popular compressors gzip and 7-zip. Click on the names to download them as command-line tools. Note that none of these compressors are compatible with the others. All of these compressors are experimental. RH5 and its variants are designed for high speed and modest memory usage, and are meant to be practical solid multi-file archivers.
These benchmarks are with enwik8, a 100 MB text-file from the English-language Wikipedia. More information and a benchmark of some of my compressors can be found on the Large Text Compression Benchmark.
Program | Algorithm | Compressed size (bytes) | Compression time (seconds) | Decompression time (seconds) | Compression memory (MB) | Decompression memory (MB) | Compression speed (MB/sec) | Decompression speed (MB/sec) |
---|---|---|---|---|---|---|---|---|
Original | 100000000 | |||||||
BTCM max | BWT + CM | 20,955,165 | 21.20 | 22.60 | 822 | 657 | 4.72 | 4.42 |
BTCM 8 | BWT + CM | 23,786,763 | 17.26 | 16.80 | 52 | 42 | 5.79 | 5.95 |
CM5 x64 | CM | 25,042,264 | 16.80 | 16.94 | 35 | 35 | 5.95 | 5.90 |
7-zip (normal) | LZMA | 25,899,684 | 72.00 | 1.40 | 186 | 18 | 1.39 | 71.43 |
RH5ba_x64 max | LZMA | 27,510,180 | 17.00 | 4.00 | 130 | 47 | 5.88 | 25.00 |
RH5_x64 max | LZ + ctx | 29,878,256 | 13.20 | 0.53 | 19 | 12 | 7.58 | 188.68 |
ctxn (32-bit) | LZMA | 30,211,251 | 9.00 | 5.00 | 67 | 67 | 11.11 | 20.00 |
RH4_x64 | ROLZ | 31,309,689 | 3.10 | 0.58 | 29 | 25 | 32.26 | 172.41 |
RH5_x64 | LZ + ctx | 31,798,141 | 2.10 | 0.61 | 19 | 12 | 47.62 | 163.93 |
RH5m_x64 | LZ + ctx | 33,638,243 | 3.40 | 0.69 | 2.9 | 1.8 | 29.41 | 144.93 |
gzip -9 | LZ77 | 35,194,719 | 14.00 | 0.94 | 4 | 3 | 7.14 | 106.38 |
gzip | LZ77 | 37,907,623 | 4.10 | 0.97 | 4 | 3 | 24.39 | 103.09 |