Buckets:
dict-auto-xz (AutoZip)
Approach:
- Detect bytes not present in enwik8.
- Substitute high-gain XML/wiki/text patterns with those single-byte codes.
- Compress the transformed stream with tuned xz/lzma2.
- Decompress by xz decode + inverse substitution table (subs.json).
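The steps above can be sketched in Python as follows. This is a minimal illustration, not the actual AutoZip code: the function names, the candidate pattern list, and the JSON layout used for the substitution table are all assumptions (the real subs.json format is not described here), and Python's `lzma` module stands in for the xz binary with default settings.

```python
import json
import lzma


def unused_bytes(data: bytes) -> list[int]:
    """Return the byte values (0-255) that never occur in data."""
    present = set(data)
    return [b for b in range(256) if b not in present]


def substitute(data: bytes, patterns: list[bytes]) -> tuple[bytes, dict[int, bytes]]:
    """Replace each pattern with one unused byte; return stream and table."""
    free = iter(unused_bytes(data))
    table: dict[int, bytes] = {}
    for pattern in patterns:
        # Single-byte or absent patterns cannot shrink the stream.
        if len(pattern) < 2 or pattern not in data:
            continue
        code = next(free, None)
        if code is None:  # ran out of unused byte values
            break
        data = data.replace(pattern, bytes([code]))
        table[code] = pattern
    return data, table


def restore(data: bytes, table: dict[int, bytes]) -> bytes:
    """Inverse substitution: expand every code byte back to its pattern."""
    # Patterns contain no code bytes, so the expansion order does not matter.
    for code, pattern in table.items():
        data = data.replace(bytes([code]), pattern)
    return data


def table_to_json(table: dict[int, bytes]) -> str:
    """Hypothetical subs.json layout: {"<code>": "<pattern as latin-1>"}."""
    return json.dumps({str(c): p.decode("latin-1") for c, p in table.items()})


def table_from_json(text: str) -> dict[int, bytes]:
    return {int(c): p.encode("latin-1") for c, p in json.loads(text).items()}


if __name__ == "__main__":
    sample = b"<page><title>A</title></page>\n" * 4
    encoded, table = substitute(sample, [b"<title>", b"</title>", b"<page>"])
    archive = lzma.compress(encoded)
    decoded = restore(lzma.decompress(archive), table_from_json(table_to_json(table)))
    print(decoded == sample)
```

Because every code byte is absent from the original input and no pattern contains a code byte, expanding the codes always reproduces the input exactly, regardless of the order in which patterns were substituted.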
Backend xz options:
-9e --lzma2=dict=512MiB,nice=273,mf=bt4,mode=normal,lc=4,lp=0,pb=0
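For readers who prefer to reproduce these settings without the xz command line, the same LZMA2 parameters can be expressed as a filter chain for Python's `lzma` module. This is a sketch under assumptions: the helper names are hypothetical, and the mapping assumes that the explicit `--lzma2` options, rather than the `-9e` preset, determine the encoder settings. Note that compressing with a 512 MiB dictionary needs several gigabytes of memory.

```python
import lzma


def lzma2_filters(dict_size: int = 512 * 1024 * 1024) -> list[dict]:
    """Filter chain mirroring dict=512MiB,nice=273,mf=bt4,mode=normal,lc=4,lp=0,pb=0."""
    return [{
        "id": lzma.FILTER_LZMA2,
        "dict_size": dict_size,      # dict=512MiB by default
        "nice_len": 273,             # nice=273
        "mf": lzma.MF_BT4,           # mf=bt4
        "mode": lzma.MODE_NORMAL,    # mode=normal
        "lc": 4,                     # literal context bits
        "lp": 0,                     # literal position bits
        "pb": 0,                     # position bits
    }]


def xz_compress(data: bytes, dict_size: int = 512 * 1024 * 1024) -> bytes:
    """Compress data into an .xz container with the filter chain above."""
    return lzma.compress(data, format=lzma.FORMAT_XZ, filters=lzma2_filters(dict_size))


def xz_decompress(blob: bytes) -> bytes:
    """Decode an .xz stream; the filter chain is read from the stream header."""
    return lzma.decompress(blob, format=lzma.FORMAT_XZ)
```

Setting `pb=0` and `lp=0` tells the encoder not to condition its probabilities on byte alignment, which tends to suit text, while `lc=4` uses more bits of the previous byte as context for literals.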
Roundtrip:
- Verified byte-identical output against shared_resources/enwik8.
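A roundtrip check of this kind can be sketched as a size comparison followed by a SHA-256 comparison. The function names and the decompressed output path shown in the usage note are hypothetical; only shared_resources/enwik8 comes from this description.

```python
import hashlib
import os


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large inputs need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_byte_identical(path_a: str, path_b: str) -> bool:
    """True when both files have the same size and the same SHA-256 digest."""
    if os.path.getsize(path_a) != os.path.getsize(path_b):
        return False
    return sha256_of(path_a) == sha256_of(path_b)
```

For example, `is_byte_identical("enwik8.out", "shared_resources/enwik8")` would confirm that a decompressed file named `enwik8.out` (an assumed name) matches the reference.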
Score:
- See results.json (archive + decompressor.zip).