Buckets:

alucchi's picture
|
download
raw
506 Bytes

dict-auto-xz (AutoZip)

Approach:

  • Detect bytes not present in enwik8.
  • Substitute high-gain XML/wiki/text patterns with those single-byte codes.
  • Compress the transformed stream with tuned xz/lzma2.
  • Decompress by xz decode + inverse substitution table (subs.json).

Backend xz options:

  • -9e --lzma2=dict=512MiB,nice=273,mf=bt4,mode=normal,lc=4,lp=0,pb=0

Roundtrip:

  • Verified byte-identical output against shared_resources/enwik8.

Score:

  • See results.json (archive + decompressor.zip).

Xet Storage Details

Size:
506 Bytes
·
Xet hash:
99ba2cca2126c3a8d1a9e9f5a38dd15b0ede687e6dbfb9e8c7f1a50a347d5300

Xet efficiently stores files, intelligently splitting them into unique chunks and accelerating uploads and downloads. More info.