Encoding of probability distributions for Asymmetric Numeral Systems
Publication:6370078
arXiv: 2106.06438; MaRDI QID: Q6370078
Jarek Duda
Publication date: 11 June 2021
Abstract: Many data compressors regularly encode probability distributions for entropy coding, which requires minimal-description-length-type optimizations. Canonical prefix (Huffman) coding usually just writes the lengths of bit sequences, thereby approximating probabilities with powers of 2. Operating on more accurate probabilities usually allows for better compression ratios, and is possible using, e.g., arithmetic coding or the Asymmetric Numeral Systems (ANS) family. In particular, the multiplication-free tabled variant of the latter (tANS) builds an automaton that often replaces Huffman coding thanks to better compression at similar computational cost, e.g. in the popular Facebook Zstandard and Apple LZFSE compressors. This paper discusses the encoding of probability distributions for such applications, especially using a Pyramid Vector Quantizer (PVQ)-based approach with deformation, bucket approximation, prefix trees, improving accuracy with additional bits, and also a tuned symbol spread for tANS.
Has companion code repository: https://github.com/JarekDuda/AsymmetricNumeralSystemsToolkit
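The abstract's contrast between power-of-2 approximation and more accurate probabilities can be illustrated with a small sketch. It is not taken from the paper or from the companion repository: the example distribution, the function names, and the greedy rounding rule are all assumptions made for illustration, and the quantizer is a simple stand-in, not the PVQ-based approach the paper discusses. The sketch compares the expected overhead (Kullback-Leibler divergence, in bits per symbol) of coding with Huffman code lengths against coding with probabilities rounded to fractions with a fixed denominator L, which is the kind of distribution a tANS table with L states represents.

```python
import heapq
import math


def kl_bits(p, q):
    """Expected overhead, in bits per symbol, of coding source p with model q."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def huffman_lengths(p):
    """Return Huffman code lengths for probabilities p (textbook heap construction)."""
    # Heap entries: (probability, unique tie-breaker, symbols in this subtree).
    heap = [(pi, i, [i]) for i, pi in enumerate(p)]
    heapq.heapify(heap)
    lengths = [0] * len(p)
    tie = len(p)
    while len(heap) > 1:
        pa, _, a = heapq.heappop(heap)
        pb, _, b = heapq.heappop(heap)
        for s in a + b:  # every symbol in the merged subtree gets one bit deeper
            lengths[s] += 1
        heapq.heappush(heap, (pa + pb, tie, a + b))
        tie += 1
    return lengths


def quantize_to_denominator(p, L):
    """Round p to integer counts summing to L, with every count >= 1.

    Assumed greedy rule (illustrative only): start from rounded values, then
    repeatedly change by one the count whose change costs the fewest bits.
    """
    assert L >= len(p), "need at least one slot per symbol"
    counts = [max(1, round(pi * L)) for pi in p]
    while sum(counts) != L:
        step = 1 if sum(counts) < L else -1
        candidates = [i for i in range(len(p)) if counts[i] + step >= 1]
        # Change in expected code length when counts[i] becomes counts[i] + step.
        best = min(candidates,
                   key=lambda i: p[i] * math.log2(counts[i] / (counts[i] + step)))
        counts[best] += step
    return counts


# Hypothetical example distribution, chosen only for illustration.
p = [0.5, 0.3, 0.15, 0.05]
L = 32

q_huffman = [2.0 ** -n for n in huffman_lengths(p)]
counts = quantize_to_denominator(p, L)
q_quantized = [c / L for c in counts]

print("Huffman lengths:", huffman_lengths(p))
print("Counts for L =", L, ":", counts)
print("Overhead with Huffman lengths (bits/symbol):", kl_bits(p, q_huffman))
print("Overhead with quantized counts (bits/symbol):", kl_bits(p, q_quantized))
```

For this particular distribution the quantized counts give a smaller overhead than the Huffman lengths, but storing the counts in a compressed file's header generally takes more bits than storing the lengths; that trade-off is the minimal-description-length question the abstract refers to.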