Efficient Compression of Long Arbitrary Sequences With No Reference at the Encoder

From MaRDI portal
Publication:5151688

DOI10.1109/TIT.2020.3023945zbMATH Open1465.94043arXiv2002.09893OpenAlexW3086976386MaRDI QIDQ5151688

Yuval Cassuto, Jacob Ziv

Publication date: 22 February 2021

Published in: IEEE Transactions on Information Theory (Search for Journal in Brave)

Abstract: In a distributed information application an encoder compresses an arbitrary vector while a similar reference vector is available to the decoder as side information. For the Hamming-distance similarity measure, and when guaranteed perfect reconstruction is required, we present two contributions to the solution of this problem. One result shows that when a set of potential reference vectors is available to the encoder, lower compression rates can be achieved when the set satisfies a certain clustering property. Another result reduces the best known decoding complexity from exponential in the vector length n to O(n1.5) by generalized concatenation of inner coset codes and outer error-correcting codes. One potential application of the results is the compression of DNA sequences, where similar (but not identical) reference vectors are shared among senders and receivers.


Full work available at URL: https://arxiv.org/abs/2002.09893






Related Items (1)






This page was built for publication: Efficient Compression of Long Arbitrary Sequences With No Reference at the Encoder

Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q5151688)