On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
From MaRDI portal
Publication:6369504
arXiv2106.02782MaRDI QIDQ6369504
Peilin Liu, Rendong Ying, Chao Ma, Fei Wen, Zeyu Yan
Publication date: 4 June 2021
Abstract: Lossy compression algorithms are typically designed to achieve the lowest possible distortion at a given bit rate. However, recent studies show that pursuing high perceptual quality would lead to increase of the lowest achievable distortion (e.g., MSE). This paper provides nontrivial results theoretically revealing that, extit{1}) the cost of achieving perfect perception quality is exactly a doubling of the lowest achievable MSE distortion, extit{2}) an optimal encoder for the "classic" rate-distortion problem is also optimal for the perceptual compression problem, extit{3}) distortion loss is unnecessary for training a perceptual decoder. Further, we propose a novel training framework to achieve the lowest MSE distortion under perfect perception constraint at a given bit rate. This framework uses a GAN with discriminator conditioned on an MSE-optimized encoder, which is superior over the traditional framework using distortion plus adversarial loss. Experiments are provided to verify the theoretical finding and demonstrate the superiority of the proposed training framework.
Has companion code repository: https://github.com/ZeyuYan/Perceptual-Lossy-Compression
This page was built for publication: On Perceptual Lossy Compression: The Cost of Perceptual Reconstruction and An Optimal Training Framework
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6369504)