Back to Search Start Over

Semantic Perceptual Image Compression With a Laplacian Pyramid of Convolutional Networks

Authors :
Mai Xu
Xiaoming Tao
Jianhua Lu
Juan Wang
Yiping Duan
Source :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. 30
Publication Year :
2021

Abstract

The existing image compression methods usually choose or optimize low-level representation manually. Actually, these methods struggle for the texture restoration at low bit rates. Recently, deep neural network (DNN)-based image compression methods have achieved impressive results. To achieve better perceptual quality, generative models are widely used, especially generative adversarial networks (GAN). However, training GAN is intractable, especially for high-resolution images, with the challenges of unconvincing reconstructions and unstable training. To overcome these problems, we propose a novel DNN-based image compression framework in this paper. The key point is decomposing an image into multi-scale sub-images using the proposed Laplacian pyramid based multi-scale networks. For each pyramid scale, we train a specific DNN to exploit the compressive representation. Meanwhile, each scale is optimized with different aspects, including pixel, semantics, distribution and entropy, for a good “rate-distortion-perception” trade-off. By independently optimizing each pyramid scale, we make each stage manageable and make each sub-image plausible. Experimental results demonstrate that our method achieves state-of-the-art performance, with advantages over existing methods in providing improved visual quality. Additionally, a better performance in the down-stream visual analysis tasks which are conducted on the reconstructed images, validates the excellent semantics-preserving ability of the proposed method.

Details

ISSN :
19410042
Volume :
30
Database :
OpenAIRE
Journal :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Accession number :
edsair.doi.dedup.....725af54b38dd8373ccd4eaec3cbb5d71