Back to Search
Start Over
Semantic Perceptual Image Compression With a Laplacian Pyramid of Convolutional Networks
- Source :
- IEEE transactions on image processing : a publication of the IEEE Signal Processing Society. 30
- Publication Year :
- 2021
-
Abstract
- The existing image compression methods usually choose or optimize low-level representation manually. Actually, these methods struggle for the texture restoration at low bit rates. Recently, deep neural network (DNN)-based image compression methods have achieved impressive results. To achieve better perceptual quality, generative models are widely used, especially generative adversarial networks (GAN). However, training GAN is intractable, especially for high-resolution images, with the challenges of unconvincing reconstructions and unstable training. To overcome these problems, we propose a novel DNN-based image compression framework in this paper. The key point is decomposing an image into multi-scale sub-images using the proposed Laplacian pyramid based multi-scale networks. For each pyramid scale, we train a specific DNN to exploit the compressive representation. Meanwhile, each scale is optimized with different aspects, including pixel, semantics, distribution and entropy, for a good “rate-distortion-perception” trade-off. By independently optimizing each pyramid scale, we make each stage manageable and make each sub-image plausible. Experimental results demonstrate that our method achieves state-of-the-art performance, with advantages over existing methods in providing improved visual quality. Additionally, a better performance in the down-stream visual analysis tasks which are conducted on the reconstructed images, validates the excellent semantics-preserving ability of the proposed method.
- Subjects :
- Pixel
Artificial neural network
Computer science
business.industry
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Pattern recognition
02 engineering and technology
Iterative reconstruction
Computer Graphics and Computer-Aided Design
Image (mathematics)
0202 electrical engineering, electronic engineering, information engineering
Entropy (information theory)
020201 artificial intelligence & image processing
Pyramid (image processing)
Artificial intelligence
Representation (mathematics)
business
Software
Image compression
Subjects
Details
- ISSN :
- 19410042
- Volume :
- 30
- Database :
- OpenAIRE
- Journal :
- IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
- Accession number :
- edsair.doi.dedup.....725af54b38dd8373ccd4eaec3cbb5d71