Back to Search
Start Over
Learning-Based Just-Noticeable-Quantization- Distortion Modeling for Perceptual Video Coding
- Source :
- IEEE Transactions on Image Processing. 27:3178-3193
- Publication Year :
- 2018
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2018.
-
Abstract
- Conventional predictive video coding-based approaches are reaching the limit of their potential coding efficiency improvements, because of severely increasing computation complexity. As an alternative approach, perceptual video coding (PVC) has attempted to achieve high coding efficiency by eliminating perceptual redundancy, using just-noticeable-distortion (JND) directed PVC. The previous JNDs were modeled by adding white Gaussian noise or specific signal patterns into the original images, which were not appropriate in finding JND thresholds due to distortion with energy reduction. In this paper, we present a novel discrete cosine transform-based energy-reduced JND model, called ERJND, that is more suitable for JND-based PVC schemes. Then, the proposed ERJND model is extended to two learning-based just-noticeable-quantization-distortion (JNQD) models as preprocessing that can be applied for perceptual video coding. The two JNQD models can automatically adjust JND levels based on given quantization step sizes. One of the two JNQD models, called LR-JNQD, is based on linear regression and determines the model parameter for JNQD based on extracted handcraft features. The other JNQD model is based on a convolution neural network (CNN), called CNN-JNQD. To our best knowledge, our paper is the first approach to automatically adjust JND levels according to quantization step sizes for preprocessing the input to video encoders. In experiments, both the LR-JNQD and CNN-JNQD models were applied to high efficiency video coding (HEVC) and yielded maximum (average) bitrate reductions of 38.51% (10.38%) and 67.88% (24.91%), respectively, with little subjective video quality degradation, compared with the input without preprocessing applied.
- Subjects :
- business.industry
Computer science
Quantization (signal processing)
020206 networking & telecommunications
Pattern recognition
02 engineering and technology
Computer Graphics and Computer-Aided Design
Convolutional neural network
symbols.namesake
Additive white Gaussian noise
Distortion
0202 electrical engineering, electronic engineering, information engineering
Discrete cosine transform
symbols
020201 artificial intelligence & image processing
Artificial intelligence
business
Encoder
Software
Subjective video quality
Coding (social sciences)
Subjects
Details
- ISSN :
- 19410042 and 10577149
- Volume :
- 27
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Image Processing
- Accession number :
- edsair.doi.dedup.....0b90add88d30cf30345a19de44657804