Learning a convolutional neural network for fractional interpolation in HEVC inter coding

Authors :: Li Song
Xiaokang Yang
Han Zhang
Zhengyi Luo
Source :: VCIP
Publication Year :: 2017
Publisher :: IEEE, 2017.
Abstract: Motion compensated prediction (MCP) is an effective technology for video coding to improve compression efficiency. Fractional sample precision prediction is utilized in HEVC to further remove temporal redundancy, and finite impulse response (FIR) filters designed using decomposition of the discrete cosine transform are applied to generate samples that do not fall on the integer positions. However, the coefficients of these DCT-based interpolation filters (DCTIF) are fixed, so they may not adapt well to varied video content. Inspired by the remarkable success of convolutional neural networks (CNNs) in the single image super-resolution task, we propose to learn a convolutional neural network for fractional interpolation in HEVC inter prediction. Compared with super-resolution, fractional interpolation differs in one important way: it must preserve the samples at integer positions, whereas super-resolution generates a whole high-resolution image. Another difference is that no real ground truth is available in the fractional interpolation process. To overcome these two challenges, we introduce a constraint strategy to the training phase of the original super-resolution network as well as a specially designed preprocessing step which reuses the DCTIF interpolation process. Unlike previous work, our proposed approach generates the fractional positions simultaneously from one network. Experimental results show that it achieves a 0.45% BD-rate reduction on average under the low-delay-P configuration.
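
For readers unfamiliar with the fixed filters the abstract refers to, the following is a minimal sketch of one-dimensional DCTIF interpolation using the standard HEVC 8-tap luma coefficients. It is not the authors' network or training code. The function name `interpolate_row`, the floating-point arithmetic, and the edge-replication padding are assumptions made for illustration; an actual HEVC codec works in integer arithmetic with rounding shifts and applies the filters separably in two dimensions. The sketch shows the property the abstract emphasizes: integer-position samples pass through unchanged and only the fractional positions are generated.

```python
import numpy as np

# HEVC luma DCTIF taps (8-tap, normalized by 64), keyed by fractional phase
# in quarter-sample units: 1 = quarter, 2 = half, 3 = three-quarter.
DCTIF_TAPS = {
    1: np.array([-1, 4, -10, 58, 17, -5, 1, 0]) / 64.0,
    2: np.array([-1, 4, -11, 40, 40, -11, 4, -1]) / 64.0,
    3: np.array([0, 1, -5, 17, 58, -10, 4, -1]) / 64.0,
}

def interpolate_row(row):
    """Upsample a 1-D row of integer-position samples by a factor of 4.

    Hypothetical helper for illustration only (float arithmetic, 1-D).
    """
    row = np.asarray(row, dtype=np.float64)
    n = row.size
    # Replicate edges so each window covers samples i-3 .. i+4.
    padded = np.pad(row, (3, 4), mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, 8)  # shape (n, 8)
    out = np.empty(4 * n)
    out[0::4] = row  # integer positions are copied, never re-estimated
    for phase, taps in DCTIF_TAPS.items():
        out[phase::4] = windows @ taps  # fixed, content-independent filter
    return out

ramp = np.arange(16) * 8.0
up = interpolate_row(ramp)
```

Because the taps are constants, every block of every sequence is interpolated with the same weights, which is the limitation the abstract sets out to address with a learned network.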

Subjects :: Ground truth
Finite impulse response
Computer science
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
020207 software engineering
02 engineering and technology
Convolutional neural network
Superresolution
Redundancy (information theory)
0202 electrical engineering, electronic engineering, information engineering
Discrete cosine transform
020201 artificial intelligence & image processing
Algorithm
Coding (social sciences)
Interpolation

Details

Database :: OpenAIRE
Journal :: 2017 IEEE Visual Communications and Image Processing (VCIP)
Accession number :: edsair.doi...........e16a0a50a0590ea1bcc354ebe27b2c28

