Back to Search Start Over

Learning a convolutional neural network for fractional interpolation in HEVC inter coding

Authors :
Li Song
Xiaokang Yang
Han Zhang
Zhengyi Luo
Source :
VCIP
Publication Year :
2017
Publisher :
IEEE, 2017.

Abstract

Motion compensated prediction (MCP) is an effective technology for video coding to improve compression efficiency. Fractional sample precision prediction is utilized in HEVC to further remove temporal redundancy, and finite impulse response (FIR) filters designed using decomposition of the discrete cosine transform are applied to generate samples that do not fall on the integer positions. However, the coefficients of these DCT-based interpolation filters are fixed, which may not be able to adapt to varied video content. Inspired by the remarkable success of convolutional neural network (CNN) in the single image super-resolution task, we propose to learn a convolutional neural network for fractional interpolation in HEVC inter prediction. Compared with super-resolution, there is one big difference in fractional interpolation — fractional interpolation needs to maintain samples at integer positions while super-resolution generates a whole high-resolution image. Another difference is no real ground truth is available in fractional interpolation process. To overcome these two challenges, we introduce a constraint strategy to the training phase of the original super-resolution network as well as a specially designed preprocessing step which reuses the DCTIF interpolation process. Unlike other previous work, our proposed approach simultaneously generating the fractional positions from one network and experimental results show our proposed approach achieves 0.45% BD-Rate reduction under the low-delay-P configuration on average.

Details

Database :
OpenAIRE
Journal :
2017 IEEE Visual Communications and Image Processing (VCIP)
Accession number :
edsair.doi...........e16a0a50a0590ea1bcc354ebe27b2c28