1. New Results in End-to-end Image and Video Compression by Deep Learning
- Author
-
A. Murat Tekalp, Gokberk Ozsoy, Ogun Kirmemis, and Melih Yilmaz
- Subjects
Computer science ,business.industry ,Deep learning ,Speech recognition ,Digital video ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Latency (audio) ,Data_CODINGANDINFORMATIONTHEORY ,02 engineering and technology ,010501 environmental sciences ,01 natural sciences ,End-to-end principle ,Compression (functional analysis) ,0202 electrical engineering, electronic engineering, information engineering ,Codec ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Transform coding ,0105 earth and related environmental sciences ,Image compression ,Data compression - Abstract
Expanding ubiquity of high-resolution digital video over the Internet calls for better compression methods to enable streaming with higher compression efficiency and lower latency. Recently, important gains have been achieved in learned image compression by using end-to-end learned models. However, these improvements haven't been fully leveraged in video compression. This paper aims to improve upon work proposed by Lu et al. in CVPR 2019, which has been claimed to outperform conventional video codecs in terms of PSNR and provide some implementation details that are absent in the original paper. Ultimately, we show that modeling latent symbols by Laplacian distribution outperforms the Gaussian assumption used in the original work and also demonstrate in a repeatable fashion that our learned model is superior to x264 video codec in terms of PSNR over a range of compression rates measured by bit-per-pixel.
- Published
- 2020