1. A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters
- Author
-
J. Dinesh Peter, D. Raveena Judie Dolly, and G. Josemin Bala
- Subjects
Motion compensation ,Multidisciplinary ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,020206 networking & telecommunications ,02 engineering and technology ,Translation (geometry) ,Affine shape adaptation ,Transformation (function) ,Motion estimation ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Computer vision ,Artificial intelligence ,Affine transformation ,business ,Group of pictures ,Data compression ,Mathematics - Abstract
Video compression marks its necessity when a huge sized video needs to be transmitted. The process starts with the identification of GoP (group of pictures), which depends on I- (intra), B- (bidirectional) and P- (predicted) frames determination. GoP is fixed, where consecutive frames are placed in an orderly manner based on the GoP size. Conventionally, B-frames lead to buffering of memory within the past and future frames consuming more computational time. Such issues are handled by an adaptive framework for determining frames based on matching criteria rather than fixed GoP. NSEW (North–South–East–West) affine translation (NAT) is proposed for replacing B with either I- or P-frame. The proposed framework involves video compression using affine motion-based free-form transformation and video decompression using warping methodologies for the purpose of compressing and decompressing the video sequence, based on the resulted I- and P-frames. B-spline transformation was also initiated at local level along with global affine transformation to improve the subjective quality of the decompressed video sequence. The methodology was investigated for the file size, computational time, peak-signal-to-noise ratio (PSNR) and Structural Similarity index (SSIM), which proved the superiority of the proposed technique. Further, the methodology was also investigated with optimizing the affine motion parameters (AMP) using nonlinear least squares, Broyden–Fletcher–Goldfarb–Shanno (BFGS) and limited-memory BFGS which yet again proved to be far more superior to conventional techniques with an average PSNR of 38.98 dB with LBFGS. To further improve the subjective quality, affine B-spline-based motion estimation using LBFGS was implemented and observed the average PSNR gain to be 42.03 dB.
- Published
- 2017