Back to Search Start Over

Optical Flow-Based Spatiotemporal Sketch for Video Representation: A Novel Framework

Authors :
Du, Qiyuan
Duan, Yiping
Xie, Zhipeng
Tao, Xiaoming
Shi, Linsu
Jin, Zhijuan
Source :
IEEE Transactions on Circuits and Systems for Video Technology; August 2024, Vol. 34 Issue: 8 p6963-6977, 15p
Publication Year :
2024

Abstract

With the rapid development of multimedia services and the dramatic growth of video data volume, efficient video representation and AI-generated content (AIGC) become critical parts of future multimedia communication systems. Sketch graph is a structured abstraction of key textures in an image, and video sketch graph further exploits the temporal continuity of videos to achieve a sparse representation. Sketch-based representation has potential applications in communication systems for both human subjective perception and machine vision tasks, and provides a new idea for AIGC. However, current video sketch extraction methods rely on human assistance and correction, and cannot be applied to end-to-end communication systems. We design a novel framework for spatiotemporal sketch extraction based on deep learning methods. In the proposed framework, sketch extraction and sparse coding are performed at the sender side using structural and temporal features of the video. The original videos are generatively reconstructed at the receiver side or applied to downstream machine vision tasks. We validate the performance of the proposed method on Cityscapes dataset with different metrics. Experiments show that our proposed framework can be end-to-end adapted to video communication tasks in different scenarios and can achieve efficient video characterization and transmission. Moreover, our proposed method enables sketch-based end-to-end AIGC for video generation.

Details

Language :
English
ISSN :
10518215 and 15582205
Volume :
34
Issue :
8
Database :
Supplemental Index
Journal :
IEEE Transactions on Circuits and Systems for Video Technology
Publication Type :
Periodical
Accession number :
ejs67162655
Full Text :
https://doi.org/10.1109/TCSVT.2023.3349130