Jointly Learning the Attributes and Composition of Shots for Boundary Detection in Videos

Authors :: Linning Xu
Libiao Jin
Anyi Rao
Dahua Lin
Xuekun Jiang
Source :: IEEE Transactions on Multimedia. 24:3049-3059
Publication Year :: 2022
Publisher :: Institute of Electrical and Electronics Engineers (IEEE), 2022.
Abstract: In filmmaking, the shot has a profound influence on how movie content is delivered and how it resonates with audiences: different emotions and content can be conveyed through well-designed camera movements or shot editing. Therefore, in pursuit of a high-level understanding of long videos, accurate shot detection from untrimmed videos should be considered the first and most fundamental step. Existing approaches address this problem based on the visual differences and content transitions between consecutive frames, while ignoring intrinsic shot attributes, viz., camera movements, scales, and viewing angles, which essentially reveal how each shot is created. In this work, we propose a new learning framework (SCTSNet) for shot boundary detection that jointly recognizes the attributes and composition of shots in videos. To facilitate the analysis of shots and the evaluation of shot detection models, we collect a large-scale shot boundary dataset, MovieShots2, which contains 15K shots from 282 movie clips. It is richly annotated with the temporal boundaries between consecutive shots and with their shot attributes, including camera movements, scales, and viewing angles, which are the three most distinct shot attributes. Our experiments show that the joint learning framework significantly boosts boundary detection performance: SCTSNet improves shot boundary detection AP from 0.65 to 0.77.

Subjects :: Boundary detection
Film making
Computer science
business.industry
Shot (filmmaking)
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Boundary (topology)
Computer Science Applications
Margin (machine learning)
ComputerApplications_MISCELLANEOUS
Signal Processing
Media Technology
Computer vision
Artificial intelligence
Electrical and Electronic Engineering
business
Composition (language)

Details

ISSN :: 19410077 and 15209210
Volume :: 24
Database :: OpenAIRE
Journal :: IEEE Transactions on Multimedia
Accession number :: edsair.doi...........ae63caf4a7db24a492a61b6b862e2a25
