Space-Time Memory Networks for Video Object Segmentation With User Guidance
- Source :
- IEEE Transactions on Pattern Analysis and Machine Intelligence. 44:442-455
- Publication Year :
- 2022
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2022.
Abstract
- We propose a novel and unified solution for user-guided video object segmentation tasks. In this work, we consider two scenarios of user-guided segmentation: semi-supervised and interactive segmentation. Due to the nature of the problem, the available cues, namely video frame(s) with object masks (or scribbles), become richer with the intermediate predictions (or additional user inputs). However, existing methods cannot fully exploit this rich source of information. We resolve the issue by leveraging memory networks and learning to read relevant information from all available sources. In the semi-supervised scenario, the previous frames with object masks form an external memory, and the current frame, as the query, is segmented using the information in the memory. Similarly, to work with user interactions, the frames that are given user inputs form the memory that guides segmentation. Internally, the query and the memory are densely matched in the feature space, covering all the space-time pixel locations in a feed-forward fashion. The abundant use of the guidance information allows us to better handle challenges such as appearance changes and occlusions. We validate our method on the latest benchmark sets and achieve state-of-the-art performance along with a fast runtime.
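The dense query-to-memory matching described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the similarity function or the normalization, so the dot-product similarity and softmax weighting below are assumptions, and the names `memory_read`, `query_keys`, `memory_keys`, and `memory_values` are hypothetical. Each query pixel is compared against every space-time location in the memory (all memory frames flattened into one list) and reads back a weighted mix of the stored values.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def memory_read(query_keys, memory_keys, memory_values):
    """Return one read-out vector per query location.

    query_keys:    Q key vectors, one per pixel of the current (query) frame.
    memory_keys:   M key vectors, one per space-time location of the memory
                   (every pixel of every memory frame, flattened).
    memory_values: M value vectors aligned with memory_keys.

    Assumption: similarity is a dot product normalized by softmax; the
    abstract only says the query and memory are "densely matched".
    """
    if len(memory_keys) != len(memory_values):
        raise ValueError("memory keys and values must align one-to-one")
    value_dim = len(memory_values[0])
    read = []
    for q in query_keys:
        # Compare this query pixel with every space-time memory location.
        scores = [sum(a * b for a, b in zip(q, k)) for k in memory_keys]
        weights = softmax(scores)
        # Weighted sum of memory values, one component at a time.
        read.append([
            sum(w * v[d] for w, v in zip(weights, memory_values))
            for d in range(value_dim)
        ])
    return read


# Toy memory with two locations whose values act as one-hot labels.
toy_keys = [[10.0, 0.0], [0.0, 10.0]]
toy_values = [[1.0, 0.0], [0.0, 1.0]]
toy_read = memory_read([[10.0, 0.0], [0.0, 0.0]], toy_keys, toy_values)
```

In this toy case the first query closely matches the first memory location and reads back almost exactly its value, while the second query is equally similar to both locations and reads back their average. Because every query location attends to every memory location in a single pass, adding more memory frames only lengthens the flattened memory lists.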
- Subjects :
- business.industry
Computer science
Applied Mathematics
Feature vector
Feature extraction
Frame (networking)
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
02 engineering and technology
Object (computer science)
Visualization
Computational Theory and Mathematics
Artificial Intelligence
0202 electrical engineering, electronic engineering, information engineering
Benchmark (computing)
020201 artificial intelligence & image processing
Segmentation
Computer vision
Computer Vision and Pattern Recognition
Artificial intelligence
business
Software
Auxiliary memory
Details
- ISSN :
- 1939-3539 and 0162-8828
- Volume :
- 44
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Pattern Analysis and Machine Intelligence
- Accession number :
- edsair.doi.dedup.....d1ede30823f8e2054ddd28785d80c556