Back to Search
Start Over
Discriminative Multi-View Dynamic Image Fusion for Cross-View 3-D Action Recognition
- Source :
- IEEE transactions on neural networks and learning systems. 33(10)
- Publication Year :
- 2021
-
Abstract
- Dramatic imaging viewpoint variation is the critical challenge toward action recognition for depth video. To address this, one feasible way is to enhance view-tolerance of visual feature, while still maintaining strong discriminative capacity. Multi-view dynamic image (MVDI) is the most recently proposed 3-D action representation manner that is able to compactly encode human motion information and 3-D visual clue well. However, it is still view-sensitive. To leverage its performance, a discriminative MVDI fusion method is proposed by us via multi-instance learning (MIL). Specifically, the dynamic images (DIs) from different observation viewpoints are regarded as the instances for 3-D action characterization. After being encoded using Fisher vector (FV), they are then aggregated by sum-pooling to yield the representative 3-D action signature. Our insight is that viewpoint aggregation helps to enhance view-tolerance. And, FV can map the raw DI feature to the higher dimensional feature space to promote the discriminative power. Meanwhile, a discriminative viewpoint instance discovery method is also proposed to discard the viewpoint instances unfavorable for action characterization. The wide-range experiments on five data sets demonstrate that our proposition can significantly enhance the performance of cross-view 3-D action recognition. And, it is also applicable to cross-view 3-D object recognition. The source code is available at https://github.com/3huo/ActionView.
- Subjects :
- Image fusion
Source code
Computer Networks and Communications
Computer science
business.industry
media_common.quotation_subject
Feature vector
Cognitive neuroscience of visual object recognition
Pattern recognition
Computer Science Applications
Discriminative model
Action (philosophy)
Artificial Intelligence
Feature (machine learning)
Artificial intelligence
Representation (mathematics)
business
Software
media_common
Subjects
Details
- ISSN :
- 21622388
- Volume :
- 33
- Issue :
- 10
- Database :
- OpenAIRE
- Journal :
- IEEE transactions on neural networks and learning systems
- Accession number :
- edsair.doi.dedup.....3fca657e372b275d35fa2cc079e7eb85