Deep Bimodal Regression of Apparent Personality Traits from Short Video Sequences
- Source :
- IEEE Transactions on Affective Computing. 9:303-315
- Publication Year :
- 2018
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2018.
Abstract
- Apparent personality analysis (APA) is an important problem in personality computing, and automatic APA has become a popular and challenging topic in computer vision and multimedia. In this paper, we propose a deep learning solution to APA from short video sequences. To capture rich information from both the visual and audio modalities of videos, we tackle this task with our Deep Bimodal Regression (DBR) framework. In DBR, for the visual modality, we modify traditional convolutional neural networks to exploit important visual cues. For the audio modality, with model efficiency in mind, we extract audio representations and build a linear regressor. To combine the complementary information from the two modalities, we ensemble the predicted regression scores by both early fusion and late fusion. Finally, based on the proposed framework, we present a solution for the Apparent Personality Analysis competition track of the ChaLearn Looking at People challenge held in association with ECCV 2016. Our DBR won first place in this challenge, which had 86 registered participants. Beyond the competition, we further investigate the performance of different loss functions in our visual models, and prove that non-convex loss functions for regression are optimal on the human-labeled video data.
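The abstract describes a linear regressor for the audio modality and late fusion of per-modality regression scores. The sketch below illustrates that general idea only; it is not the authors' implementation. The function names, the least-squares fit, the equal fusion weight, and the assumption that trait scores lie in [0, 1] are all hypothetical choices made for illustration, and early fusion is not shown.

```python
import numpy as np


def fit_linear_regressor(features, targets):
    """Fit a least-squares linear map (with bias) from features to trait scores.

    Hypothetical helper: the record does not say how the audio regressor is trained.
    """
    # Append a constant column so the last row of the weights acts as a bias.
    design = np.hstack([features, np.ones((features.shape[0], 1))])
    weights, *_ = np.linalg.lstsq(design, targets, rcond=None)
    return weights


def predict_linear(weights, features):
    """Apply the fitted linear map to new feature rows."""
    design = np.hstack([features, np.ones((features.shape[0], 1))])
    return design @ weights


def late_fusion(visual_scores, audio_scores, visual_weight=0.5):
    """Blend per-modality predictions with a weighted average.

    The 0.5 default and the clipping to [0, 1] are assumptions, not values
    taken from the paper.
    """
    fused = visual_weight * visual_scores + (1.0 - visual_weight) * audio_scores
    return np.clip(fused, 0.0, 1.0)
```

In use, `visual_scores` would come from the visual model and `audio_scores` from `predict_linear`, each shaped as (videos, traits), so the fused output keeps the same shape.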
- Subjects :
- Communication
Modality (human–computer interaction)
Speech recognition
Deep learning
Networking & telecommunications
Engineering and technology
Convolutional neural network
Regression
Visualization
Human-Computer Interaction
Electrical engineering, electronic engineering, information engineering
Personality
Artificial intelligence & image processing
Artificial intelligence
Psychology
Association (psychology)
Sensory cue
Software
Details
- ISSN :
- 2371-9850
- Volume :
- 9
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Affective Computing
- Accession number :
- edsair.doi...........004a4fe7c554fd89818fde249ac179fa
- Full Text :
- https://doi.org/10.1109/taffc.2017.2762299