1. 'Honey, I Learned to Talk'
- Author
-
Panayiotis G. Georgiou, Haoqi Li, Shao-Yen Tseng, and Brian R. Baucom
- Subjects
Multimodal fusion ,Modality (human–computer interaction) ,Modalities ,business.industry ,Computer science ,media_common.quotation_subject ,Significant difference ,Mean absolute error ,020206 networking & telecommunications ,02 engineering and technology ,Machine learning ,computer.software_genre ,Expression (mathematics) ,Perception ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,psychological phenomena and processes ,media_common - Abstract
In this work we analyze the importance of lexical and acoustic modalities in behavioral expression and perception. We demonstrate that this importance relates to the amount of therapy, and hence communication training, that a person received. It also exhibits some relationship to gender. We proceed to provide an analysis on couple therapy data by splitting the data into clusters based on gender or stage in therapy. Our analysis demonstrates the significant difference between optimal modality weights per cluster and relationship to therapy stage. Given this finding we propose the use of communication-skill aware fusion models to account for these differences in modality importance. The fusion models operate on partitions of the data according to the gender of the speaker or the therapy stage of the couple. We show that while most multimodal fusion methods can improve mean absolute error of behavioral estimates, the best results are given by a model that considers the degree of communication training among the interlocutors.
- Published
- 2018