Back to Search Start Over

Multimodal AutoML via Representation Evolution

Authors :
Blaž Škrlj
Matej Bevec
Nada Lavrač
Source :
Machine Learning and Knowledge Extraction; Volume 5; Issue 1; Pages: 1-13
Publication Year :
2022
Publisher :
MDPI AG, 2022.

Abstract

With the increasing amounts of available data, learning simultaneously from different types of inputs is becoming necessary to obtain robust and well-performing models. With the advent of representation learning in recent years, lower-dimensional vector-based representations have become available for both images and texts, while automating simultaneous learning from multiple modalities remains a challenging problem. This paper presents an AutoML (automated machine learning) approach to automated machine learning model configuration identification for data composed of two modalities: texts and images. The approach is based on the idea of representation evolution, the process of automatically amplifying heterogeneous representations across several modalities, optimized jointly with a collection of fast, well-regularized linear models. The proposed approach is benchmarked against 11 unimodal and multimodal (texts and images) approaches on four real-life benchmark datasets from different domains. It achieves competitive performance with minimal human effort and low computing requirements, enabling learning from multiple modalities in automated manner for a wider community of researchers.

Details

ISSN :
25044990
Volume :
5
Database :
OpenAIRE
Journal :
Machine Learning and Knowledge Extraction
Accession number :
edsair.doi.dedup.....5cfc821eb3a7e36381d694d5fda51e62
Full Text :
https://doi.org/10.3390/make5010001