Back to Search Start Over

An overview of large AI models and their applications

Authors :
Xiaoguang Tu
Zhi He
Yi Huang
Zhi-Hao Zhang
Ming Yang
Jian Zhao
Source :
Visual Intelligence, Vol 2, Iss 1, Pp 1-22 (2024)
Publication Year :
2024
Publisher :
Springer, 2024.

Abstract

Abstract In recent years, large-scale artificial intelligence (AI) models have become a focal point in technology, attracting widespread attention and acclaim. Notable examples include Google’s BERT and OpenAI’s GPT, which have scaled their parameter sizes to hundreds of billions or even tens of trillions. This growth has been accompanied by a significant increase in the amount of training data, significantly improving the capabilities and performance of these models. Unlike previous reviews, this paper provides a comprehensive discussion of the algorithmic principles of large-scale AI models and their industrial applications from multiple perspectives. We first outline the evolutionary history of these models, highlighting milestone algorithms while exploring their underlying principles and core technologies. We then evaluate the challenges and limitations of large-scale AI models, including computational resource requirements, model parameter inflation, data privacy concerns, and specific issues related to multi-modal AI models, such as reliance on text-image pairs, inconsistencies in understanding and generation capabilities, and the lack of true “multi-modality”. Various industrial applications of these models are also presented. Finally, we discuss future trends, predicting further expansion of model scale and the development of cross-modal fusion. This study provides valuable insights to inform and inspire future future research and practice.

Details

Language :
English
ISSN :
27319008
Volume :
2
Issue :
1
Database :
Directory of Open Access Journals
Journal :
Visual Intelligence
Publication Type :
Academic Journal
Accession number :
edsdoj.3b8a041a510a4c18bbc5f47d3ee3abb4
Document Type :
article
Full Text :
https://doi.org/10.1007/s44267-024-00065-8