
A Mathematical Interpretation of Autoregressive Generative Pre-Trained Transformer and Self-Supervised Learning.

Authors :
Lee, Minhyeok
Source :
Mathematics (2227-7390). Jun 2023, Vol. 11, Issue 11, p. 2451. 19 p.
Publication Year :
2023

Abstract

In this paper, we present a rigorous mathematical examination of generative pre-trained transformer (GPT) models and their autoregressive self-supervised learning mechanisms. We begin by defining natural language space and knowledge space, two key concepts for understanding the dimensionality reduction process in GPT-based large language models (LLMs). By exploring projection functions and their inverses, we establish a framework for analyzing the language generation capabilities of these models. We then investigate the GPT representation space, examining its implications for the models' approximation properties. Finally, we discuss the limitations and challenges of GPT models and their learning mechanisms, considering trade-offs between complexity and generalization, as well as the implications of incomplete inverse projection functions. Our findings demonstrate that GPT models can encode knowledge into low-dimensional vectors through their autoregressive self-supervised learning mechanism. This comprehensive analysis provides a solid mathematical foundation for future advancements in GPT-based LLMs; by improving the understanding and optimization of model training and performance, it promises gains in natural language processing tasks such as language translation, text summarization, and question answering. [ABSTRACT FROM AUTHOR]
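To make the abstract's projection framework and learning objective concrete, the following LaTeX sketch gives one plausible formalization; the symbols L (natural language space), K (knowledge space), pi, and the loss J are illustrative assumptions and may not match the paper's exact notation.

% Hypothetical formalization; notation assumed, not taken from the paper.
% \mathcal{L} -- natural language space (token sequences),
% \mathcal{K} \subset \mathbb{R}^{d} -- low-dimensional knowledge space.
\documentclass{article}
\usepackage{amsmath,amssymb}
\begin{document}

A projection $\pi$ maps text to knowledge vectors, and an (incomplete)
inverse $\pi^{-1}$ maps knowledge vectors back to text:
\[
  \pi : \mathcal{L} \to \mathcal{K} \subset \mathbb{R}^{d},
  \qquad
  \pi^{-1} : \mathcal{K} \to \mathcal{L},
  \qquad
  \pi^{-1}\bigl(\pi(x)\bigr) \approx x .
\]

The autoregressive self-supervised objective trains the model
$p_{\theta}$ to predict each token from its prefix:
\[
  \mathcal{J}(\theta)
  = - \sum_{t=1}^{T} \log p_{\theta}\!\left(x_{t} \mid x_{1}, \dots, x_{t-1}\right).
\]

\end{document}

Minimizing this objective over large corpora is what, in the abstract's terms, drives the encoding of knowledge into low-dimensional vectors; the incompleteness of $\pi^{-1}$ corresponds to the discussed limitations in recovering text from those vectors.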

Details

Language :
English
ISSN :
2227-7390
Volume :
11
Issue :
11
Database :
Academic Search Index
Journal :
Mathematics (2227-7390)
Publication Type :
Academic Journal
Accession number :
164217743
Full Text :
https://doi.org/10.3390/math11112451