The Tong Test: Evaluating Artificial General Intelligence Through Dynamic Embodied Physical and Social Interactions

Authors :
Yujia Peng
Jiaheng Han
Zhenliang Zhang
Lifeng Fan
Tengyu Liu
Siyuan Qi
Xue Feng
Yuxi Ma
Yizhou Wang
Song-Chun Zhu
Source :
Engineering, Vol 34, Pp 12-22 (2024)
Publication Year :
2024
Publisher :
Elsevier, 2024.

Abstract

The release of the generative pre-trained transformer (GPT) series has brought artificial general intelligence (AGI) to the forefront of the artificial intelligence (AI) field once again. However, the questions of how to define and evaluate AGI remain unclear. This perspective article proposes that the evaluation of AGI should be rooted in dynamic embodied physical and social interactions (DEPSI). More specifically, we propose five critical characteristics to be considered as AGI benchmarks and suggest the Tong test as an AGI evaluation system. The Tong test describes a value- and ability-oriented testing system that delineates five levels of AGI milestones through a virtual environment with DEPSI, allowing for infinite task generation. We contrast the Tong test with classical AI testing systems in terms of various aspects and propose a systematic evaluation system to promote standardized, quantitative, and objective benchmarks and evaluation of AGI.

Details

Language :
English
ISSN :
20958099
Volume :
34
Pages :
12-22
Database :
Directory of Open Access Journals
Journal :
Engineering
Publication Type :
Academic Journal
Accession number :
edsdoj.9930effd6e1448918ea1c251b3b737e7
Document Type :
article
Full Text :
https://doi.org/10.1016/j.eng.2023.07.006