Back to Search Start Over

How Different AI Chatbots Behave? Benchmarking Large Language Models in Behavioral Economics Games

Authors :
Xie, Yutong
Liu, Yiyao
Ma, Zhuang
Shi, Lin
Wang, Xiyuan
Yuan, Walter
Jackson, Matthew O.
Mei, Qiaozhu
Publication Year :
2024

Abstract

The deployment of large language models (LLMs) in diverse applications requires a thorough understanding of their decision-making strategies and behavioral patterns. As a supplement to a recent study on the behavioral Turing test, this paper presents a comprehensive analysis of five leading LLM-based chatbot families as they navigate a series of behavioral economics games. By benchmarking these AI chatbots, we aim to uncover and document both common and distinct behavioral patterns across a range of scenarios. The findings provide valuable insights into the strategic preferences of each LLM, highlighting potential implications for their deployment in critical decision-making roles.<br />Comment: Presented at The First Workshop on AI Behavioral Science (AIBS 2024)

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2412.12362
Document Type :
Working Paper