Stick to your role! Stability of personal values expressed in large language models.
- Source :
- PLoS ONE, 8/26/2024, Vol. 19, Issue 8, p1-20. 20p.
- Publication Year :
- 2024
Abstract
- The standard way to study Large Language Models (LLMs) through benchmarks or psychology questionnaires is to provide many different queries from similar minimal contexts (e.g. multiple choice questions). However, due to LLMs' highly context-dependent nature, conclusions from such minimal-context evaluations may reveal little about the model's behavior in deployment (where it will be exposed to many new contexts). We argue that context-dependence should be studied as another dimension of LLM comparison alongside others such as cognitive abilities, knowledge, or model size. In this paper, we present a case study of the stability of value expression over different contexts (simulated conversations on different topics), measured using a standard psychology questionnaire (PVQ) and downstream behavioral tasks. We consider 21 LLMs from six families. Reusing methods from psychology, we study Rank-order stability on the population (interpersonal) level and Ipsative stability on the individual (intrapersonal) level. We explore two settings: with and without instructing LLMs to simulate particular personalities. We observe similar trends in the stability of models and model families (Mixtral, Mistral, GPT-3.5, and Qwen being more stable than LLaMa-2 and Phi) across those two settings, two different simulated populations, and even three downstream behavioral tasks. When instructed to simulate particular personas, LLMs exhibit low Rank-order stability, and this stability further diminishes with conversation length. This highlights the need for future research on LLMs that can coherently simulate a diversity of personas, and on more thorough and efficient ways to study context-dependence. This paper provides a foundational step in that direction and, to our knowledge, is the first study of value stability in LLMs. The project website with code is available at https://sites.google.com/view/llmvaluestability. [ABSTRACT FROM AUTHOR]
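As a rough illustration of the two stability notions named in the abstract, the sketch below computes a population-level rank-order estimate and an individual-level ipsative estimate from per-value scores collected in two contexts. This is not the authors' implementation (their code is linked from the project website); the array names, shapes, and the choice of Spearman/Pearson correlations are assumptions for illustration only.

# Illustrative sketch only (assumed setup, not the authors' implementation):
# scores_a and scores_b hold per-value scores for the same simulated personas
# measured in two contexts, shape (n_personas, n_values).
import numpy as np
from scipy.stats import pearsonr, spearmanr

def rank_order_stability(scores_a, scores_b):
    """Population (interpersonal) level: for each value dimension, correlate
    personas' scores across the two contexts, then average over dimensions."""
    corrs = [spearmanr(scores_a[:, v], scores_b[:, v])[0]
             for v in range(scores_a.shape[1])]
    return float(np.mean(corrs))

def ipsative_stability(scores_a, scores_b):
    """Individual (intrapersonal) level: for each persona, correlate its whole
    value profile across the two contexts, then average over personas."""
    corrs = [pearsonr(scores_a[p], scores_b[p])[0]
             for p in range(scores_a.shape[0])]
    return float(np.mean(corrs))

# Toy usage with random data: 50 personas, 10 PVQ-style value dimensions.
rng = np.random.default_rng(0)
scores_a = rng.normal(size=(50, 10))
scores_b = scores_a + rng.normal(scale=0.5, size=(50, 10))  # noisy re-measurement
print(rank_order_stability(scores_a, scores_b))
print(ipsative_stability(scores_a, scores_b))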
- Subjects :
- *LANGUAGE models
*GENERATIVE pre-trained transformers
*PSYCHOLOGY
*QUESTIONNAIRES
Details
- Language :
- English
- ISSN :
- 1932-6203
- Volume :
- 19
- Issue :
- 8
- Database :
- Academic Search Index
- Journal :
- PLoS ONE
- Publication Type :
- Academic Journal
- Accession number :
- 179262690
- Full Text :
- https://doi.org/10.1371/journal.pone.0309114