Judgment aggregation, discursive dilemma and reflective equilibrium: Neural language models as self-improving doxastic agents

Authors :
Gregor Betz
Kyle Richardson
Source :
Frontiers in Artificial Intelligence, Vol 5 (2022)
Publication Year :
2022
Publisher :
Frontiers Media S.A., 2022.

Abstract

Neural language models (NLMs) are susceptible to producing inconsistent output. This paper proposes a new diagnosis as well as a novel remedy for NLMs' incoherence. We train NLMs on synthetic text corpora that are created by simulating text production in a society. For diagnostic purposes, we explicitly model the individual belief systems of artificial agents (authors) who produce corpus texts. NLMs, trained on those texts, can be shown to aggregate the judgments of individual authors during pre-training according to sentence-wise vote ratios (roughly, reporting frequencies), which inevitably leads to so-called discursive dilemmas: aggregate judgments are inconsistent even though all individual belief states are consistent. As a remedy for such inconsistencies, we develop a self-training procedure—inspired by the concept of reflective equilibrium—that effectively reduces the extent of logical incoherence in a model's belief system, corrects global mis-confidence, and eventually allows the model to settle on a new, epistemically superior belief state. Thus, social choice theory helps to understand why NLMs are prone to produce inconsistencies; epistemology suggests how to get rid of them.
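
The aggregation mechanism the abstract diagnoses can be illustrated with the classic discursive-dilemma setup from judgment aggregation theory: three authors each hold consistent views on p, q, and their conjunction, yet sentence-wise majority voting over the propositions yields an inconsistent aggregate. Below is a minimal sketch in Python; the agent profiles and helper functions are illustrative assumptions, not code from the paper.

# Minimal sketch of a discursive dilemma under sentence-wise majority
# aggregation, as described in the abstract. Agent profiles and helper
# names are illustrative, not taken from the paper.

# Three consistent individual belief states over p, q, and p & q.
agents = {
    "author_1": {"p": True,  "q": True,  "p_and_q": True},
    "author_2": {"p": True,  "q": False, "p_and_q": False},
    "author_3": {"p": False, "q": True,  "p_and_q": False},
}

def vote_ratio(sentence):
    """Fraction of authors asserting the sentence (its reporting frequency)."""
    votes = [beliefs[sentence] for beliefs in agents.values()]
    return sum(votes) / len(votes)

# Sentence-wise aggregation: accept a sentence iff a majority asserts it.
aggregate = {s: vote_ratio(s) > 0.5 for s in ["p", "q", "p_and_q"]}

def is_consistent(beliefs):
    """Check that the judgment on the conjunction matches the conjuncts."""
    return beliefs["p_and_q"] == (beliefs["p"] and beliefs["q"])

for beliefs in agents.values():
    assert is_consistent(beliefs)   # every individual author is consistent

print(aggregate)                    # {'p': True, 'q': True, 'p_and_q': False}
print(is_consistent(aggregate))     # False: the aggregate judgment is incoherent

On this diagnosis, an NLM pre-trained on a corpus produced by such authors would come to assert p and q while denying their conjunction, because it tracks sentence-wise vote ratios rather than any single author's coherent belief state.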

Details

Language :
English
ISSN :
2624-8212
Volume :
5
Database :
Directory of Open Access Journals
Journal :
Frontiers in Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
edsdoj.648bd9e8d18f43798a47ee3a1b2914ef
Document Type :
Article
Full Text :
https://doi.org/10.3389/frai.2022.900943