Back to Search Start Over

The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces

Authors :
El-Shangiti, Ahmed Oumar
Hiraoka, Tatsuya
AlQuabeh, Hilal
Heinzerling, Benjamin
Inui, Kentaro
Publication Year :
2024

Abstract

This paper investigates whether large language models (LLMs) utilize numerical attributes encoded in a low-dimensional subspace of the embedding space when answering logical comparison questions (e.g., Was Cristiano born before Messi?). We first identified these subspaces using partial least squares regression, which effectively encodes the numerical attributes associated with the entities in comparison prompts. Further, we demonstrate causality by intervening in these subspaces to manipulate hidden states, thereby altering the LLM's comparison outcomes. Experimental results show that our findings hold for different numerical attributes, indicating that LLMs utilize the linearly encoded information for numerical reasoning.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2410.13194
Document Type :
Working Paper