Language Models Do Not Embed Numbers Continuously (Student Abstract)

Alex Davies; Roussel Nzoyem Ngueguin; Nirav Ajmeri; Telmo de Menezes E Silva Filho

doi:10.1609/aaai.v40i48.42206

AAAI 2026

Short Paper; AAAI Student Abstract and Poster Program; Artificial Intelligence

Abstract

We evaluate how well large language model embeddings represent continuous numerical values across different precisions and ranges. Using linear models and principal component analysis on models from major providers, we show that while embeddings can reconstruct numbers with high fidelity (R² ≥ 0.95), they introduce substantial noise, with principal components explaining less than 40% of embedding variance. Performance degrades with increasing decimal precision and mixed-sign values, revealing fundamental limitations in how these models encode numerical information.
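
The two measurements named in the abstract, a linear probe scored by R² and the share of variance captured by principal components, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' protocol: the embeddings are synthetic (one hypothetical "number direction" plus isotropic Gaussian noise) rather than taken from any language model, and the sample size, dimension, noise level, and function names are all chosen only for the example.

```python
import numpy as np


def linear_probe_r2(train_x, train_y, test_x, test_y):
    """Fit ordinary least squares with an intercept and return held-out R^2."""
    a_train = np.hstack([train_x, np.ones((len(train_x), 1))])
    a_test = np.hstack([test_x, np.ones((len(test_x), 1))])
    coef, *_ = np.linalg.lstsq(a_train, train_y, rcond=None)
    residual = test_y - a_test @ coef
    ss_res = float(residual @ residual)
    ss_tot = float(((test_y - test_y.mean()) ** 2).sum())
    return 1.0 - ss_res / ss_tot


def pca_explained_variance_ratio(x):
    """Return the fraction of total variance along each principal component."""
    centered = x - x.mean(axis=0)
    singular_values = np.linalg.svd(centered, compute_uv=False)
    variances = singular_values ** 2
    return variances / variances.sum()


# Synthetic stand-in for model embeddings (an assumption for illustration):
# each number moves the embedding along one unit direction, and every
# coordinate also receives independent Gaussian noise.
rng = np.random.default_rng(0)
n, dim, noise_std = 2000, 64, 0.05
values = rng.uniform(0.0, 1.0, size=n)
direction = rng.normal(size=dim)
direction /= np.linalg.norm(direction)
embeddings = np.outer(values, direction) + rng.normal(scale=noise_std, size=(n, dim))

# Probe on a held-out split so the score is not inflated by overfitting.
split = 1500
r2 = linear_probe_r2(
    embeddings[:split], values[:split], embeddings[split:], values[split:]
)
ratios = pca_explained_variance_ratio(embeddings)

print(f"held-out R^2: {r2:.3f}")
print(f"variance explained by PC1: {ratios[0]:.3f}")
```

In this toy setting the probe recovers the number well while the first principal component accounts for only a minority of the total variance, because the noise is spread across all 64 dimensions. That shows how the two metrics can diverge. It says nothing about whether real model embeddings behave this way for the same reason.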

Context

Venue: AAAI Conference on Artificial Intelligence
Archive span: 1980-2026
Indexed papers: 28718
Paper id: 402384783562888645