EAAI Journal 2026 Journal Article
A comprehensive survey on table question answering: Datasets, methods and future directions
- Weiqiang Xu
- Yang Liu
- Lingfeng Lu
- Huakang Li
- Guozi Sun
In the data-centric era of Industry 4. 0, a vast amount of critical information — ranging from machining sensor logs and material synthesis recipes to financial statements — is stored in structured tabular formats. Table Question Answering (TableQA) aims to automatically interpret tabular data and provide precise answers to natural language (NL) queries, serving as a vital interface for intelligent engineering systems. This survey provides a comprehensive review of the table question answering landscape, bridging theoretical advances with practical engineering applications. Methodologically, we propose a unified taxonomy that categorizes datasets and approaches into table-only and non-table-only paradigms. We systematically trace the technical evolution from early rule-based semantic parsing and pre-trained language models (PLMs) to recent large language models (LLMs), highlighting innovations in numerical reasoning and cross-modal alignment. From an engineering perspective, we critically evaluate how these techniques are applied to solve domain-specific challenges, such as predictive maintenance in smart manufacturing, property extraction in material informatics, and decision support in business intelligence. Furthermore, going beyond academic benchmarks, we analyze pressing constraints for industrial deployment, including real-time inference latency, system reliability, and verification in safety-critical environments. Finally, we outline future research directions for building robust, verifiable, and computationally efficient table question answering systems across various industrial domains.