KnowThyself: An Agentic Assistant for LLM Interpretability

Suraj Prasai; Mengnan Du; Ying Zhang; Fan Yang

doi:10.1609/aaai.v40i48.42373

AAAI 2026

System Paper AAAI Demonstration Track Artificial Intelligence

Abstract

We develop KnowThyself, an agentic assistant that advances large language model (LLM) interpretability. Existing tools provide useful insights but remain fragmented and code-intensive. KnowThyself consolidates these capabilities into a chat-based interface, where users can upload models, pose natural language questions, and obtain interactive visualizations with guided explanations. At its core, an orchestrator LLM first reformulates user queries, an agent router further directs them to specialized modules, and the outputs are finally contextualized into coherent explanations. This design lowers technical barriers and provides an extensible platform for LLM inspection. By embedding the whole process into a conversational workflow, KnowThyself offers a robust foundation for accessible LLM interpretability.
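
The three-stage flow described in the abstract (query reformulation, routing to a specialized module, contextualized explanation) can be illustrated with a minimal Python sketch. This is not the authors' implementation: every name below (`reformulate`, `route`, `contextualize`, the two placeholder modules, and the keyword table) is hypothetical, the orchestrator LLM is replaced by simple string normalization, and the router is a keyword match standing in for whatever routing logic the actual system uses.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class ModuleResult:
    """Output of a (hypothetical) interpretability module."""
    module: str
    payload: str


def reformulate(query: str) -> str:
    """Stand-in for the orchestrator LLM: lowercase and collapse whitespace."""
    return " ".join(query.lower().split())


# Placeholder modules; a real system would run analyses and build visualizations.
def attention_module(query: str) -> ModuleResult:
    return ModuleResult("attention", "attention heatmap placeholder")


def attribution_module(query: str) -> ModuleResult:
    return ModuleResult("attribution", "token attribution placeholder")


MODULES: Dict[str, Callable[[str], ModuleResult]] = {
    "attention": attention_module,
    "attribution": attribution_module,
}

# Assumed routing table: keywords that select each module.
KEYWORDS: Dict[str, Tuple[str, ...]] = {
    "attention": ("attention", "head"),
    "attribution": ("attribution", "saliency", "important"),
}


def route(query: str) -> str:
    """Stand-in for the agent router: pick the first module whose keyword appears."""
    for name, words in KEYWORDS.items():
        if any(word in query for word in words):
            return name
    return "attention"  # arbitrary fallback chosen for this sketch


def contextualize(query: str, result: ModuleResult) -> str:
    """Stand-in for the final step: wrap module output in a readable explanation."""
    return f"Question: {query}\nModule: {result.module}\nResult: {result.payload}"


def answer(user_query: str) -> str:
    """Run the full pipeline: reformulate, route, execute, contextualize."""
    query = reformulate(user_query)
    module_name = route(query)
    result = MODULES[module_name](query)
    return contextualize(query, result)


if __name__ == "__main__":
    print(answer("Which tokens are IMPORTANT for this prediction?"))
```

Under these assumptions, adding a new capability amounts to registering one more entry in `MODULES` and `KEYWORDS`, which mirrors the extensibility claim in the abstract without asserting how the real system achieves it.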

Context

Venue: AAAI Conference on Artificial Intelligence
Paper id: 41004581692301568