
AAAI 2026

Specification-Guided Reinforcement Learning

Short Paper · AAAI Doctoral Consortium Track · Artificial Intelligence

Abstract

While Reinforcement Learning (RL) has demonstrated remarkable success in solving complex sequential decision-making problems, its application in real-world, safety-critical systems is hindered by its reliance on carefully engineered reward functions. Designing effective rewards is notoriously challenging and can lead to unintended or unsafe behaviors, a phenomenon known as reward hacking. Specification-guided RL has emerged as a principled alternative, leveraging formal methods to directly encode high-level objectives, safety requirements, and behavioral constraints. However, the practical utility of this approach is often limited by coarse or under-specified logical formulas and the computational challenge of enforcing safety at scale. This thesis addresses these limitations by developing a unified framework for the automated refinement, scalable enforcement, and flexible adaptation of formal specifications in RL.
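To make the idea of encoding objectives formally rather than through hand-engineered rewards concrete, the sketch below uses a reward-machine-style automaton for the task "eventually reach the goal while never touching a hazard". This is an illustrative assumption, not the thesis's actual framework: the state names (`task`, `done`, `fail`), the labels, and the reward values are all hypothetical; the point is only that the reward signal is derived mechanically from the specification's transitions rather than tuned per-state.

```python
# Hypothetical reward machine for "eventually goal, never hazard".
# Automaton states: "task" (in progress), "done" (accepted), "fail" (violated).
TRANSITIONS = {
    ("task", "goal"): "done",
    ("task", "hazard"): "fail",
}

def step_spec(u, label):
    """Advance the automaton on the propositional label of the current MDP state."""
    return TRANSITIONS.get((u, label), u)

def spec_reward(u_next):
    """Reward derived from the specification, not engineered per environment state."""
    if u_next == "done":
        return 1.0
    if u_next == "fail":
        return -1.0
    return 0.0

def run_episode(labels):
    """Fold a trace of labels into an automaton verdict and cumulative spec reward."""
    u, total = "task", 0.0
    for lab in labels:
        u = step_spec(u, lab)
        total += spec_reward(u)
        if u in ("done", "fail"):
            break
    return u, total
```

Because the reward is a function of the automaton's transitions, satisfying traces are rewarded and violating traces penalized without any per-state reward engineering, which is the failure mode that invites reward hacking.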


Context

Venue
AAAI Conference on Artificial Intelligence
Paper id
576542733907035594