Ergodic Unobservable MDPs: Decidability of Approximation

Raimundo Saona

Highlights 2024

Ergodic Unobservable MDPs: Decidability of Approximation

Conference Abstract 14h00-15h03 Session 8: Probabilistic Systems Logic in Computer Science · Theoretical Computer Science

Details

Abstract

Unobservable Markov decision processes (UMDPs) serve as a prominent mathematical framework for modeling sequential decision-making problems and they are equivalent to Probabilistic Finite Automata. A key aspect in computational analysis is decidability. In general, the computation of the exact and approximated values is undecidable for UMDPs, even with reachability objectives. We look into a decidable subclass for the long-run average objective. Building on matrix product theory and ergodic properties, we introduce a novel subclass of UMDPs, termed ergodic UMDPs. In this class, as in general ergodic systems, the initial state of the dynamic is irrelevant after a long time. Our main result is that approximating the value within this subclass is decidable. Moreover, at the same time, the exact problem remains undecidable. Finally, we discuss the primary challenges of extending these results to partially observable Markov decision processes. Co-authors: Krishnendu Chatterjee, David Lurie, Bruno Ziliotto.

Authors

Raimundo Saona Institute of Science and Technology Austria

Keywords

No keywords are indexed for this paper.

Context

Venue: Highlights of Logic, Games and Automata
Archive span: 2013-2025
Indexed papers: 1236
Paper id: 891852122083756789