AAMAS 2007
A Globally Optimal Algorithm for TTD-MDPs
Abstract
In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)–a variant of MDPs in which the goal is to realize a specified distribution of trajectories through a state space–as a general agent-coordination framework.
Authors
Keywords
Context
- Venue
- International Conference on Autonomous Agents and Multiagent Systems
- Archive span
- 2002-2025
- Indexed papers
- 7403
- Paper id
- 500441461627321198