A Globally Optimal Algorithm for TTD-MDPs

Sooraj Bhat; David L. Roberts; Mark J. Nelson; Charles L. Isbell; Michael Mateas

Back to AAMAS

AAMAS 2007

A Globally Optimal Algorithm for TTD-MDPs

Conference Paper Cooperative Distributed Problem Solving Autonomous Agents and Multiagent Systems

PDF

Abstract

In this paper, we discuss the use of Targeted Trajectory Distribution Markov Decision Processes (TTD-MDPs)–a variant of MDPs in which the goal is to realize a specified distribution of trajectories through a state space–as a general agent-coordination framework.

Authors

Keywords

Markov decision processes
interactive entertainment
convex optimization

Context

Venue: International Conference on Autonomous Agents and Multiagent Systems
Archive span: 2002-2025
Indexed papers: 7403
Paper id: 500441461627321198