Google Scholar

Metrics for Markov decision processes with infinite state spaces

N Ferns, P Panangaden, D Precup - arXiv preprint arXiv:1207.1386, 2012 - arxiv.org

arXiv preprint arXiv:1207.1386, 2012•arxiv.org

We present metrics for measuring state similarity in Markov decision processes (MDPs) with
infinitely many states, including MDPs with continuous state spaces. Such metrics provide a
stable quantitative analogue of the notion of bisimulation for MDPs, and are suitable for use
in MDP approximation. We show that the optimal value function associated with a
discounted infinite horizon planning task varies continuously with respect to our metric
distances.

We present metrics for measuring state similarity in Markov decision processes (MDPs) with infinitely many states, including MDPs with continuous state spaces. Such metrics provide a stable quantitative analogue of the notion of bisimulation for MDPs, and are suitable for use in MDP approximation. We show that the optimal value function associated with a discounted infinite horizon planning task varies continuously with respect to our metric distances.

arxiv.org

Show moreShow less

Save Cite Cited by 91 Related articles All 13 versions View as HTML

Showing the best result for this search. See all results

Cite

Advanced search

Saved to My library

Metrics for Markov decision processes with infinite state spaces