Reward Bound for Behavioral Guarantee of Model-based Planning Agents

An, Zhiyu; Ding, Xianzhong; Du, Wan

Computer Science > Artificial Intelligence

arXiv:2402.13419 (cs)

[Submitted on 20 Feb 2024]

Title:Reward Bound for Behavioral Guarantee of Model-based Planning Agents

Authors:Zhiyu An, Xianzhong Ding, Wan Du

View PDF HTML (experimental)

Abstract:Recent years have seen an emerging interest in the trustworthiness of machine learning-based agents in the wild, especially in robotics, to provide safety assurance for the industry. Obtaining behavioral guarantees for these agents remains an important problem. In this work, we focus on guaranteeing a model-based planning agent reaches a goal state within a specific future time step. We show that there exists a lower bound for the reward at the goal state, such that if the said reward is below that bound, it is impossible to obtain such a guarantee. By extension, we show how to enforce preferences over multiple goals.

Comments:	To be published in ICLR 24 tiny paper track
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.13419 [cs.AI]
	(or arXiv:2402.13419v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2402.13419

Submission history

From: Zhiyu An [view email]
[v1] Tue, 20 Feb 2024 23:17:07 UTC (36 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.AI

< prev | next >

new | recent | 2024-02

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Artificial Intelligence

Title:Reward Bound for Behavioral Guarantee of Model-based Planning Agents

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Reward Bound for Behavioral Guarantee of Model-based Planning Agents

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators