Reinforcement learning agents for interacting with humans

I Shapira, A Azaria - Proceedings of the Annual Meeting of the …, 2022 - escholarship.org

Proceedings of the Annual Meeting of the Cognitive Science Society, 2022 • escholarship.org

We tackle the problem of an agent interacting with humans in a general-sum environment, i.e., a non-zero-sum, not fully cooperative setting, where the agent's goal is to increase its own utility. We show that when data is limited, building an accurate human model is very challenging, and that a reinforcement learning agent based on this data does not perform well in practice. Therefore, we propose that the agent should try to maximize a linear combination of the human's utility and its own utility rather than only its own utility.
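
The linear-combination objective described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact parameterization, so the weight `human_weight`, the function names, and the toy payoffs below are all assumptions made for illustration, not the authors' implementation.

```python
def combined_reward(agent_utility: float, human_utility: float, human_weight: float) -> float:
    """Linear combination of the agent's own utility and the human's utility.

    human_weight is a hypothetical parameter: 0.0 recovers a purely
    self-interested objective, larger values give the human's utility more say.
    """
    return agent_utility + human_weight * human_utility


def best_action(actions: dict, human_weight: float) -> str:
    """Return the action whose (agent, human) utility pair maximizes the combined reward."""
    return max(actions, key=lambda name: combined_reward(*actions[name], human_weight))


# Hypothetical one-shot payoffs: action -> (agent utility, human utility).
ACTIONS = {
    "greedy": (5.0, 0.0),
    "fair": (4.0, 4.0),
}

if __name__ == "__main__":
    for weight in (0.0, 0.5):
        print(weight, best_action(ACTIONS, weight))
```

With a weight of 0.0 the sketch selects the action that is best for the agent alone; with a weight of 0.5 it selects the action that also benefits the human. In a reinforcement learning setting, the same combined value would presumably stand in for the agent's reward signal during training, though that detail is not stated in the abstract.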
