Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

Wei, Yao; Sun, Yanchao; Zheng, Ruijie; Vemprala, Sai; Bonatti, Rogerio; Chen, Shuhang; Madaan, Ratnesh; Ba, Zhongjie; Kapoor, Ashish; Ma, Shuang

Computer Science > Artificial Intelligence

arXiv:2307.07909 (cs)

[Submitted on 16 Jul 2023 (v1), last revised 9 Oct 2023 (this version, v3)]

Title:Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

Authors:Yao Wei, Yanchao Sun, Ruijie Zheng, Sai Vemprala, Rogerio Bonatti, Shuhang Chen, Ratnesh Madaan, Zhongjie Ba, Ashish Kapoor, Shuang Ma

View PDF

Abstract:We introduce DualMind, a generalist agent designed to tackle various decision-making tasks that addresses challenges posed by current methods, such as overfitting behaviors and dependence on task-specific fine-tuning. DualMind uses a novel "Dual-phase" training strategy that emulates how humans learn to act in the world. The model first learns fundamental common knowledge through a self-supervised objective tailored for control tasks and then learns how to make decisions based on different contexts through imitating behaviors conditioned on given prompts. DualMind can handle tasks across domains, scenes, and embodiments using just a single set of model weights and can execute zero-shot prompting without requiring task-specific fine-tuning. We evaluate DualMind on MetaWorld and Habitat through extensive experiments and demonstrate its superior generalizability compared to previous techniques, outperforming other generalist agents by over 50$\%$ and 70$\%$ on Habitat and MetaWorld, respectively. On the 45 tasks in MetaWorld, DualMind achieves over 30 tasks at a 90$\%$ success rate.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2307.07909 [cs.AI]
	(or arXiv:2307.07909v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2307.07909

Submission history

From: Yao Wei [view email]
[v1] Sun, 16 Jul 2023 00:34:12 UTC (10,731 KB)
[v2] Tue, 18 Jul 2023 16:05:00 UTC (10,731 KB)
[v3] Mon, 9 Oct 2023 08:07:00 UTC (11,897 KB)

Computer Science > Artificial Intelligence

Title:Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators