×
Nov 18, 2023 · We introduce LEO, an embodied multi-modal generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world.
We introduce an embodied multi-modal and multi-task generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world.
We introduce an embodied multi-modal and multi-task generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world.
We introduce LEO, an embodied multi-modal generalist agent capable of grounding, reasoning, chatting, planning, and acting in the 3D world.
May 1, 2024 · This work introduces LEO, an embodied multi-modal generalist agent designed to extend machine learning capabilities into the 3D realm, marking a ...
Nov 17, 2023 · Our proposed agent, referred to as LEO, is trained with shared LLM-based model architectures, objectives, and weights in two stages: (i) 3D ...
To this end, we introduce LEO, an embodied multi- modal generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the. 3D world.
This work introduces LEO, an embodied multi-modal generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world and ...
Nov 26, 2023 · LEO stands as a pioneering embodiment of a generalist agent, showcasing remarkable capabilities in navigating and interacting within the 3D world.
Jul 26, 2024 · Keynote Talk in Workshop: Multi-modal Foundation Model meets Embodied AI (MFM-EAI) LEO: An embodied generalist agent in 3D world and Beyond.