An Embodied Generalist Agent in 3D World.

AllImages Videos News Maps Shopping Books

[2311.12871] An Embodied Generalist Agent in 3D World - arXiv

Nov 18, 2023 · We introduce LEO, an embodied multi-modal generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world.

An Embodied Generalist Agent in 3D World

embodied-generalist.github.io

We introduce an embodied multi-modal and multi-task generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world.

An Embodied Generalist Agent in 3D World - OpenReview

openreview.net › forum

We introduce an embodied multi-modal and multi-task generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world.

[ICML 2024] Official code repository for 3D embodied generalist agent ...

github.com › embodied-generalist › emb...

We introduce LEO, an embodied multi-modal generalist agent capable of grounding, reasoning, chatting, planning, and acting in the 3D world.

An Embodied Generalist Agent in 3D World - arXiv

arxiv.org › html

May 1, 2024 · This work introduces LEO, an embodied multi-modal generalist agent designed to extend machine learning capabilities into the 3D realm, marking a ...

Paper page - An Embodied Generalist Agent in 3D World - Hugging Face

huggingface.co › papers

Nov 17, 2023 · Our proposed agent, referred to as LEO, is trained with shared LLM-based model architectures, objectives, and weights in two stages: (i) 3D ...

People also search for

Siyuan Huang

An embodied generalist agent in 3d world pdf

An embodied generalist agent in 3d world github

Generalist Embodied Agent Research

3d-llm: injecting the 3d world into large language models

3d-vista: pre-trained transformer for 3d vision and text alignment.

[PDF] An Embodied Generalist Agent in 3D World - OpenReview

openreview.net › pdf

To this end, we introduce LEO, an embodied multi- modal generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the. 3D world.

[PDF] An Embodied Generalist Agent in 3D World - Semantic Scholar

www.semanticscholar.org › paper

This work introduces LEO, an embodied multi-modal generalist agent that excels in perceiving, grounding, reasoning, planning, and acting in the 3D world and ...

Meet LEO: An Embodied Generalist Agent Excelling in 3D World Tasks

syncedreview.com › ... › November › 26

Nov 26, 2023 · LEO stands as a pioneering embodiment of a generalist agent, showcasing remarkable capabilities in navigating and interacting within the 3D world.

LEO: An embodied generalist agent in 3D world and Beyond - ICML 2025

icml.cc › virtual

Jul 26, 2024 · Keynote Talk in Workshop: Multi-modal Foundation Model meets Embodied AI (MFM-EAI) LEO: An embodied generalist agent in 3D world and Beyond.

People also search for

PhyScene: Physically Interactable 3D scene synthesis for embodied AI

multiply: a multisensory object-centric embodied large language model in 3d world

Large language models as generalizable policies for embodied tasks

3D-VLA

vision-language foundation models as effective robot imitators