Bridging Language, Vision and Action in 3D Environments

1st Workshop on 3D-LLM/VLA | CVPR 2025 | June 12th, 2025 | Nashville, TN, USA

Workshop Overview

This workshop addresses a critical gap in current AI research by focusing on the integration of language and 3D perception, which is essential for developing embodied agents and robots, especially considering the recent rise of multimodal LLMs and vision-language-action (VLA) models.

The workshop will explore challenges and opportunities in this area, providing a platform for researchers to share their work, discuss future directions, and foster collaboration across disciplines including robotics, computer vision, natural language processing, and human-computer interaction.

Topics Include:

  • Integration of language and 3D perception
  • Large language models (LLMs) for 3D environment understanding
  • 3D vision-language-action (VLA) models
  • Embodied agents that integrate language, vision, and action
  • 3D scene understanding and generation with language
  • Robot control and navigation using natural language
  • Multimodal learning for embodied AI

Important Dates

Paper Submission

April 20, 2025 (23:59)

Notification

May 7, 2025

Camera Ready

May 20, 2025

Workshop Date

June 12th, 2025 (Morning Session)

Call for Papers

Overview

We invite submissions of papers related to the integration of language and 3D perception, with a focus on developing embodied agents and robots. Topics of interest include, but are not limited to:

  • Language-guided 3D perception and understanding
  • Large language models (LLMs) for 3D environment understanding
  • 3D vision-language-action (VLA) models
  • Embodied agents that integrate language, vision, and action
  • 3D scene understanding and generation with language
  • Robot control and navigation using natural language
  • Multimodal learning for embodied AI
  • Datasets and benchmarks for 3D-LLMs and 3D-VLAs
  • Applications of 3D-LLMs and 3D-VLAs in real-world scenarios
  • Ethical considerations in developing embodied AI systems

Submission Guidelines

Papers can be submitted in any major conference's format, with a length of 2-8 pages (excluding references). All submissions will be peer-reviewed, and accepted papers will be presented at the workshop either as oral presentations or posters.

Submissions should be made through the OpenReview submission system.

Important Dates

  • Paper Submission Deadline: April 20, 2025 (23:59)
  • Decision Notification: May 7, 2025
  • Camera Ready Deadline: May 20, 2025
  • Workshop Date: June 12th, 2025 (Morning Session)

Review Process

All submissions will undergo a double-blind review process. Please ensure that your submission does not contain any information that can identify the authors.

Publication

The workshop will be non-archival. Authors of accepted papers retain the full copyright of their work and are free to submit extended versions to conferences or journals.

Schedule

Half-day workshop at CVPR 2025, June 12th Morning Session, Nashville, TN, USA

Note: The schedule is tentative and subject to change. All times are in Central Daylight Time (CDT).

8:00 - 8:30

Poster Setup & Early Morning Poster Session

Authors set up posters and early attendees can browse

8:30 - 8:40

Opening Remarks

Welcome and introduction to the workshop

8:40 - 9:05
Jitendra Malik

Keynote 1: Prof. Jitendra Malik

UC Berkeley / Meta

Vision, Robotics

9:05 - 9:30
Angel X. Chang

Keynote 2: Prof. Angel X. Chang

Simon Fraser University

Language and 3D Grounding

9:30 - 9:55
Dieter Fox

Keynote 3: Prof. Dieter Fox

University of Washington / NVIDIA

Robotics

9:55 - 10:10

Coffee Break

10:10 - 10:35
Katerina Fragkiadaki

Keynote 4: Prof. Katerina Fragkiadaki

Carnegie Mellon University

3D Scene Understanding

10:35 - 11:00
Siyuan Huang

Keynote 5: Dr. Siyuan Huang

BIGAI

Language and 3D Grounding

11:00 - 11:25
Yilun Du

Keynote 6: Dr. Yilun Du

Harvard University

Generative AI, Embodied AI

11:25 - 11:50
Andy Zeng

Keynote 7: Dr. Andy Zeng

Google / Generalist AI

Robotics

11:50 - 12:20

Closing Poster Session

Continued discussion with authors at their posters

12:20 - 12:30

Closing Remarks

Summary and future directions

Keynote Speakers

Leading experts in 3D perception, language, and robotics

Organizing Committee

Meet the team behind the 3D-LLM/VLA Workshop

Contact Us

Have questions? We're here to help

Email

For general inquiries:

jianingy@umich.edu

syqian@meta.com

Paper Submissions

Submit your papers via:

OpenReview Submission System

Workshop Location

CVPR 2025

Nashville, TN, USA

Exact venue details will be announced closer to the event

Frequently Asked Questions

What is the paper submission deadline?

The paper submission deadline is April 20, 2025 (23:59).

Is the workshop in-person or virtual?

The workshop will be held in-person at CVPR 2025 in Nashville, TN, USA.

Are the workshop papers archival?

No, the workshop will be non-archival. Authors of accepted papers retain the full copyright of their work and are free to submit extended versions to conferences or journals.

What is the maximum page length for submissions?

Papers can be submitted in any major conference's format, with a length of 2-8 pages (excluding references).