3D 还是 2D？哥大李昀烛：通用机器人基础模型的解药在“中间地带” | ICRA 2026

6/8/2026

·~6 min·6/8/2026·zh·10

Quick Answer

Yunzhu Li from Columbia University proposes 'Structured World Models' as a scalable data engine for robot training, merging 3D physics with 2D data to overcome data collection costs and enhance robot adaptability.

Quick Take

This approach addresses the limitations of end-to-end models lacking physical understanding and traditional physics engines constrained by observational conditions.

Key Points

Structured World Models serve as infinite data engines for robot policy training.
Combining 3D physics priors with vast 2D data is crucial for overcoming data bottlenecks.
Digital twins enable efficient testing and evaluation without extensive real-world data collection.
Pure AI-generated simulations can match real-world performance in complex tasks.
The approach has potential applications in various robotic operations, enhancing adaptability.

Source Excerpt

作者｜岑峰2026年6月1日，机器人领域最重要的学术会议国际机器人与自动化会议（ICRA）在奥地利维也纳召开。在首日举行的“Synthetic Data for Robot Learning” Workshop上，哥伦比亚大学助理教授李昀烛（Yunzhu Li）发表了题为“Structured World Models as Scalable Data Enginesfor Robot Policy Training and Evaluation”的演讲，直击了当今具身智能领域面临的核心痛点：真实物理交互数据采集成本极高，且模型试错与评估极其困难。为此，他提出将结构化世界模型（Structured World Models）作为机器人策略训练与评估的“无限数据引擎”。演讲指出，纯端到端大模型缺乏物理常识，而纯物理引擎又受限于严苛的观测条件。团队从而开辟了一条融合两者优势的“中间路线”：总结而言，将3D物理先验与海量2D数据学习深度融合，是突破机器人基础模型（Foundation Models）数据瓶颈的必由之路。

（编者按：雷峰网·AI科技评论此前在《MIT具身智能达人志》一文中有提及李昀烛亲历 Learning 深刻改变机器人领域的经历，MIT博士毕业后，李昀烛在哥伦比亚大学任职推进世界模型与多模态感知。 …

Read on leiphone.com

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from 雷峰网机器人

See more →

雷峰网机器人

1d ago

FeaturedOriginal

WAIC深度观察：具身数据，进入「开箱即用」时代

AI Summary

The WAIC forum highlights the urgent need for 'Model-Ready Data' in embodied intelligence, emphasizing high precision, multi-modality, and diversity as essential for effective physical interaction. Companies like JianZhi Robotics are pioneering solutions to bridge existing data gaps, ensuring that data is directly usable without further processing.

#Robotics #Open Source #AI Startup