Exclusive | Peking University's Dong Hao: "A Scaling Law That Stops at the Data Dimension Cannot Teach a General-Purpose Robot"

6/16/2026

Quick Answer

Peking University's Dong Hao advocates a two-dimensional Scaling Law for embodied AI that scales data volume and task count together, tying world models, generated data, and human demonstration into one pipeline so that robots learn more tasks while using less data.

Quick Take

This approach aims to reduce data usage while increasing task success rates, crucial for the widespread deployment of general-purpose humanoid robots.

Key Points

Current imitation and reinforcement learning methods have significant limitations in robot training.
Dong's lab achieved fully autonomous laundry processes using a combination of imitation and reinforcement learning.
The proposed two-dimensional Scaling Law scales data volume and task count together ("data volume × task count") instead of scaling data volume alone.
Generative AI can create 50 realistic training samples from a single real-world trajectory.
The focus on a unified growth curve aims to facilitate the commercialization of general-purpose robots.

Source Excerpt

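Two-dimensional scaling over "data volume × task count" is the right path to embodied AGI.

By Qi Chengyong (齐铖湧) | Edited by Lin Juemin (林觉民)

Model iteration in embodied intelligence has slowed recently, and disagreements over direction keep surfacing. In response, Dong Hao (董豪), associate professor at Peking University and chief scientist at 上纬启元, shared a new view: today's mainstream approaches, namely imitation learning, reinforcement learning, and simulation data, each have hard flaws, and the industry needs a different line of thinking.

At a recent embodied intelligence forum hosted by Baidu AI Cloud (百度智能云), Dong laid out his ideas in detail. He argues for a new two-dimensional, horizontal Scaling Law that strings together popular techniques such as world models, generated data, and human demonstration, so that robots learn more and more tasks while using less and less data. (雷峰网)

Dong firmly believes this is the key to deploying household and general-purpose humanoid robots at scale.

The following is Dong Hao's talk, obtained exclusively by AI科技评论 and edited and abridged without changing its original meaning:

01 Imitation learning only achieves a cold start; single-source demonstration data has an inherent flaw

On the Scaling Law that the large-model industry treats as consensus, Dong divides current embodied model training into two stages: pre-training relies on imitation learning, and later iteration relies on reinforcement learning. Both have significant weaknesses.

The strength of imitation learning is a fast cold start. Using standardized human demonstration data, it can quickly give a robot basic manipulation skills, following the same logic as large language model training. Its fatal weakness is that every training sample is a correct trajectory, so the distribution of faults and mistakes is missing entirely.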

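Even after accumulating more than ten thousand standard demonstrations, a robot that makes a mistake in a real-world scene cannot adjust or correct itself. Mature deployments already exist in China: the Beijing Academy of Artificial Intelligence (北京智源研究院) built a large-scale multimodal dataset on 15 heterogeneous dual-arm robots and trained a VLA (vision-language-action) model that generalizes across hardware, which has become a benchmark project for the imitation learning route. …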

Read on leiphone.com
