波恩大学 Maren Bennewitz 教授：让机器人在遮挡世界中主动获取信息 | ICRA 2026

6/5/2026

·~4 min·6/5/2026·zh·6

Quick Answer

Professor Maren Bennewitz from Bonn University emphasizes the necessity of active perception in robotics, advocating for a closed-loop system that integrates perception, prediction, prior knowledge, and action planning.

Quick Take

She presents methods for robots to effectively gather information in cluttered environments by moving objects and utilizing semantic maps, enhancing their understanding and operational efficiency in domestic, agricultural, and service scenarios.

Key Points

Robots must actively perceive to navigate cluttered, dynamic environments effectively.
Semantic maps help robots decide which objects to move for better visibility.
3D scene graphs enable targeted object searches without re-exploring entire environments.
Prior knowledge enhances robots' decision-making in complex tasks like fruit harvesting.
Active perception transforms uncertainty into actionable insights for robots.

Source Excerpt

机器人不能只会“看见”。作者丨郑佳美编辑丨马晓宁 2026 年 6 月 4 日，在 ICRA 2026 “Robot perception and spatial AI” Keynote Session 上，波恩大学教授 Maren Bennewitz 发表了关于主动感知机器人的演讲，直指真实机器人部署中的一个基本困境：机器人面对的世界往往是杂乱、持续变化且只能部分观测的，仅靠被动观察无法完成可靠理解。 Bennewitz 的核心判断是：机器人要真正进入家庭、农业和服务场景，不能只把感知当作“看一眼”的过程，而必须把感知、预测、先验知识和动作规划放到同一个闭环里。机器人需要主动移动视角、推动或抓取遮挡物，用最少的动作获得最多的信息。她在演讲中给出了三类典型场景：其一，在货架或桌面等遮挡环境中，机器人通过不确定性感知的语义地图，决定哪些物体值得移动；其二，在家庭物体搜索中，机器人利用 3D 场景图、语义先验、几何约束和物体重定位规律，在不重新探索全屋的情况下按需寻找物体；其三，在农业监测与果实采摘中，机器人借助上一时刻的地图先验、非刚性配准和叶片形变模型，规划更高效的观测与操作动作。

这场演讲的关键洞察在于：主动感知并不是“多看几眼”，而是把“看哪里、动什么、何时停止”变成信息增益最大化问题。对于机器人而言，世界不是一张静态照片，而是一组可以通过行动逐步揭开的信念分布。 1、真实环境的核心难点不是没有图像，而是不确定性和遮挡：机器人必须知道自己不知道什么。 2、主动感知的价值在于把动作变成信息采集工具：换视角、推开物体、移动叶片，都是为了降低地图和语义的不确定性。 …

Read on leiphone.com

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from 雷峰网 AI

See more →

刚刚，GPT 5.6 发布会上，OpenAI 暴露了哪些 Agent 技术路线？

雷峰网 AI

1w ago

FeaturedOriginal

刚刚，GPT 5.6 发布会上，OpenAI 暴露了哪些 Agent 技术路线？

AI Summary

OpenAI's GPT 5.6 integrates ChatGPT and Codex, introducing a for complex task execution, with models Soul, Terra, and Luna for efficient workflow management. The release emphasizes task orchestration, contextual understanding, and robust security measures for enterprise applications.

#Agent #AI Coding #Security #Enterprise AI