A Conversation with Yang Xiansheng of RoboSense (速腾聚创): General Intelligence for Robots Starts with a Pair of Eyes That "Don't Lie" | ICRA 2026

6/15/2026

Quick Answer

At ICRA 2026, Yang Xiansheng, vice president of RoboSense (速腾聚创), presented a new visual perception architecture for robots that aligns depth and RGB data at the physical level, which he says improves accuracy and reliability over earlier solutions.

Quick Take

The architecture aims to overcome the trade-off among stability, range, and accuracy that constrains traditional 3D cameras, improving robot training data and real-time perception and, in Yang's view, accelerating the evolution of physical AI.

Key Points

The new architecture aligns depth and color data at the physical level, so no post-hoc calibration or alignment algorithm is needed.
Current robots struggle with perception speed and accuracy, limiting operational efficiency.
RoboSense's SPAD (single-photon avalanche diode) technology allows for higher integration and performance in depth sensing.
The RGB-D sensor fusion approach reduces computational load while improving frame rates.
Future advancements will focus on enhancing tactile sensing alongside visual capabilities.
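
For context on the post-processing the architecture is said to remove: when depth and RGB come from separate sensors, each depth pixel usually has to be reprojected into the color image before the two can be used together. The sketch below illustrates that conventional step with a standard pinhole camera model. It is not RoboSense's method, and the function name, intrinsics, and 5 cm sensor offset are hypothetical values chosen only for illustration.

```python
# Illustrative sketch of conventional depth-to-RGB registration (pinhole model).
# All names and numbers here are assumptions for illustration, not from the article.

def register_depth_pixel(u, v, z, depth_k, color_k, rotation, translation):
    """Map a depth pixel (u, v) with depth z in metres into the color image.

    depth_k and color_k are (fx, fy, cx, cy) intrinsics. rotation is a 3x3
    row-major matrix and translation a 3-vector that take depth-camera
    coordinates to color-camera coordinates.
    Returns (u_c, v_c, z_c), or None if the point is behind the color camera.
    """
    fx_d, fy_d, cx_d, cy_d = depth_k
    fx_c, fy_c, cx_c, cy_c = color_k

    # 1. Back-project the depth pixel to a 3D point in the depth camera frame.
    point = ((u - cx_d) * z / fx_d, (v - cy_d) * z / fy_d, z)

    # 2. Apply the extrinsic calibration to move into the color camera frame.
    x_c, y_c, z_c = (
        sum(rotation[i][j] * point[j] for j in range(3)) + translation[i]
        for i in range(3)
    )
    if z_c <= 0:
        return None

    # 3. Project the 3D point onto the color image plane.
    return (fx_c * x_c / z_c + cx_c, fy_c * y_c / z_c + cy_c, z_c)


IDENTITY = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
K = (500.0, 500.0, 320.0, 240.0)  # hypothetical intrinsics shared by both cameras
BASELINE = (0.05, 0.0, 0.0)       # hypothetical 5 cm offset between the sensors

near = register_depth_pixel(320, 240, 1.0, K, K, IDENTITY, BASELINE)
far = register_depth_pixel(320, 240, 2.0, K, K, IDENTITY, BASELINE)
print(near, far)
```

With these assumed values, the same depth pixel lands 25 pixels to the right in the color image at 1 m but only 12.5 pixels to the right at 2 m. The offset depends on depth, so it has to be recomputed for every pixel of every frame, and any calibration error shows up as color attached to the wrong point. A sensor whose depth and RGB are aligned physically would not need this step.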

Source Excerpt

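Robots need not only to see, but to see far, see steadily, and see the full picture.

Author: Gao Jinghui (高景辉) | Editor: Ma Xiaoning (马晓宁)

While embodied-intelligence companies from around the world compete with demos on the ICRA 2026 show floor, one thorny problem goes largely overlooked: robots still do not have a pair of eyes that truly works well. It is a bottleneck the whole industry tacitly acknowledges. Everyone says that large models and VLA (vision-language-action) models have brought the dawn of general intelligence, yet deployment stalls at the most basic step: robots cannot perceive the 3D world accurately, cannot perform fine manipulation, and never match human speed. To compensate for sensor shortcomings and clear the bottleneck constraining physical AI, companies have had to pour resources into building simulation environments, collecting and labeling data, and using algorithms to "guess" depth... In essence, they are using software to fill a hole left by hardware.

Against this backdrop, Yang Xiansheng, vice president of RoboSense (速腾聚创), gave an academic talk at ICRA presenting a new visual perception architecture for robots. Unlike the industry's prevailing approach of "capture separately, then fuse algorithmically," this architecture aligns depth sensing and RGB natively at the physical level. Depth and color information need no post-hoc calibration and are output directly to the back end. In Yang's view, this is the fundamental path to solving robot perception.

But how exactly does the new architecture break the "impossible triangle" of stability, range, and accuracy that constrains traditional 3D cameras? The conversation between 雷峰网 (Leiphone) AI科技评论 (AI Technology Review) and Yang on site at ICRA may offer an answer.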

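▎AI Technology Review: RoboSense brought a new visual perception architecture to ICRA. Can you tell us briefly about it?

Yang Xiansheng: Compared with what came before, the biggest difference is that the raw data is already fused at the lowest level. So at the physical level, its depth and color information are natively aligned, with no need for later algorithmic processing. As a result, the architecture is better than previous solutions in accuracy and reliability, as well as in cost and performance. It will substantially improve the quality of robot training data and real-time perception, and will greatly accelerate the evolution of physical AI.

▎AI Technology Review: What are the current pain points in robot perception? …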

Read on leiphone.com
