雷峰网 AI 学术

https://www.leiphone.com/category/academic

Latest AI signals from 雷峰网 AI 学术

DeepSignal tracks AI updates from 雷峰网 AI 学术, filtering research and product signals into plain-English summaries, signal scores and source-linked article pages.

Current topics: China Research, AI Startup, LLM, AI Assistant, Robotics · Companies: DeepSeek, DeepMind, Gemini

High-signal updates

AMI Labs 冯雁：AI 迈向现实世界，世界模型不可或缺 | ICML 2026

雷峰网 AI 学术

5d ago

FeaturedOriginal

AMI Labs 冯雁：AI 迈向现实世界，世界模型不可或缺 | ICML 2026

AI Summary

Pascale Fung at ICML 2026 emphasized the necessity of world models for AI in real-world applications, highlighting JEPA's advantages over generative models like Cosmos. JEPA's smaller parameters and faster inference lead to superior performance in tasks like robotic motion planning, outperforming large language models in physical reasoning benchmarks.

Why Featured

The emphasis on world models, particularly through the JEPA framework, signals a shift towards more efficient AI systems that excel in real-world tasks like robotic motion planning. Builders and PMs should consider integrating these models to enhance performance while reducing resource consumption, making them attractive for investors focused on scalable AI solutions.

#LLM #Inference #Robotics

6

雷峰网 AI 学术

6d ago

FeaturedOriginal

ICML 2026现场直击：从展台到游轮，中国大厂在顶会抢人才

AI Summary

At ICML 2026 in Seoul, major Chinese tech firms like Alibaba, ByteDance, and Xiaomi actively recruited AI talent, showcasing their latest research and innovations. The event highlighted a shift in focus from traditional internet applications to industry-specific AI, with significant participation from quant firms like JQData, emphasizing the growing importance of AI in finance and various sectors.

Why Featured

The active recruitment of AI talent by major Chinese tech firms at ICML 2026 signals a strategic shift towards industry-specific AI applications, particularly in finance. Builders and PMs should focus on developing solutions tailored to these emerging sectors, while investors may find opportunities in startups that align with this trend.

#AI Startup #Enterprise AI #Policy

7

ICML 2026 开幕，清华团队获最佳论文奖，DeepMind 经典巨作拿下时间检验奖

雷峰网 AI 学术

1w ago

FeaturedOriginal

ICML 2026 开幕，清华团队获最佳论文奖，DeepMind 经典巨作拿下时间检验奖

AI Summary

ICML 2026 opened in Seoul with Tsinghua University winning the Outstanding Paper Award for their work on diffusion language models, revealing a 'flexibility trap' in token generation. DeepMind's 2016 paper on asynchronous methods for deep reinforcement learning received the Test of Time Award, highlighting its lasting impact on the field.

Why Featured

The Outstanding Paper Award at ICML 2026 for Tsinghua University's work on diffusion language models highlights a significant advancement in token generation, suggesting new avenues for improving AI language applications. Builders and PMs should consider integrating these findings to enhance model performance, while investors may see potential in startups leveraging these innovations.

#AI Startup #Policy

11

2026北京智源大会开幕 | 从“悟道”到“悟界”，智源研究院推动人工智能、物理世界和生命科学“三体互动”

雷峰网 AI 学术

4w ago

FeaturedOriginal

2026北京智源大会开幕 | 从“悟道”到“悟界”，智源研究院推动人工智能、物理世界和生命科学“三体互动”

AI Summary

The 2026 Beijing Zhiyuan Conference showcased advancements in AI, featuring models like WuJie·Emu3.5 and WuJie·Brainμ1.0, which achieved significant breakthroughs in multimodal learning and neuroscience applications. Notably, the WuJie·Physis model aims to unify physical state learning, enhancing AI's interaction with the real world, while the BAAI Cardiac Agent demonstrated diagnostic accuracy exceeding 0.93 AUC.

Why Featured

The introduction of the WuJie·Physis model at the 2026 Beijing Zhiyuan Conference signifies a major advancement in AI's ability to interact with the physical world, which could lead to more effective applications in robotics and IoT. Additionally, the high diagnostic accuracy of the BAAI Cardiac Agent indicates potential for AI-driven healthcare solutions, attracting interest from investors in the health tech sector.

#LLM #Agent #Robotics #AI Assistant

2

港中文团队提出 Skill 生命周期管理 SLIM，让大模型智能体不再盲目堆积 Skill ！

雷峰网 AI 学术

6/1/2026

FeaturedOriginal

港中文团队提出 Skill 生命周期管理 SLIM，让大模型智能体不再盲目堆积 Skill ！

AI Summary

The CUHK team introduced SLIM for dynamic skill lifecycle management in LLM agents, enhancing task performance by 7.1% over traditional methods like SkillRL. SLIM intelligently retains, retires, or expands skills based on their contributions, optimizing agent capabilities for complex tasks.

Why Featured

The introduction of SLIM by the CUHK team for dynamic skill lifecycle management in LLM agents is significant as it improves task performance by 7.1% compared to traditional methods. This advancement allows builders and PMs to create more efficient AI systems that adaptively manage skills, potentially leading to better resource allocation and enhanced user experiences.

#LLM #Agent #AI Startup

2

2050学习节「AGI 4 Science」专场：17位青年学者「挤」在3小时里，都讲了些什么？

雷峰网 AI 学术

5/11/2026

Original

2050学习节「AGI 4 Science」专场：17位青年学者「挤」在3小时里，都讲了些什么？

AI Summary

The 'AGI 4 Science' session at the 2050 Learning Festival featured 17 young scholars discussing AI's evolving role in scientific research, emphasizing AI's potential to reduce experimental costs and enhance interdisciplinary collaboration. Key topics included AI's application in fields like controlled nuclear fusion, semiconductor design, and biological sciences, with a focus on bridging the gap between simulation and real-world applications.

Why Featured

The 'AGI 4 Science' session highlighted AI's potential to significantly lower experimental costs and foster interdisciplinary collaboration in scientific research. Builders and PMs should note the applicability of AI in fields like nuclear fusion and semiconductor design, as these advancements could lead to new market opportunities and innovative product development.

#AI Assistant #Policy

1

雷峰网 AI 学术

4/30/2026

Original

我所知道的代季峰：从微软亚研7万次引用，到盛大3亿美金风暴

AI Summary

Dai Jifeng's collaboration with MiroMind led to the development of the MiroMind ODR system, surpassing OpenAI's DeepResearch. However, internal conflicts over technology transfer and intellectual property resulted in his abrupt departure after just five months, leading to his new venture Naive.ai, which secured $300 million in funding.

Why Featured

Dai Jifeng's departure from MiroMind and the launch of Naive.ai, backed by $300 million in funding, signals a shift in AI innovation dynamics. Builders and PMs should note the potential for new competitive technologies emerging from startups, while investors may find opportunities in funding ventures that prioritize agile development and intellectual property clarity.

#Funding #AI Startup

3

港大张清鹏 Nature 子刊最新研究：AI 结合血液多组学，提前 15 年预测心血管疾病风险

雷峰网 AI 学术

2/5/2026

Original

港大张清鹏 Nature 子刊最新研究：AI 结合血液多组学，提前 15 年预测心血管疾病风险

AI Summary

A study led by HKU's Zhang Qingpeng integrates AI with blood multi-omics to predict cardiovascular disease risk up to 15 years in advance. The CardiOmicScore framework uses 2,920 proteins and 168 metabolites, outperforming traditional genetic risk scores with C-indexes of 0.69-0.82 for ProScore and 0.64-0.74 for MetScore, enhancing risk assessment accuracy significantly.

Why Featured

The integration of AI with blood multi-omics in the CardiOmicScore framework allows for predicting cardiovascular disease risk up to 15 years in advance, significantly improving risk assessment accuracy. This development presents opportunities for builders and PMs to innovate in health tech solutions and for investors to capitalize on advancements in predictive healthcare technologies.

#AI Image #AI Assistant

0

开源新标杆！商汤 SenseNova-MARS超 Gemini-3-Pro，模型代码数据全开放

雷峰网 AI 学术

1/30/2026

Original

开源新标杆！商汤 SenseNova-MARS超 Gemini-3-Pro，模型代码数据全开放

AI Summary

SenseTime has launched the open-source SenseNova-MARS, outperforming Gemini-3-Pro and GPT-5.2 in key benchmarks with scores of 69.74, 69.06, and 67.64 respectively. This model excels in dynamic visual reasoning and tool invocation, making it a top performer in complex task execution.

Why Featured

SenseTime's launch of the open-source multimodal model SenseNova-MARS, which outperforms Gemini-3-Pro and GPT-5.2, signals a significant advancement in AI capabilities for dynamic visual reasoning and tool invocation. This development offers builders and PMs a powerful tool for complex task execution, while investors should note its potential to disrupt existing models and enhance competitive positioning in the AI market.

#Open Source #AI Startup

1

雷峰网 AI 学术

1/26/2026

Original

上交大 SciMaster 团队新作：一个「AI 物理博士」的诞生

AI Summary

The SciMaster team from Shanghai Jiao Tong University has developed PHYSMASTER, an autonomous AI physicist capable of completing complex research tasks in theoretical and computational physics, demonstrating significant advancements in research efficiency and capability. This system can autonomously execute full research workflows, achieving results comparable to human researchers in a fraction of the time.

Why Featured

The development of PHYSMASTER by the SciMaster team represents a significant leap in AI's ability to autonomously conduct complex research in physics, which could drastically reduce research timelines and costs. For builders, PMs, and investors, this signals a shift towards AI-driven research tools that can enhance productivity and innovation across various scientific fields.

#Agent #Robotics #AI Assistant #AI Startup

0

雷峰网 AI 学术

1/22/2026

Original

卢宗青团队新作：人类先验打底，统一动作对齐，通用机器人模型正在落地

AI Summary

The research team led by Lu Zongqing has developed the Being-H0.5 model, which demonstrates cross-embodiment generalization for robots, achieving a 98.9% success rate on the LIBERO benchmark. By leveraging the UniHand-2.0 dataset, the model addresses the challenges of action consistency across different robot forms, enhancing deployment stability in real-world scenarios.

Why Featured

The development of the Being-H0.5 model by Lu Zongqing's team, which achieves 98.9% success on the LIBERO benchmark, signifies a major advancement in cross-embodiment generalization for robots. This enhances the reliability and versatility of robotic applications, making it a crucial consideration for builders, PMs, and investors focusing on scalable robotic solutions in diverse environments.

#Robotics #AI Startup

1

Nature 正刊收录！清华 FIB 实验室揭示：AI 提升科学家个人影响力，却收缩科学整体探索空间

雷峰网 AI 学术

1/19/2026

Original

Nature 正刊收录！清华 FIB 实验室揭示：AI 提升科学家个人影响力，却收缩科学整体探索空间

AI Summary

A study by Tsinghua University's FIB Lab, published in Nature, reveals that while AI significantly boosts individual scientists' impact—showing a 3.02x increase in publications and 4.84x in citations—it concurrently contracts the overall scope of scientific exploration by 4.63%, leading to reduced academic interaction.

Why Featured

The study from Tsinghua University's FIB Lab published in Nature highlights that while AI enhances individual scientists' productivity and citation impact, it also narrows the overall scope of scientific inquiry. Builders, PMs, and investors should consider how AI tools can optimize individual performance without compromising collaborative exploration, as this could influence funding and development strategies in research-focused ventures.

#AI Assistant #Policy

1

雷峰网 AI 学术

1/16/2026

Original

上科大何旭明团队新作：克服简单样本偏置，让多模态模型学会「难题优先」

AI Summary

The DA- framework developed by Professor He Xuming's team at ShanghaiTech University effectively reduces hallucinations in multimodal models by prioritizing difficult samples during training, achieving significant improvements in hallucination rates and model performance across benchmarks like AMBER and MMHalBench without additional labeling costs.

Why Featured

The DA-DPO framework developed by Professor He Xuming's team addresses hallucination issues in multimodal models by prioritizing challenging samples during training. This advancement is crucial for builders and PMs as it enhances model reliability without incurring additional labeling costs, making it more feasible for investors to support projects that leverage improved AI performance in real-world applications.

#Inference #AI Assistant #AI Startup

1

雷峰网 AI 学术

1/14/2026

Original

浙大彭思达团队 × 理想最新研究：直面高分辨率深度的细节缺失

AI Summary

The Zhejiang University team, in collaboration with Li Auto, introduced InfiniDepth, a novel depth estimation model that overcomes resolution limitations by directly predicting depth values at arbitrary resolutions. In benchmarks, InfiniDepth achieved a δ1 score of 96.3% on the Synth4K dataset, outperforming existing methods by 5-10 percentage points in high-frequency detail areas, crucial for applications in autonomous driving and robotics.

Why Featured

The introduction of InfiniDepth by the Zhejiang University team and Li Auto represents a significant advancement in depth estimation technology, achieving a δ1 score of 96.3%. This improvement in resolution and detail is crucial for builders and PMs in the autonomous driving and robotics sectors, as it enhances the accuracy of perception systems, potentially leading to safer and more efficient products.

#Robotics #AI Image

2

清华孙茂松团队 × 深言科技：以解释作为训练信号，让 8B 模型在幻觉检测上反超闭源大模型

雷峰网 AI 学术

1/14/2026

Original

清华孙茂松团队 × 深言科技：以解释作为训练信号，让 8B 模型在幻觉检测上反超闭源大模型

AI Summary

Tsinghua's Sun Maosong team and DeepMind's FaithLens model achieve superior hallucination detection with only 8B parameters, outperforming larger closed-source models like GPT-4.1 and Claude 3.7. This new approach emphasizes generating useful explanations as training signals, significantly reducing computational costs while enhancing reliability in high-stakes applications.

Why Featured

The collaboration between Tsinghua's Sun Maosong team and DeepMind's FaithLens model demonstrates that an 8B parameter model can outperform larger closed-source models in hallucination detection by utilizing explanations as training signals. This development indicates a potential shift towards more efficient, cost-effective AI solutions that prioritize reliability, which is crucial for builders and PMs focusing on high-stakes applications.

#LLM #Open Source #AI Startup

1

雷峰网 AI 学术

1/14/2026

Original

北大卢宗青团队新作：超 70% 实机成功率，支持语言指令的功能性抓取系统

AI Summary

Peking University's Lu Zongqing team developed DemoFunGrasp, achieving over 70% success in functional grasping tasks using language commands, significantly improving robotic interaction with objects. This method integrates functional positioning and grasping styles, enabling robots to perform tasks like pouring water or spraying effectively.

Why Featured

The development of DemoFunGrasp by Peking University's Lu Zongqing team, which achieves over 70% success in functional grasping using language commands, is significant for builders and PMs as it enhances robotic capabilities for practical tasks. This advancement opens up new opportunities for investors in robotics and AI applications that require intuitive human-robot interaction.

#LLM #Robotics #AI Assistant

4

雷峰网 AI 学术

1/8/2026

Original

GAIR 2025 世界模型分论坛：从通用感知到视频、物理世界模型的百家争鸣

AI Summary

The GAIR 2025 forum showcased advancements in world models, featuring talks from researchers like Peng Sida on embodied intelligence and 3D perception, and Hu Wenbo on 3D-aware video models. Innovations included the UP2You method, reducing digital human modeling time from 4 hours to 1.5 minutes, and the introduction of SpatialTracker for robust 3D tracking.

Why Featured

The introduction of the UP2You method, which reduces digital human modeling time from 4 hours to 1.5 minutes, significantly accelerates the development process for builders and PMs in creating realistic avatars and simulations. This efficiency gain can lead to faster product iterations and lower costs, making it an attractive opportunity for investors in the AI and gaming sectors.

#Inference #Robotics #AI Video

6

雷峰网 AI 学术

1/8/2026

Original

上海AI Lab胡侠：KV Cache压缩之后，可让价格2万美金的GPU发挥出20万美金的价值 | GAIR 2025

AI Summary

Shanghai AI Lab's Hu Xia introduces a 'Lossy Computation' approach to enhance large language models' efficiency, achieving up to 8x context length and 3.5x speedup by compressing KV Cache to 2 bits. This innovation could elevate a $20,000 GPU's value to $200,000 by significantly increasing memory capacity.

Why Featured

The introduction of 'Lossy Computation' by Shanghai AI Lab allows for significant improvements in large language model efficiency, potentially transforming a $20,000 GPU into a $200,000 asset. This development is crucial for builders and PMs as it enables cost-effective scaling of AI applications while investors should note the enhanced performance and value proposition of hardware investments.

#LLM #GPU #Open Source

1

雷峰网 AI 学术

12/26/2025

Original

北交大 x 小米 EV 团队：一次关于世界模型「靠不靠谱」的系统复盘

AI Summary

A collaborative study by Beijing Jiaotong University and Xiaomi's autonomous driving team critiques the reliability of world models in real driving scenarios, revealing that improvements in visual prediction metrics do not translate to enhanced system robustness. The research emphasizes the need for a unified evaluation framework to accurately assess model performance in complex environments.

Why Featured

The collaborative study by Beijing Jiaotong University and Xiaomi highlights that advancements in visual prediction metrics do not guarantee improved robustness in world models for autonomous driving. This signals to builders and PMs the necessity for a unified evaluation framework to ensure reliability in real-world applications, which is crucial for investor confidence and product viability.

#Robotics #AI Startup #Policy

0

计算所严明玉团队新作： Attention 并非永远是瓶颈，多 GPU 并不一定更快

雷峰网 AI 学术

12/22/2025

Original

计算所严明玉团队新作： Attention 并非永远是瓶颈，多 GPU 并不一定更快

AI Summary

The research by Mingyu Yan's team reveals that LLM inference performance is not solely bottlenecked by attention or improved by multi-GPU setups. Their systematic study on GPU inference identifies distinct phases (Prefill and Decode) that dictate performance, suggesting that optimization strategies must consider workload characteristics and system architecture.

Why Featured

The research by Mingyu Yan's team highlights that LLM inference performance can be optimized beyond just focusing on attention mechanisms or multi-GPU setups. This suggests that builders and PMs should consider workload characteristics and system architecture when developing AI solutions, while investors may need to reassess the scalability and efficiency of AI infrastructure investments.

#LLM #Inference #GPU

3

雷峰网 AI 学术

12/19/2025

Original

中山大学王广润：大模型的微调只是对空间建模的微调 | GAIR 2025

AI Summary

Dr. Wang Guangrun from Sun Yat-sen University emphasizes the need for advanced AI models to effectively understand and interact with the physical world. His new embodied model, E0, showcases significant improvements in precision and adaptability, requiring minimal parameter adjustments for new environments, as demonstrated in various robotic tasks.

Why Featured

Dr. Wang Guangrun's development of the E0 embodied model highlights a significant advancement in AI's ability to adapt to physical environments with minimal adjustments. This implies that builders and PMs can leverage this technology for more efficient robotic applications, while investors may find opportunities in companies integrating such adaptable AI solutions into their products.

#LLM #AI Assistant #Enterprise AI

2

欧洲科学与艺术院长 Klaus Mainzer：通用人工智能的终极通关秘籍，藏在思想史里 GAIR Live | 018

雷峰网 AI 学术

11/7/2025

Original

欧洲科学与艺术院长 Klaus Mainzer：通用人工智能的终极通关秘籍，藏在思想史里 GAIR Live | 018

AI Summary

Klaus Mainzer argues that true AGI requires insights from humanities, emphasizing the philosophical challenges of creativity and embodiment. He critiques current AI's reliance on formal logic and calls for educational reform to integrate systems thinking across disciplines, highlighting the need for a new generation of thinkers who can bridge science and humanities.

Why Featured

Klaus Mainzer's call for integrating humanities into AI development highlights the need for a multidisciplinary approach to achieve true AGI. Builders and PMs should consider incorporating systems thinking and creativity into their projects, while investors may need to reassess funding strategies to support educational initiatives that foster this new generation of thinkers.

#AI Assistant #Policy

1

雷峰网 AI 学术

8/31/2025

Original

万字长文实录：RL 界与 CV 界的“世界模型”有什么不同？丨GAIR Live

AI Summary

The GAIR Live roundtable discussed the differences in world models between reinforcement learning and computer vision, emphasizing the need for integrating physical laws into embodied intelligence. Key insights included the importance of causal relationships and the challenges of 2D versus 3D modeling in AI applications like autonomous driving.

Why Featured

The discussion on integrating physical laws into world models highlights a crucial development for AI applications like autonomous driving, where understanding causal relationships is essential. Builders and PMs should focus on creating models that effectively bridge 2D and 3D environments, as this will enhance the safety and reliability of AI systems, making them more attractive to investors.

#LLM #AI Coding

3

32B 稠密模型推理能力超越 R1？秘密 AI 团队发布推理小模型 AM-Thinking-v1

雷峰网 AI 学术

5/15/2025

Original

32B 稠密模型推理能力超越 R1？秘密 AI 团队发布推理小模型 AM-Thinking-v1

AI Summary

The AM-Thinking-v1 model, developed by the A-M-team, a secretive research group, outperforms the 671B DeepSeek-R1 in reasoning tasks with a compact 32B architecture, achieving scores of 85.3 and 70.3 in AIME and benchmarks, respectively. This model demonstrates that substantial reasoning capabilities can be achieved without relying on massive datasets or expensive computational resources.

Why Featured

The release of the AM-Thinking-v1 model, which outperforms the 671B DeepSeek-R1 in reasoning tasks with a compact 32B architecture, signals a shift towards more efficient AI solutions that require less computational power and data. This development is crucial for builders and PMs looking to create scalable AI applications while minimizing costs and resource requirements.

#Inference #AI Startup

2

雷峰网 AI 学术

5/15/2025

Original

首次披露！DeepSeek V3 发布软硬一体协同训练论文，公开“降成本”秘诀

AI Summary

DeepSeek V3 introduces a cost-effective training model using only 2,048 NVIDIA H800 GPUs, achieving state-of-the-art performance through innovative techniques like FP8 mixed precision and multi-head latent attention. This model addresses memory efficiency and computational costs, making large-scale AI training accessible for smaller teams.

Why Featured

The release of DeepSeek V3's cost-effective training model using only 2,048 NVIDIA H800 GPUs significantly lowers the barrier to entry for AI development, making advanced AI training feasible for smaller teams. This innovation in memory efficiency and computational cost management signals a shift towards democratizing AI technology, which is crucial for builders, PMs, and investors looking to scale their projects efficiently.

#AI Coding #Open Source #Funding

1

雷峰网 AI 学术

2/28/2025

Original

万字梳理：揭秘 DeepSeek 中的 RL 与 AGI 下一步丨AIR 2025

AI Summary

DeepSeek's innovative use of large-scale reinforcement learning (RL) over traditional supervised fine-tuning (SFT) significantly enhances model reasoning capabilities, as discussed at AIR 2025 by researchers from institutions like UCL and CMU. Key findings include the effectiveness of preference fine-tuning and the introduction of the Goedel-Prover model for formal mathematical proofs, achieving state-of-the-art performance.

Why Featured

The introduction of DeepSeek's large-scale reinforcement learning approach, particularly with the Goedel-Prover model for formal proofs, signals a significant leap in AI reasoning capabilities. This development is crucial for builders and PMs focusing on advanced AI applications, as it suggests new pathways for creating more robust and intelligent systems that can handle complex reasoning tasks.

#LLM #Agent #AI Startup #Policy

1

北京大学-字节跳动成立“豆包大模型系统软件联合实验室”，聚焦AI系统软件关键技术问题

雷峰网 AI 学术

12/13/2024

Original

北京大学-字节跳动成立“豆包大模型系统软件联合实验室”，聚焦AI系统软件关键技术问题

AI Summary

Peking University and ByteDance established the 'Doubao Large Model System Software Joint Laboratory' to address key AI system software challenges, focusing on intelligent software technologies and ecosystem development. The collaboration aims to enhance research and talent cultivation in AI, leveraging both institutions' strengths in foundational research and practical applications.

Why Featured

The establishment of the 'Doubao Large Model System Software Joint Laboratory' by Peking University and ByteDance signifies a strategic collaboration aimed at advancing AI system software technologies. This development is crucial for builders and PMs as it indicates a growing focus on foundational research, which can lead to more robust AI applications and a stronger talent pipeline in the industry.

#LLM #Open Source #AI Startup

3