This paper shows that The Ling-2.6 and Ring-2.6 models enhance agentic intelligence with low-latency responses and advanced reasoning, utilizing architectural upgrades and a hybrid attention design for efficient training and deployment.
The Ling-2.6 and Ring-2.6 models enhance agentic intelligence with low-latency responses and advanced reasoning, utilizing architectural upgrades and a hybrid attention design for efficient training and deployment. Open-sourced checkpoints support further research in scalable agentic systems.
arXiv:2606. 15079v1 Announce Type: new Abstract: Efficient and scalable agentic intelligence requires models that can deliver both low-latency responses and strong reasoning capabilities while remaining practical to train, serve, and deploy. In this report, we present Ling-2. 6 and Ring-2. 6, a family of models designed to address this challenge at scale. Ling-2. 6 is optimized for instant response generation and high capability per output token, whereas Ring-2.
6 is tailored for deeper reasoning and more advanced agentic workflows. Instead of training from scratch, we upgrade the Ling-2. 0 base model through architectural migration pre-training and large-scale post-training. This upgrade is guided by a unified co-design of model architecture, optimization objectives, serving systems, and agent training environments, enabling improvements in both model capability and deployment efficiency.
At the architectural level, we introduce a hybrid linear attention design that integrates Lightning Attention with MLA, improving the efficiency of long-context training and decoding. To further enhance token efficiency, we optimize capability per output token through Evolutionary Chain-of-Thought, Linguistic Unit Policy Optimization, bidirectional preference alignment, and shortest-correct-response distillation.
For agentic capabilities, we propose KPop, a reinforcement learning framework designed to support stable training of Ring-2. 6-1T on large-scale environment-grounded data. KPop improves training efficiency through asynchronous scheduling across coding, search, , and workflow execution, enabling scalable learning from complex agent-environment interactions. Together, Ling-2. 6 and Ring-2. 6 provide a practical pathway toward efficient, scalable, and open agentic systems.
We open-source all checkpoints in the 2. 6 family to support further research and development in practical agentic intelligence.
Reader Mode unavailable (could not extract clean content).
Daily brief at your local 8am — bilingual EN/中文, free.
The REFLECT benchmark reveals that current LLM judges are unreliable, achieving below 55% accuracy in evaluating reasoning and evidence use, highlighting the need for improved evaluation methods for deep research agents.
This study evaluates LLM-based urban simulators like AgentSociety and CitySim, revealing a significant gap between narrative plausibility and real-world mobility realism. Using datasets from Greater Paris and Shanghai, the analysis shows these models struggle with core spatial and temporal constraints, necessitating rigorous empirical validation and improved initialization methods for realistic urban simulations.
The QIAS 2026 shared task evaluates large language models' reasoning in Islamic inheritance, utilizing the MAWARITH dataset of 12,500 annotated cases. Sixteen teams participated, revealing significant challenges in legal interpretation and numerical reasoning, with results indicating current models struggle with complex inheritance calculations.