Adaptive Latent Agentic Reasoning

arXiv cs.CL·Dongwon Jung, Peng Shi, Yi Zhang, Junshan Zhang, Muhao Chen

6/3/2026

·~1 min·6/3/2026·en·2

Quick Answer

This paper shows that The Adaptive Latent Agentic Reasoning (ALAR) framework enhances LLM agents by using compact latent reasoning for routine decisions and explicit chain-of-thought for complex ones, achieving up to 43.6% fewer tokens in search tasks and 84.6% in tool use while maintaining or improving task accuracy.

Quick Take

The Adaptive Latent Agentic Reasoning (ALAR) framework enhances LLM agents by using compact latent reasoning for routine decisions and explicit chain-of-thought for complex ones, achieving up to 43.6% fewer tokens in search tasks and 84.6% in while maintaining or improving task accuracy.

Key Points

ALAR reduces reasoning verbosity in LLM agents, improving efficiency.
Achieves up to 43.6% fewer tokens in agentic search tasks.
Reduces token generation by 84.6% in tool-use scenarios.
Maintains comparable or better task accuracy with adaptive reasoning.
Optimizes reasoning effort based on task complexity.

Paper Resources

Read Paperarxiv.org View PDFarxiv.org

Article Content

From source RSS / original summary

arXiv:2606. 02871v1 Announce Type: new Abstract: Large reasoning models improve performance by generating extended chain-of-thought (CoT) reasoning, but this behavior becomes inefficient when applied to LLM agents. Current LLM agents often generate verbose textual reasoning at every decision step and allocate reasoning effort nearly uniformly across turns, leading to substantial inefficiency in multi-turn agentic trajectories.

We propose Adaptive Latent Agentic Reasoning (ALAR), a dual-mode framework that uses compact latent reasoning for routine turns and selectively escalates to explicit chain-of-thought when deeper deliberation is needed. ALAR learns latent reasoning by using the agent's actions as supervision anchors and is further optimized to use latent reasoning when it is sufficient for task success and reserve explicit CoT for harder decisions.

Experiments on agentic search and benchmarks show that ALAR maintains comparable or better task accuracy while substantially reducing generated tokens by up to 43. 6% in search and 84. 6% in tool use. These results demonstrate that ALAR improves the accuracy-efficiency trade-off of LLM agents by reducing unnecessary textual reasoning while preserving explicit deliberation for harder decision steps.

Read on arxiv.org

Want this in your inbox every morning?

Daily brief at your local 8am — bilingual EN/中文, free.

Subscribe — it's free

More from arXiv cs.CL

See more →

arXiv cs.CL·Miguel Arana-Catania, Catherine Conisbee, Matthew Kidd

5d ago

FeaturedOriginal

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

AI Summary

The study evaluates three NLP approaches—Named Entity Recognition, Keyword Extraction, and Topic Modelling—using the Their Finest Hour Online Archive to automate keyword extraction from crowdsourced WWII collections. Findings suggest that while NLP methods show promise, no single approach is sufficient, and ethical considerations in automated keyword extraction are crucial for responsible stewardship.

#AI Coding #Inference #Open Source #Policy

Adaptive Latent Agentic Reasoning

Quick Answer

Quick Take

Key Points

Paper Resources

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

Quantifying Prior Dominance in Systems

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Quick Answer

Quick Take

Key Points

Paper Resources

Article Content

Want this in your inbox every morning?

More from arXiv cs.CL

Letting the Data Speak: Extracting Keywords from Crowdsourced Collections with AI

Quantifying Prior Dominance in RAG Systems

Time to REFLECT: Can We Trust LLM Judges for Evidence-based Research Agents?

Quantifying Prior Dominance in Systems