Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search

arXiv cs.AI·Sidhaarth Murali, João Coelho, Jingjie Ning, João Magalhães, Bruno Martins, Chenyan Xiong

6/17/2026

Quick Answer

The paper introduces DivInit, a training-free method that diversifies the first query across parallel rollouts in agentic search, improving performance by reducing query redundancy.

Quick Take

Tested on five open-weight models across eight benchmarks, DivInit achieves average gains of five to seven points on multi-hop QA at matched compute cost. Rather than sampling each rollout's first query independently, it draws diverse seed queries from a single call, which makes parallel sampling more efficient.
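To make the idea of picking diverse seeds from one pool of candidates concrete, here is a minimal sketch. It is not the paper's algorithm: the summary does not say how DivInit selects its seeds, so the greedy farthest-point rule, the token-level Jaccard distance, and the names `select_diverse_seeds`, `tokens`, and `jaccard_distance` are all assumptions made for illustration.

```python
from typing import List, Set


def tokens(query: str) -> Set[str]:
    """Lowercased whitespace tokens; a crude stand-in for a real embedding."""
    return set(query.lower().split())


def jaccard_distance(a: Set[str], b: Set[str]) -> float:
    """1 - |a & b| / |a | b|: 0.0 for identical sets, 1.0 for disjoint ones."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def select_diverse_seeds(candidates: List[str], k: int) -> List[str]:
    """Greedy farthest-point selection (an assumed heuristic, not DivInit itself).

    Start from the first candidate, then repeatedly add the candidate whose
    nearest already-chosen seed is farthest away.
    """
    if k <= 0 or not candidates:
        return []
    token_sets = [tokens(c) for c in candidates]
    chosen = [0]
    while len(chosen) < min(k, len(candidates)):
        best_idx, best_score = -1, -1.0
        for i in range(len(candidates)):
            if i in chosen:
                continue
            # Distance to the closest seed picked so far.
            score = min(jaccard_distance(token_sets[i], token_sets[j]) for j in chosen)
            if score > best_score:
                best_idx, best_score = i, score
        chosen.append(best_idx)
    return [candidates[i] for i in chosen]
```

Given the hypothetical candidates "who directed the film Inception", "who directed the movie Inception", "Inception film release year", and "Christopher Nolan birthplace", asking for two seeds keeps the first and last queries and skips the near-duplicate second one. A real system would likely replace the token overlap with embedding similarity.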

Key Points

DivInit improves agentic search by addressing query redundancy in parallel sampling.
Average performance gains of five to seven points on multi-hop QA benchmarks.
Tested across five open-weight models and eight different benchmarks.
Draws diverse candidates from a single call, enhancing query initialization.
Code for DivInit is available on GitHub for further exploration.

Source Excerpt

arXiv:2606.17209v1 Announce Type: new Abstract: […] for agentic search typically increases depth (i.e., more turns and tokens per trajectory) or breadth (i.e., more parallel rollouts). Here we focus on breadth scaling, showing that standard parallel sampling yields diminishing returns, tracing this to query redundancy at the first turn. When models issue similar first queries across rollouts, the threads retrieve overlapping evidence, and subsequent turns are conditioned on this shared retrieval.

We address this limitation with DivInit, a training-free intervention at the first turn. …
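The overlapping evidence described in the excerpt can be put into a number. The sketch below is one illustrative way to do that, not a metric taken from the paper: the function name `evidence_overlap` and the document IDs are hypothetical, and the measure is simply the share of retrieved documents that duplicate something another rollout already fetched.

```python
from typing import List


def evidence_overlap(retrieved: List[List[str]]) -> float:
    """Fraction of retrieved documents that are redundant across rollouts.

    `retrieved[i]` holds the document IDs fetched by rollout i at the first
    turn. Returns 0.0 when every rollout sees different documents and
    approaches 1.0 as all rollouts fetch the same ones.
    """
    # Deduplicate within each rollout so only cross-rollout repeats count.
    per_rollout = [set(docs) for docs in retrieved]
    total = sum(len(docs) for docs in per_rollout)
    if total == 0:
        return 0.0
    unique = len(set().union(*per_rollout))
    return 1.0 - unique / total
```

For example, two rollouts that both retrieve the hypothetical documents `d1` and `d2` score 0.5, while two rollouts that retrieve `d1` and `d2` respectively score 0.0.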
