Research · Jun 17, 2026

Training-free method DivInit improves agentic search performance by five to seven points across five open-weight models and eight benchmarks

DivInit addresses query redundancy in parallel sampling by generating diverse first-turn queries from a single model call, yielding consistent gains in multi-hop QA without additional training.

Trust79

HypeLow hype

1 source · cross-referenced

ShareX LinkedIn Email

TL;DR

DivInit is a training-free intervention that improves agentic search by generating diverse first-turn queries from a single model call instead of sampling independent queries.

Agentic search systems often scale breadth by running more parallel rollouts, but standard parallel sampling can suffer from diminishing returns due to query redundancy at the first turn. When models issue similar first queries across rollouts, subsequent turns retrieve overlapping evidence, limiting diversity in downstream reasoning.

The proposed method, DivInit, addresses this by drawing n candidate first queries from a single model call and selecting k < n diverse seeds to initialize parallel trajectories. This approach reduces redundancy at the source while maintaining matched compute compared to standard parallel sampling.

Across five open-weight models and eight benchmarks, DivInit consistently improves performance on multi-hop question answering, with average gains of five to seven points at the same compute budget. The improvements are reported without requiring additional training, making DivInit a practical intervention for existing agentic search systems.

The authors release code and note that the method is under review at EMNLP 2026. The paper includes eight figures and spans 15 pages, positioning DivInit as a lightweight yet effective technique for test-time scaling in agentic search.

Sources

01arXiv cs.AI — Beyond Parallel Sampling: Diverse Query Initialization for Agentic Search

Also on Research

Training-free method DivInit improves agentic search performance by five to seven points across five open-weight models and eight benchmarks

Apple researchers propose graph-based sensemaking workflows using UMAP’s internal kNN graph

Apple details memory-efficient audio synthesis architecture for on-device Siri Expressive Voices

Apple proposes MoMo, a two-stage imitation-learning framework for robot motion control