Training-free method DivInit improves agentic search performance by five to seven points across five open-weight models and eight benchmarks
DivInit addresses query redundancy in parallel sampling by generating diverse first-turn queries from a single model call, yielding consistent gains in multi-hop QA without additional training.
1 source · cross-referenced
- DivInit is a training-free intervention that improves agentic search by generating diverse first-turn queries from a single model call instead of sampling independent queries.
Agentic search systems often scale breadth by running more parallel rollouts, but standard parallel sampling can suffer from diminishing returns due to query redundancy at the first turn. When models issue similar first queries across rollouts, subsequent turns retrieve overlapping evidence, limiting diversity in downstream reasoning.
The proposed method, DivInit, addresses this by drawing n candidate first queries from a single model call and selecting k < n diverse seeds to initialize parallel trajectories. This approach reduces redundancy at the source while maintaining matched compute compared to standard parallel sampling.
Across five open-weight models and eight benchmarks, DivInit consistently improves performance on multi-hop question answering, with average gains of five to seven points at the same compute budget. The improvements are reported without requiring additional training, making DivInit a practical intervention for existing agentic search systems.
The authors release code and note that the method is under review at EMNLP 2026. The paper includes eight figures and spans 15 pages, positioning DivInit as a lightweight yet effective technique for test-time scaling in agentic search.
- Jun 17, 2026 · arXiv cs.CL
Paper proposes PromptMN, a pseudo-prompting language to structure human-AI instructions
Trust79 - Jun 17, 2026 · arXiv cs.CL
Researchers propose MemSlides, a hierarchical memory framework for personalized slide generation agents
Trust79 - Jun 17, 2026 · arXiv cs.CL
Researchers propose RepSelect method to make LLM unlearning more robust against reversal attacks
Trust79