Skip to content
Research · Jul 4, 2026

Apple proposes amortized maximum inner product search to speed up vector retrieval

Regression-based approach trains neural networks to predict MIPS solutions directly, cutting compute cost for repeated queries over fixed databases.

Trust79
HypeLow hype

1 source · cross-referenced

ShareXLinkedInEmail
TL;DR
  • Apple researchers propose amortized MIPS, a regression-based method to predict vector search solutions directly rather than computing them repeatedly.
  • Two complementary models are introduced: SupportNet regresses the support function to route queries, and KeyNet regresses the optimal key for direct use in indexing pipelines.
  • On the BEIR benchmark with document embeddings, the approach improves IVF match rates while reducing compute measured in FLOPs, probes, or wall-clock time.

Apple’s Machine Learning Research team describes a method to amortize the cost of maximum inner product search (MIPS), a core subroutine in many machine learning pipelines that selects the best-matching vector from a database for a given query.

The proposed approach trains neural networks to directly predict MIPS solutions instead of computing them repeatedly for each query, assuming queries are drawn from a known distribution over a fixed key database.

The authors frame MIPS as a support function of the key set, motivating two models: SupportNet, an input-convex neural network that regresses the support function and can act as a cluster router to guide queries to relevant database partitions, and KeyNet, a vector-valued network that regresses the optimal key and can be used as a drop-in replacement for the original query in off-the-shelf indexing pipelines.

In experiments on the BEIR benchmark using document embeddings, SupportNets and KeyNets improved IVF match rates while reducing compute as measured by FLOPs, the number of probes, or wall-clock time.

Sources
  1. 01Apple — Machine Learning ResearchAmortizing Maximum Inner Product Search with Learned Support Functions
Also on Research

Stories may contain errors. Dispatch is assembled with AI assistance and curated by human editors; despite the trust-score filter, mistakes happen. We correct publicly — every article links to its revision history. Nothing here is financial, legal, or medical advice. Verify before relying on any claim.

© 2026 Dispatch. No ads. No sponsorships. No paid placement. Reader-supported via Ko-fi.

Built by a person who cares about honest AI news.