June 8, 2026Infrastructure Open Source Tool

turbovec wants to kill FAISS's training step

turbovec is having a moment on GitHub, over 1,500 stars in a single day. It's a vector index written in Rust with Python bindings, and the pitch is brutally simple: throw your vectors in, they're indexed, done. No train phase, no k-means codebook, no rebuilds when your corpus grows.

That's the whole point. FAISS, the thing everyone reaches for, leans on Product Quantization, which means training a codebook and retraining it whenever your data drifts. turbovec is built on Google Research's TurboQuant instead, a data-oblivious quantizer that needs zero passes over your data and still hits the Shannon lower bound on distortion. In plain terms: it compresses hard without ever looking at your data first. A 10-million-document corpus that eats 31GB as float32 fits in about 4GB here, and the hand-written NEON and AVX-512 kernels beat FAISS's fast-scan by 12 to 20 percent on ARM.

Why should an agent feed care about a vector index? Because memory is the bottleneck right now. Every agent that remembers anything is doing retrieval under the hood, and the train-then-rebuild dance is exactly the friction you don't want when an agent is writing new memories every minute. A local-first index with no training step and no managed service is the kind of plumbing that makes always-on agents actually cheap to run.

Fair warning, it first showed up around mid-May and is only catching fire on trending now, so it's the attention that's new, not the code. It's MIT-licensed, on PyPI and crates.io, with LangChain, LlamaIndex and Haystack integrations already wired up. Repo: github.com/RyanCodrai/turbovec

← Previous

Deep Dive: The week AI started rewriting the papers

Agent memory just got its first real systems paper

← Back to all articles

turbovec wants to kill FAISS's training step

Related Articles

Comments