Level Up Your Vector Search in PostgreSQL: What’s New with PGVector 0.8 and Beyond 🚀
Are you diving into the exciting world of vector search? 🌐 PostgreSQL, with its powerful pgvector extension, has become a go-to choice for many. But as workloads grow, we're hitting some walls. Let's explore the latest advancements in pgvector, the challenges we face, and what the future holds.
What’s Vector Search, and Why Does it Matter? 🎯
Imagine searching for images that resemble a given image, or finding documents with similar themes. That's the power of vector search! Instead of exact matches, you're finding data based on similarity. The pgvector extension brings this capability to PostgreSQL, leveraging machine learning (embedding) models to convert data into vectors. 💡
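As a minimal sketch of what that looks like in practice (the table, contents, and the tiny 3-dimensional vectors are illustrative — real embedding models produce hundreds or thousands of dimensions):

```sql
-- Enable the extension (requires pgvector to be installed on the server)
CREATE EXTENSION IF NOT EXISTS vector;

-- A table whose rows carry an embedding alongside the original data
CREATE TABLE items (
    id        bigserial PRIMARY KEY,
    content   text,
    embedding vector(3)
);

INSERT INTO items (content, embedding) VALUES
    ('red apple',  '[0.9, 0.1, 0.0]'),
    ('fire truck', '[0.8, 0.2, 0.1]');

-- Nearest neighbors by L2 distance (<->); pgvector also offers
-- cosine distance (<=>) and negative inner product (<#>)
SELECT content
FROM items
ORDER BY embedding <-> '[0.85, 0.15, 0.05]'
LIMIT 5;
```

The `ORDER BY distance LIMIT k` shape is the canonical pgvector query — it's exactly this pattern that the HNSW index discussed below accelerates.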
The Challenge: Selectivity and Performance 🛠️
Early versions of pgvector were great, but as datasets grew, performance bottlenecks emerged. The query planner sometimes made poor decisions about using the HNSW index (a graph-based index for approximate nearest-neighbor search), leading to:
- Slow Queries: Scanning the entire index when it wasn’t necessary.
- Inefficient Resource Use: Wasting valuable processing power.
- Overfiltering: Narrowing down results too much, missing potentially relevant data.
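The overfiltering problem is easiest to see in a filtered similarity query. A hedged sketch (the `category` column is hypothetical; the index syntax is pgvector's):

```sql
-- An HNSW index accelerates ORDER BY ... LIMIT, but a WHERE clause
-- is applied *after* candidates come back from the index
CREATE INDEX ON items USING hnsw (embedding vector_l2_ops);

-- If only a few rows match the filter, the index can hand back a
-- batch of nearest neighbors that the filter then mostly discards,
-- so the query may return fewer than 10 rows even when more
-- matching rows exist further out in the index (overfiltering)
SELECT content
FROM items
WHERE category = 'electronics'
ORDER BY embedding <-> '[0.85, 0.15, 0.05]'
LIMIT 10;
```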
PGVector 0.8: Smarter Decisions and Faster Scans 🦾
The good news? PGVector 0.8 introduced key improvements:
- Improved Cost Estimation: The query planner now better understands the selectivity of your WHERE clauses, making smarter decisions about when to use the HNSW index.
- Iterative Scanning: A new scanning strategy helps avoid full index scans, significantly boosting performance.
- New Parameters: More control over query planner behavior for low-selectivity scenarios.
TL;DR: PGVector 0.8 makes the query planner smarter, leading to faster and more efficient vector searches.
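The new behavior is controlled through settings. The names below match the pgvector 0.8 release notes, but verify them against your installed version:

```sql
-- Let an HNSW scan keep fetching candidates until the WHERE clause
-- is satisfied, instead of stopping after the first batch
SET hnsw.iterative_scan = strict_order;   -- or relaxed_order / off

-- Cap how much of the index an iterative scan may visit
SET hnsw.max_scan_tuples = 20000;

-- Equivalent knobs exist for IVFFlat (which supports relaxed_order only)
SET ivfflat.iterative_scan = relaxed_order;
SET ivfflat.max_probes = 100;
```

`strict_order` preserves exact distance ordering; `relaxed_order` trades a little ordering accuracy for speed, which is often the better default for low-selectivity filters.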
Beyond 0.8: Addressing the Big Challenges 💾
But the journey doesn’t stop there! Let’s peek under the hood and explore the current limitations and what’s being done to tackle them:
- IO-Bound Operations: Disk access is the biggest bottleneck. IVFFlat scans are heavily reliant on disk, and even HNSW graph traversal can be slow under random page reads.
- TOAST Storage Limitations: How PostgreSQL stores large values out-of-line (TOAST storage) constrains how vectors are laid out and read.
- Parallelization Hurdles: Making indexing fully parallelized remains a challenge.
- The Native Vector Type Debate: Should vectors be integrated directly into PostgreSQL, or remain as an extension?
What’s on the Horizon?
- Streaming IO: Implementing PostgreSQL’s new streaming IO capabilities could dramatically improve IVF Flat Scan performance.
- Async IO & Direct IO: Leveraging PostgreSQL 18’s async and direct IO features offers potential gains.
- Resizable Buffer Pools: Allowing resizable buffer pools could better manage memory footprint.
- Bringing Vectors Home: Integrating vectors directly into PostgreSQL could unlock new optimizations.
What This Means for You 👨‍💻
- Data Scientists/ML Engineers: Focus on crafting high-quality embedding models. Experiment with different models and fine-tune them for your use case. A bad model can negate all the performance gains from optimized querying!
- Database Administrators: Monitor query performance and tweak parameters. Understanding the trade-offs between recall (getting all relevant results) and query speed is crucial.
- Application Developers: Design queries that are selective whenever possible. Consider the impact of your WHERE clauses on performance.
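A few concrete starting points for the last two roles (settings and index syntax are from pgvector's documentation; the table and filter column are illustrative):

```sql
-- DBAs: trade recall for speed by widening the HNSW search
SET hnsw.ef_search = 100;   -- default is 40; higher = better recall, slower

-- Then verify the planner actually chose the vector index
EXPLAIN ANALYZE
SELECT content
FROM items
ORDER BY embedding <-> '[0.85, 0.15, 0.05]'
LIMIT 10;

-- Developers: when a selective filter is known ahead of time,
-- a partial index sidesteps overfiltering entirely, because every
-- candidate the index returns already satisfies the predicate
CREATE INDEX ON items USING hnsw (embedding vector_l2_ops)
WHERE (category = 'electronics');
```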
Further Resources:
- pgvector documentation: https://pgvector.net/pgvector
- pgvector GitHub repository: https://github.com/pgvector/pgvector
The future of vector search in PostgreSQL is bright! By understanding the challenges and embracing innovation, we can unlock even greater performance and capabilities. ✨