The Main Vector Database Options
pgvector (PostgreSQL extension)
If you already run PostgreSQL, pgvector adds vector search capability without adding a new infrastructure dependency. Queries can join vector search results with relational data in a single query — a powerful capability for applications where metadata filtering is important. The trade-off: approximate nearest neighbour (ANN) performance at very large scale (hundreds of millions of vectors) lags behind dedicated vector databases. The right choice for most teams that are already PostgreSQL-native and at moderate scale.
Qdrant
Open-source, high-performance, written in Rust. Strong performance on ANN benchmarks, rich filtering capabilities (filter by metadata conditions before or during vector search), and active development. Self-hostable or managed cloud. A strong choice for teams that want production performance, open-source freedom, and the option to self-host.
Pinecone
The most widely adopted managed vector database. Fully managed, serverless option available, strong tooling ecosystem, extensive documentation and examples. The trade-off is cost at high scale and the managed-only model (no self-hosting). A pragmatic choice for teams that want to minimise operational overhead and are comfortable with managed service economics.
Weaviate
Open-source with managed cloud option. Distinctive for its hybrid search (vector + BM25 keyword) built in at the database level, multi-tenancy support, and GraphQL API. Strong for applications requiring hybrid retrieval without building the hybrid layer manually.
Chroma
Lightweight, easy to set up, popular for prototyping and development. Not designed for production scale. Best used as a local development environment for RAG systems, with a migration to Qdrant, Pinecone, or pgvector for production deployment.