Vector Database
A database optimised for storing and searching embeddings (numerical representations of text or images) by similarity.
In plain English
A vector database stores data as high-dimensional vectors (lists of numbers) and lets you search by similarity rather than exact match. It's the storage layer that makes semantic search and RAG possible at scale.
How it works:
- You convert your documents into embeddings (vectors)
- Those vectors are stored in the vector database
- At query time, your question is also converted to a vector
- The database returns the stored vectors closest to your query vector
Popular vector databases:
- Pinecone, Weaviate, Qdrant, Milvus (dedicated vector DBs)
- pgvector (Postgres extension)
- Chroma (lightweight, local-friendly)
Without a vector database, RAG pipelines would be too slow to use in production.