Vector databases, embedding stores, retrieval-augmented generation (RAG) infrastructure, and AI-native search. The retrieval layer behind LLM apps.







Enterprise search & knowledge discovery solution for modern teams

Redis is an in-memory data store used as a vector database, semantic cache and memory layer for AI and agent applications.

Neo4j is a graph database that powers knowledge graphs and GraphRAG so AI apps can ground answers in connected, verifiable relationships.

Serverless vector and full-text search built on object storage — powers Cursor, Notion AI, Linear, Superhuman. 95% cost reduction vs traditional vector DBs.

Open-source vector database for storing data objects and vector embeddings

Open-source vector database and search engine.

Pinecone: Transforming Vector Search for Enhanced Data Retrieval

MongoDB Atlas Vector Search adds semantic vector search to your database for RAG and AI agents.

AI-native multimodal lakehouse and serverless vector DB — embedded retrieval for production-scale generative AI, open source, YC-backed.

High-performance, cloud-native open-source vector database for scalable ANN search — built by Zilliz, Apache 2.0, the most-deployed OSS vector DB in production.

Scalable distributed vector search engine

Multimodal vector search engine for unstructured data

AI for enterprise search and document processing.

AI-native open-source embedding database for efficient data handling.

Zep is a memory platform for AI agents that builds temporal knowledge graphs from chat and business data so agents recall context that changes over time.

Reducto is an agentic document-parsing platform that turns messy PDFs and scans into accurate, LLM-ready data for RAG pipelines and extraction.

Open-source vector search engine for large-scale AI applications
A vector database stores embeddings, the numerical representations of text, images, or audio, and finds the most similar ones to a query. This semantic search powers retrieval-augmented generation, recommendations, and deduplication. Pinecone, Weaviate, and Qdrant are built for it, while Redis and MongoDB add vector search to existing stores.
RAG is a pattern where a model retrieves relevant documents before answering, instead of relying only on its training. The application embeds a query, searches a vector database for matching context, and feeds that context to the model. This grounds answers in your data and reduces hallucination.
Pick a managed service like Pinecone for speed to production with no operations, and open-source options like Weaviate or Qdrant when you want control or self-hosting. If you already run Redis or MongoDB, their built-in vector search avoids adding a system. Decide by scale, latency targets, and how much you want to operate yourself.
Not always. For small document sets, in-memory search or a library can be enough, but a vector database becomes important as data grows, latency matters, or you need filtering and updates at scale. Redis suits in-memory use, while Pinecone, Weaviate, and Qdrant handle larger production workloads.
A regular database finds exact matches on structured fields, while a vector database finds nearest neighbors by meaning using embeddings. One answers where status equals open, the other answers what is most similar to this. MongoDB and Redis now offer both, but dedicated vector databases optimize specifically for similarity search.
Enterprise RAG search connects a model to a company's internal documents and apps so employees get grounded, permission-aware answers. Glean indexes content across tools and applies access controls, so results respect who can see what. It applies the RAG pattern to organizational knowledge rather than a single dataset.
Receive weekly updates so you can stay up-to-date with the world of AI
Receive weekly updates so you can stay up-to-date with the world of AI