How to Set Up a Vector Database for Free

Stand up a free vector database in minutes: enable pgvector on Postgres or run ChromaDB locally, then store and query your first embeddings.

Written by

BSH Technologies

Published on2026-05-25

You can set up a vector database for free in minutes

Setting up a vector database for free is genuinely a few-minutes job: enable the pgvector extension on a PostgreSQL instance you already run, or start ChromaDB locally with a single import. Both store embeddings — the numerical vectors that capture meaning — and find the nearest ones to a query, which is the engine behind semantic search and RAG. Neither costs anything to begin with, and both scale far enough to prove real value before you spend anything at all.

The choice between them is mostly about durability and who else uses the data. Chroma is the fastest path for a local, single-developer prototype; pgvector is the better foundation for anything that needs backups, multiple users, or metadata filtering at scale. You can start with either and migrate later, because the concepts — vectors, dimensions, distance metrics, indexes — are identical across both.

Option A: pgvector on PostgreSQL

If you have Postgres — locally, in Docker, or on a free tier from Supabase or Neon — pgvector turns it into a vector store without a new service to operate, back up, or secure. That consolidation is a genuine advantage: you reuse the operational knowledge and tooling you already have rather than adopting a separate system just for vectors.

Enable the extension on your database in a single statement.
Create a table with a vector column whose dimension matches your embedding model — for example 384 for all-MiniLM-L6-v2, or 1536 for many hosted embedding APIs.
Insert each row with its vector plus ordinary columns for source, text, and any metadata you want to filter on.
Add an HNSW index once you pass a few thousand rows so similarity queries stay fast.

Option B: ChromaDB locally

ChromaDB is an open-source vector database designed to run with almost no setup, which makes it ideal when you just want to validate an idea quickly. Install it, create a collection, and add documents with their embeddings — Chroma can even call an embedding function for you, so you write very little glue code.

Chroma persists to disk when you point it at a directory, so your prototype survives a restart instead of vanishing into memory the moment the process ends.

Step: Get the embedding dimension right

The single most common setup error is a dimension mismatch, and it trips up nearly everyone the first time. Every embedding model outputs vectors of a fixed length, and your column or collection must match it exactly. Insert a 384-dimensional vector into a column expecting 1536 and the database rejects it — or, in some configurations, silently misbehaves in ways that are maddening to debug later.

Check your embedding model's output dimension before creating the table or collection.
Use the same model for stored documents and incoming queries; vectors from different models live in incompatible spaces and are not comparable.
If you switch embedding models later, every stored vector must be regenerated, so keep the source text to rebuild from.

Step: Run your first similarity query

With data loaded, query by embedding a piece of text and asking the store for its nearest neighbours. In pgvector you order by the distance operator that matches your metric — cosine is the usual default for text — and limit to the top few. In Chroma you call the collection's query method with the query text or vector, and it returns the closest documents with their distances.

Match the distance metric to your embedding model; cosine, dot product, and Euclidean are not interchangeable and the wrong one degrades results silently.
Return the stored text and metadata, not just the vector, so the results are actually useful to act on.
Treat the distance as a confidence signal — a very weak best match usually means no good answer exists.

Step: Know the free-tier limits

Free is perfect for learning, but plan ahead so the ceiling does not surprise you. Free Postgres tiers cap storage, connections, and compute; local Chroma is bounded by your machine's memory and has no built-in replication. Both are excellent for a prototype and neither is a production deployment on its own. When real load arrives you will want managed hosting, backups, and monitoring — the unglamorous infrastructure that keeps a service reliable once people depend on it.

Prefer it built and managed for you?

BSH Technologies sets up and operates vector infrastructure that scales past the free-tier ceiling — tuned indexes, correct metrics, metadata filtering, and a re-embedding plan for model upgrades. If semantic search is on your roadmap, talk to BSH Technologies or explore our AI & automation services.

Frequently asked questions

What is the easiest free vector database to set up?

ChromaDB for a local prototype — install it, create a collection, and add documents in a few lines, with no separate service to run. If you already use PostgreSQL, the pgvector extension is just as quick and gives you a durable, multi-user store. Both are free to start and use identical concepts.

Is pgvector free to use?

Yes. pgvector is an open-source PostgreSQL extension with no licence cost, and it runs on free Postgres tiers from providers like Supabase and Neon or in a local Docker container. You only pay when you move to a paid, managed Postgres instance for production scale and reliability later on.

What embedding dimension should my vector column be?

It must match your embedding model exactly. all-MiniLM-L6-v2 outputs 384 dimensions; many hosted embedding APIs output 1536. Check the model documentation before creating the table or collection. A mismatch causes inserts to fail or the store to behave incorrectly, so confirm the number first to avoid a frustrating debug.

Do I need an index on my vector database?

For small datasets an exact scan is fine, but past a few thousand vectors you want an approximate-nearest-neighbour index like HNSW so queries stay fast. In pgvector you create it explicitly; Chroma manages indexing internally. The index trades a sliver of recall for a large speed gain, which is almost always worth it.

Can a free vector database handle production traffic?

Not reliably. Free Postgres tiers cap storage, connections, and compute, and local ChromaDB is limited by your machine with no replication. They are excellent for prototypes and proving value. Production needs managed hosting, backups, and monitoring, which is where a free setup reaches its natural ceiling.

From the blog

View all posts

Applied AI

How to Build an AI Agent for Free in 2026

You can build a working AI agent for free in 2026 using n8n, open-source frameworks, and a free LLM tier. Here is the exact stack and the steps.

BSH Technologies · 2026-06-17