Glossary

Vector database(Vector DB)

A database optimised for storing and searching numerical representations (embeddings) of text, images, or other data. Used to give AI systems memory and search over large document collections.

A vector database stores text (or images, or audio) as numerical arrays called embeddings and lets you search by similarity rather than by exact-keyword match.

In practice: you’d use a vector database to give an AI system a “memory” of your business documents, so when someone asks a question, the system retrieves the most-relevant chunks and feeds them to the model as context.

What it’s used for

The dominant use case in 2026 is RAG (retrieval-augmented generation). The pattern:

  1. Take your business documents (PDFs, Confluence pages, customer support tickets, product manuals).
  2. Split into chunks of ~500 words.
  3. For each chunk, generate an embedding (a 1024-dimensional numerical array) using an embedding model.
  4. Store chunks + embeddings in a vector database.
  5. When a user asks a question, embed the question, search the database for the most-similar chunks, feed those chunks to the AI as context.
  6. AI answers using the retrieved context.

Result: AI can answer questions about your specific business data without needing the entire data in its training set or context window.

DatabaseHosted?Cost (rough)Best for
PineconeHostedStarts at $0; $70+ USD/month at scaleProduction apps, easy setup
WeaviateSelf + hostedFree self-hosted; $25+ USD/month hostedOpen-source preference
QdrantSelf + hostedFree self-hostedPerformance-focused
Postgres + pgvectorSelf-hostedFree (just Postgres)Already-on-Postgres teams
ChromaSelf-hostedFreeLocal dev, small projects
TurbopufferHostedPay-per-useModern serverless

For most Australian SMB use cases, Postgres + pgvector is plenty. Already-running database + a free extension.

Do you need one?

Probably not, if:

  • You have under 100 documents (just paste them into Claude Projects / Custom GPTs)
  • Your context fits in Claude’s 1M-token window
  • You’re prototyping, not building production

You likely do, if:

  • You have thousands of documents to search over
  • You’re building an AI product for customers, not internal use
  • You need sub-second retrieval at scale

The simpler alternative

For most Australian SMBs, Claude Projects + uploading 20 files gives you 80% of what RAG provides at 5% of the complexity. You’re not running a startup; you don’t need Pinecone.

We’ve shipped exactly one client deployment with a real vector database in 2026. The other 47 clients are happy with Claude Projects + Custom GPTs.

See also

Related terms

Want this built for your business?

Book a free 30-minute AI audit. We'll map your business and show you exactly which systems we'd build first. No pitch deck, no scoping fee.

Book my free AI audit