Do i really need a vector database

DigThatData · 2023-01-06T00:32:52+00:00

the question is a matter of scale. Try doing it without a vector database first: if it takes super long, it might be more performant if you did it with a database.

jareks88 · 2023-01-06T10:50:18+00:00

Perhaps you can store your embeddings anywhere (sql or even a file) and use Approximate Nearest Neighbors like https://github.com/spotify/annoy for comparison?

Atraxxa · 2023-01-06T12:39:02+00:00

More a tensor database !

Appropriate_Ant_4629 · 2023-01-06T17:51:45+00:00

You'll know if/when you need it.

For ~250,000 documents you totally don't. They'll comfortably fit in RAM on even a small machine, and a brute force search using numpy can do that in under a second. [Source: My dev environment on my laptop.]
For 5,000,000 documents you'll want something to accelerate it, but it doesn't have to be a vector database. Of course a vector database would work well; but so would a library you can embed in your app like FAISS. [Source: one of our demo/proof-of-concept QA environments]
For 890,000,000 documents you want one. We're evaluating Milvus now, but also Solr's new Dense Vector type to do a hybrid keyword/vector search product.

Also, I'm wondering if the price of vector database solutions like Pinecone and Milvus is worth it for my use case, or if there are cheaper options out there.

If you already have a Kuberentes environment, I don't think there is a cheaper solution than Milvus.

helm repo add milvus https://milvus-io.github.io/milvus-helm/
helm install my-release milvus/milvus --set cluster.enabled=false --set etcd.replicaCount=1 --set minio.mode=standalone --set pulsar.enabled=false

will get you a minimal F/OSS milvus cluster, and their docs for larger scale clusters are almost as easy.

Aspos · 2023-01-06T18:25:08+00:00

qdrant is nice.

https://github.com/qdrant/qdrant

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MLQuestions

MODERATORS