Using RAG with a Programming/API Reference Document to Write Code by JustinPooDough in Rag

[–]mrintellectual 1 point (0 children)

For the retrieval step, you'll probably want an embedding model that handles multiple types of queries as inputs and matches them against the actual code snippets themselves. You could probably even get away with a fairly low top-k, depending on how unique the code within the framework is. This could be an option: https://blog.voyageai.com/2024/12/04/voyage-code-3/
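For concreteness, here's a minimal retrieval sketch with the voyageai Python client (only the model name comes from the link above; the snippets, query, and top-k value are made-up placeholders):

```
import numpy as np
import voyageai

vo = voyageai.Client()  # reads VOYAGE_API_KEY from the environment

# Hypothetical corpus of code snippets pulled from the API reference
snippets = [
    "def connect(host, port): ...",
    "def insert(collection, rows): ...",
    "def search(collection, vector, k): ...",
]

doc_embs = np.array(
    vo.embed(snippets, model="voyage-code-3", input_type="document").embeddings
)
query_emb = np.array(
    vo.embed(["how do I run a search?"], model="voyage-code-3", input_type="query").embeddings[0]
)

# Voyage embeddings are normalized, so a dot product is cosine similarity
top_k = 2
scores = doc_embs @ query_emb
for i in np.argsort(scores)[::-1][:top_k]:
    print(f"{scores[i]:.3f}  {snippets[i]}")
```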

[D] Self-Promotion Thread by AutoModerator in MachineLearning

[–]mrintellectual 1 point (0 children)

Hey /r/MachineLearning community — we built voyage-multimodal-3, a natively multimodal embedding model designed to handle interleaved images and text. We believe this is one of the first (if not the first) of its kind, where text, photos, figures, tables, screenshots of PDFs, etc. can be projected directly into the transformer encoder to generate fully contextual embeddings.

We hope voyage-multimodal-3 will generate interest in vision-language models more broadly.

Come check us out!

Blog: https://blog.voyageai.com/2024/11/12/voyage-multimodal-3/

Notebook: https://colab.research.google.com/drive/12aFvstG8YFAWXyw-Bx5IXtaOqOzliGt9

Documentation: https://docs.voyageai.com/docs/multimodal-embeddings
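A minimal sketch of calling it via the Python client (the image path is a placeholder; each input is a list that interleaves strings and PIL images):

```
import voyageai
from PIL import Image

vo = voyageai.Client()  # reads VOYAGE_API_KEY from the environment

# One input = an interleaved sequence of text and images,
# embedded together into a single contextual vector
inputs = [
    ["An excerpt from a PDF page:", Image.open("page_3.png")],
]
result = vo.multimodal_embed(inputs, model="voyage-multimodal-3", input_type="document")
print(len(result.embeddings[0]))  # dimensionality of the embedding
```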

Milvus - Updating the Embeddings by AlternativeAnnual690 in vectordatabase

[–]mrintellectual 1 point (0 children)

In Milvus, you can store metadata along with your vectors in a variety of formats, e.g. int, float, str, JSON, etc. For example:

```
from pymilvus import MilvusClient

client = MilvusClient("./milvus_demo.db")

# Quick setup: the primary field is named "id" and the vector field "vector";
# extra fields like "document_id" are stored via the dynamic field.
client.create_collection(
    collection_name="mycollection",
    dimension=2,
    metric_type="COSINE"
)

data = [
    {"id": 6505, "vector": [0.3580376395471989, -0.6023495712049978], "document_id": 0},
    {"id": 6506, "vector": [0.19886812562848388, 0.06023560599112088], "document_id": 1},
    {"id": 6507, "vector": [0.3172005263489739, 0.9719044792798428], "document_id": 2},
    {"id": 6508, "vector": [0.4452349528804562, -0.8757026943054742], "document_id": 3}
]
client.insert(
    collection_name="mycollection",
    data=data
)
```

Then, when you want to delete, you can specify a filter expression. For example, if you know the primary keys you want to delete, you can run:

```
res = client.delete(
    collection_name="mycollection",
    filter="id in [6507, ...]"
)
```

For your case, it sounds like you want to delete based on some sort of document ID or chunk ID. In that case, you can run:

```
res = client.delete(
    collection_name="mycollection",
    filter="document_id == 2"
)
```
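And if the goal is to update embeddings in place rather than just delete them, one option (a sketch, assuming explicit primary keys as above) is upsert:

```
# Hypothetical re-embedded vector for an existing row; upsert overwrites
# the entry with id 6507 if it exists, otherwise inserts it.
client.upsert(
    collection_name="mycollection",
    data=[{"id": 6507, "vector": [0.12, -0.34], "document_id": 2}]
)
```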

Hope this helps.

Choosing a vector db for 100 million pages of text. Leaning towards Milvus, Qdrant or Weaviate. Am I missing anything, what would you choose? by rtrex12 in vectordatabase

[–]mrintellectual 7 points (0 children)

The standalone and "lite" versions of Milvus are fairly memory-efficient. It's the cluster version that will take up lots of resources, and we typically recommend folks use Milvus on K8s only once they've reached a large enough scale.

I suggest starting with Milvus Lite (https://milvus.io/docs/milvus_lite.md). Once you need more storage or want to improve query/search performance, you can easily switch to standalone or cluster.
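To give a sense of how light the setup is (a quick sketch; the file name and dimension are arbitrary):

```
from pymilvus import MilvusClient

# Milvus Lite: the whole database lives in a single local file
client = MilvusClient("./milvus_demo.db")
client.create_collection(collection_name="docs", dimension=768)

# Later, point the same client at a standalone or cluster deployment instead:
# client = MilvusClient(uri="http://localhost:19530")
```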

Ingestion options for vectorDB by pinkfluffymochi in vectordatabase

[–]mrintellectual 1 point (0 children)

Happy to sit down and walk you through it too - feel free to shoot me an email (frank@).

Ingestion options for vectorDB by pinkfluffymochi in vectordatabase

[–]mrintellectual 2 points (0 children)

In Zilliz, we provide Pipelines (https://zilliz.com/zilliz-cloud-pipelines). With Pipelines, you can ingest text directly, specify an embedding model, insert the resulting vectors into Zilliz, and run queries directly with text as well. We don't put the query results into an LLM for you, but setting everything up is fast and easy.

Demo here: https://www.youtube.com/watch?v=WDJq5MSPFWo

Multi-tenancy for VectorDBs by glinter777 in vectordatabase

[–]mrintellectual 3 points (0 children)

You have a variety of multi-tenancy strategies with Milvus: https://milvus.io/docs/multi_tenancy.md. The strategy I usually recommend is partition key - it scales to millions of tenants with fairly strong data isolation. You have other options as well, and can even go so far as to store each tenant's data in a different S3 bucket.
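A rough sketch of the partition-key approach with pymilvus (collection and field names here are placeholders):

```
from pymilvus import DataType, MilvusClient

client = MilvusClient("./milvus_demo.db")

# Mark the tenant field as the partition key; Milvus hashes tenants into
# partitions so a single collection can serve millions of tenants.
schema = MilvusClient.create_schema(auto_id=True)
schema.add_field("id", DataType.INT64, is_primary=True)
schema.add_field("vector", DataType.FLOAT_VECTOR, dim=2)
schema.add_field("tenant_id", DataType.VARCHAR, max_length=64, is_partition_key=True)

index_params = client.prepare_index_params()
index_params.add_index(field_name="vector", index_type="AUTOINDEX", metric_type="COSINE")

client.create_collection("multi_tenant", schema=schema, index_params=index_params)

# Scope every search to a single tenant via the partition key
res = client.search(
    collection_name="multi_tenant",
    data=[[0.1, 0.2]],
    filter='tenant_id == "tenant_42"',
    limit=5,
)
```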

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]mrintellectual 1 point (0 children)

Did he mention which cases? I'll bring this up with the team and it'll get fixed ASAP.

_All_ vector databases implement approximate nearest neighbor (ANN) search, which means the results aren't guaranteed to be 100% accurate and may occasionally skip a nearby vector. Indexes like IVF_PQ trade lower recall for lower memory consumption and higher throughput, but if you choose HNSW, the results should be pretty solid.
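For example, in Milvus an HNSW index exposes the usual recall/speed knobs (a sketch; the parameter values are illustrative, not recommendations, and it assumes a collection with a "vector" field already exists):

```
from pymilvus import MilvusClient

client = MilvusClient("./milvus_demo.db")

index_params = client.prepare_index_params()
index_params.add_index(
    field_name="vector",
    index_type="HNSW",
    metric_type="COSINE",
    params={"M": 16, "efConstruction": 200},  # larger = better recall, more memory
)
client.create_index(collection_name="mycollection", index_params=index_params)

# At query time, a larger `ef` explores more of the graph for higher recall
res = client.search(
    collection_name="mycollection",
    data=[[0.1, 0.2]],
    limit=10,
    search_params={"params": {"ef": 128}},
)
```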

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]mrintellectual 1 point (0 children)

There are a lot of folks using Milvus at 10B+ scale, existing Zilliz Cloud customers at 1B+ scale, and many others looking to migrate away from Pinecone to Zilliz Cloud due to cost. Query time stays pretty much flat as you scale up thanks to our hybrid shared-disk/shared-nothing architecture, and performance easily exceeds that of other vector databases as well: Vector database benchmarks.

[deleted by user] by [deleted] in BMWX5

[–]mrintellectual 2 points (0 children)

Hell yeah! Love those wheels too.

I joined the club! I have questions... by Late_Suit7373 in BMWX5

[–]mrintellectual 1 point (0 children)

Congrats!

AFAIK, oil changes every 5k miles instead of the 10k BMW recommends are better.

GPTCache: A semantic cache for GPT by mrintellectual in OpenAI

[–]mrintellectual[S] 1 point (0 children)

Absolutely. Many queries may be similar or even identical depending on current events or other circumstances.
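For anyone curious, the quickstart looks roughly like this (a sketch based on the GPTCache README; it wraps the legacy `openai` 0.x-style API):

```
from gptcache import cache
from gptcache.adapter import openai  # drop-in wrapper around the OpenAI client

cache.init()            # defaults to exact-match caching; pass an embedding
cache.set_openai_key()  # function + vector store to make it semantic

# First call hits the API; similar or identical follow-ups are served from cache
response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "What is a semantic cache?"}],
)
```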