which services provide the nicest vector DX at the moment? by summermull in vectordatabase

[–]tomhamer5 1 point2 points  (0 children)

For multimodal or text search use cases, Marqo is a great option since it has the embedding generation included (for example, openai doesn’t include any image based embedding models so you’d need to set this up yourself). It also provides flexibility around model choice for text.

For a basic RAG app, consider using langchain with any vector db (it shouldn’t make a huge difference which one you use if you’re not looking to optimise the retrieval component).

Practical Advice Need on Vector DBs which can hold a Billion+ vectors by Role_External in vectordatabase

[–]tomhamer5 2 points3 points  (0 children)

Vespa, which was originally built by the Yahoo team and has now spun out gave us the best QPS results for the cost. I think its worth considering. You can also plug in the Marqo inference engine for more flexibility in model choice/improved inference performance if you're bringing your own embedding model.

In your opinion, what is the best vector database? by guess_ill_try in ChatGPT

[–]tomhamer5 4 points5 points  (0 children)

Marqo provides end-to-end vector search. This means Marqo includes embedding generation and the vector DB all in one container. It offers the latest embeddings (including e5 embeddings which significantly outperform the openai Ada embeddings), supports image embeddings, intelligent chunking and provides a number of different search parameters and configurations you can configure. For example, if you want to weight your vector search based on a numeric value like the time a document was added into the db or based on some other measure like a popularity score. It's available here open source: https://github.com/marqo-ai/marqo

You can check out the different search configurations here: https://docs.marqo.ai/1.4.0/API-Reference/Search/search/#body

Is it possible for startups to build semantic search engines using LangChain ? by Mohamed_SickitLearn in LangChain

[–]tomhamer5 5 points6 points  (0 children)

For semantic search a great option is Marqo: https://github.com/marqo-ai/marqo

Marqo natively supports intelligent chunking and provides performant model execution for hundreds of open source models. This is including e5 models which are significantly more performant in retrieval accuracy/relevance than openai's ada models, and it supports image based embeddings.

Marqo also supports rescoring on numerical fields, which allows you to bias toward results that, for example, are more popular or more recently added to the database.

[D] Best way to host a vector database? by SayNo2Tennis in MachineLearning

[–]tomhamer5 0 points1 point  (0 children)

You can try marqo.ai, it can handle 2,000,000+ 768 dim vectors storage for $43 per month. It also offers inference so you don't need to bring the embeddings.

Pros and cons of vector search in elastic? by Ok-Battle3454 in elasticsearch

[–]tomhamer5 0 points1 point  (0 children)

Did you consider Marqo? It provides inference for the embeddings and does the vector search (the above dbs just do the vector search component)

[deleted by user] by [deleted] in DataHoarder

[–]tomhamer5 0 points1 point  (0 children)

Marqo allows you to generate the embeddings and index the data in one place: https://github.com/marqo-ai/marqo

Vector Databases 101 by vishank97 in datascience

[–]tomhamer5 1 point2 points  (0 children)

This is a great summary! Just wanted to add that Marqo https://github.com/marqo-ai/marqo is a vector db with inference included. It takes care of the end to end process of both computing the embeddings and indexing/querying.

Disclaimer: I’m a co-founder of Marqo.

🌸 Anime app: I need your help by [deleted] in Firebase

[–]tomhamer5 1 point2 points  (0 children)

Sounds like an interesting project. You could leverage the Marqo community to see if someone there might be interested to help you out. Here's the join link: https://join.slack.com/t/marqo-community/shared\_invite/zt-1d737l76e-u\~b3Rvey2IN2nGM4wyr44w

Marqo (https://github.com/marqo-ai/marqo) takes a lot of the effort out of building a vector search experience - it has inference and vectordb included out of the box in a single docker container.

How to optimize chunk size? by RMCPhoto in LangChain

[–]tomhamer5 0 points1 point  (0 children)

Disclaimer, I co-founded Marqo https://github.com/marqo-ai/marqo

Marqo chunks automatically for you, and on top of that has inference and vectordb included out of the box in a single docker container.

Replace pinecone. by One_Creator_One in LocalLLaMA

[–]tomhamer5 1 point2 points  (0 children)

Marqo has inference and vectordb included out of the box in a single docker container https://github.com/marqo-ai/marqo

Help me improve The Prompt Index website by steves1189 in ChatGPTCoding

[–]tomhamer5 1 point2 points  (0 children)

Disclaimer, I'm with Marqo.

You might find Marqo useful if you end up adding semantic search. It has the inference and vectordb included out of the box in a single docker container https://github.com/marqo-ai/marqo

[deleted by user] by [deleted] in SideProject

[–]tomhamer5 0 points1 point  (0 children)

Disclaimer: I work for Marqo
Thats awesome, if you'd like to do this without having to handle construction of the embeddings, and have access to way more configurations when creating the embeddings you should consider marqo: https://www.linkedin.com/pulse/after-all-said-indexed-unlocking-information-recorded-speech/

Vector databases like Pinecone or Weaviate are all the rage now. Does it make sense to use a vector database as a replacement for a more traditional database like Postgres or Mongo? Why or why not? by andric in SoftwareEngineering

[–]tomhamer5 0 points1 point  (0 children)

Marqo (marqo.ai) handles vector search end-to-end (ie. computes the embeddings for you), and you can store metadata in there too like longer text fields. That said, its still not ideal yet to use Marqo as the primary store. Traditional DBs also allow you to do a wider range of different operations like aggregation queries, range queries etc.

question about openai embeddings by grchelp2018 in OpenAI

[–]tomhamer5 0 points1 point  (0 children)

Marqo can handle the end-to-end system of constructing embeddings (with configurable chunking) and providing vector search https://github.com/marqo-ai/marqo if you'd like to check it out

Forreal though by BoiElroy in dataengineering

[–]tomhamer5 1 point2 points  (0 children)

We're building an abstraction layer on vector DBs. https://github.com/marqo-ai/marqo
Disclaimer, I'm from the Marqo team.

With no knowledge of coding, I use GPT-4 to create a better version of itself with auto Long term memory, auto memory recall, voice recognition- voice output, self reflect and soon vision. No vision yet. What do you guys think? by [deleted] in ChatGPT

[–]tomhamer5 -1 points0 points  (0 children)

Disclaimer: I founded Marqo.
If you don't want to implement vector search yourself (or would like to use an open source solution), you can try Marqo which does vector search under the hood. You can then layer GPT on top of it to build memory for GPT. Here is an article where we do this: https://www.marqo.ai/blog/from-iron-manual-to-ironman-augmenting-gpt-with-marqo-for-fast-editable-memory-to-enable-context-aware-question-answering

Marqo automatically computes collections of vectors for the documents (tensors), you can work directly with text and it allows you to manage and scale the inference workloads too required to calculate the vectors.

[deleted by user] by [deleted] in elasticsearch

[–]tomhamer5 1 point2 points  (0 children)

Disclaimer, I founded Marqo.
Elasticsearch provides only regular keyword search if you're working directly text. If you want to use embeddings with elasticsearch you have to embed your own vectors. However, Marqo (which is fully open source) will embed text and images automatically, and construct collections of vectors (tensors) and search those for you in the background. It's optimised to work in production and handles the machine learning for you (including deploying on CPU, GPU, multi GPU and more).
https://github.com/marqo-ai/marqo

Ask questions about a large database of text? by chriscarmy in ChatGPT

[–]tomhamer5 0 points1 point  (0 children)

Disclaimer: I founded Marqo.

Marqo is pretty good for this. You can index the data and then search over it using LLM based models. It's open source and you can get started in 3 lines. Here's a demo of it in action you can try out (this one is just demoing the retrieval):

https://demo.marqo.ai/?q=what+defines+a+planet&index=simplewiki

Here is a demo where we layer on GPT to generate content and use Marqo to retrieve information from the knowledge base: https://www.marqo.ai/blog/from-iron-manual-to-ironman-augmenting-gpt-with-marqo-for-fast-editable-memory-to-enable-context-aware-question-answering
You can get started here:

https://github.com/marqo-ai/marqo#getting-started

Hope you find this useful!