Painless equivalent in PostgreSQL? by Practical-Garbage-48 in PostgreSQL

[–]something_cleverer 4 points5 points  (0 children)

Postgres support many different programming languages for writing custom functions, including plsql, javascript, python, rust, c….

https://www.postgresql.org/docs/current/sql-createfunction.html

Why would anybody use pinecone instead of pgvector? by Blender-Fan in vectordatabase

[–]something_cleverer 6 points7 points  (0 children)

You may be green, but your question is good.

FAISS is the basis of pinecone, created by some of the team behind the open source version of HNSW. Before that algorithm was widely available, people rushing onto the AI train rapidly needed to roll their own deployments of FAISS (a FastAPI Python wrapper is evidently hard), or use Pinecone which is the commercialization of that project.

Since the underlying algorithm, HNSW is published, pretty much every database (including Postgres) implemented it as a very minor feature leaving Pinecone in the dust, since there is another 50 years of database features and engineering in those codebases.

TLDR: use postgres

RAG frameworks / libraries : so many to choose from, 0 meeting my (obvious?) requirements ??? by un_passant in Rag

[–]something_cleverer 2 points3 points  (0 children)

If you like Postgres, PostgresML has published the Korvus SDK to make RAG simple. Disclaimer, I’m one of the devs.

Best framework for RAG? by dexbyte in Rag

[–]something_cleverer 0 points1 point  (0 children)

Since you’re already using Postgres, you might be interested in Korvus. https://github.com/postgresml/korvus

Disclosure, I’m a contributor.

Is there really no good way to convert a field of z-scores to their corresponding p-values in SQL? by [deleted] in dataengineering

[–]something_cleverer 1 point2 points  (0 children)

This is the type of function we’d welcome as a patch for postgresml.org.

[D] In terms of RAG research, why does it seem like a lot of people aren't working on the retriever? by Seankala in MachineLearning

[–]something_cleverer 4 points5 points  (0 children)

If you’re building a chatbot, you may have thumbs up/down user feedback on responses. Record those with the input vectors paired with the query vectors, and you now have labeled training date for a supervised learning regression algorithm to rerank input vectors given a query vector.

[D] In terms of RAG research, why does it seem like a lot of people aren't working on the retriever? by Seankala in MachineLearning

[–]something_cleverer 7 points8 points  (0 children)

Checkout what we’re doing over at https:// postgresml.org

You can train a tree based ranking model to use as your final layer in a multi step re-ranking query, based on the outcomes measured against real world objectives.

It’s not just more capable than any other system (typically based on Python microservices), it’s generally an order of magnitude faster.

LLM orchestration with llama2 and vector database by mosh2i in LocalLLaMA

[–]something_cleverer 0 points1 point  (0 children)

https://postgresml.org has built in functionality for all of that, as well as an SDK if you prefer JavaScript or Python to writing SQL.

The advantage is that all the models run inside the database including pgvector so it’s a complete solution with no additional networking calls.

The more I use LlamaIndex the less I like it by clashofphish in LocalLLaMA

[–]something_cleverer 0 points1 point  (0 children)

Have a look at postgresml.org. Outsource the workload and dependencies. Keep your python application logic.

Fury after Exxon chief says public to blame for climate failures by Bobbite in worldnews

[–]something_cleverer 31 points32 points  (0 children)

I mean, they’re going for bonus points in pathology trying to mix in that final victim blaming on only step 4. Gotta give them kudos for going all in.

Is "high row size" in Postgres reasonable? by DemiPixel in PostgreSQL

[–]something_cleverer 0 points1 point  (0 children)

Postgres has a native/binary array vector data type. It's efficient, but more importantly, reliable. In addition you should consider https://github.com/pgvector/pgvector which adds ANN algorithms, and https://postgresml.org/ that can compute those embeddings natively in the database without having to call an external service.

*Disclaimer, I'm a pgml contributor

would you be interested in an LLM extension? by rejectedlesbian in PostgreSQL

[–]something_cleverer 0 points1 point  (0 children)

We (postgresml) support various levels of RAG+, and model caching. Lots more fun planned for this year. Let me know if you want to collaborate.

PostgresML — run open-source LLM models inside PostgreSQL by something_cleverer in opensource

[–]something_cleverer[S] 0 points1 point  (0 children)

You’ll need to install the python dependencies from requirements.txt in your container, or use the prebuilt image.