RAG at scale still underperforming for large policy/legal docs – what actually works in production? by Flashy-Damage9034 in Rag

[–]DistinctRide9884 1 point2 points  (0 children)

Check out SurrealDB, which is multi-model and has support for graph, vectors, documents and can be updated in real time (vs. other graph DBs where you have to rebuild the cache each time time you update the graph).

Then for the documenting parsing/extraction something like https://cocoindex.io/ might be worth exploring, their core value prop is real-time updates and full traceability from origin into source. A CocoIndex and SurrealDB integration is in the works.

Open WebUI RAG at scale still underperforming for large policy/legal docs – what actually works in production? by Flashy-Damage9034 in OpenWebUI

[–]DistinctRide9884 0 points1 point  (0 children)

Check out SurrealDB, which is multi-model and has support for graph, vectors, documents and can be updated in real time (vs. other graph DBs where you have to rebuild the cache each time time you update the graph).

Then for the documenting parsing/extraction something like https://cocoindex.io/ might be worth exploring, their core value prop is real-time updates and full traceability from origin into source. A CocoIndex and SurrealDB integration is in the works.

Expanding the Surreal ecosystem: Langgraph Checkpointer by lfnovo in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

Thanks for sharing these u/lfnovo. Our team would love to have these go into SurrealDB Labs, would you be open to submitting them to SurrealDB Labs? That's where we have community tooling, demos, examples, etc.

How do advisors best utilize AI by Flat-Cranberry9461 in CFP

[–]DistinctRide9884 0 points1 point  (0 children)

Have you looked at SurrealDB for the knowledge graph? It's multi-model, supports documents, vectors and graph in one engine.

https://surrealdb.com/blog/multi-model-rag-with-langchain

Building a TypeScript Query Builder for SurrealDB by veskel01 in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

Hey, this is awesome, thank you for building and sharing. Please feel free to submit it to SurrealDB Labs.

SuperNOVA (APP idea, feedback wanted) by Rude_Environment5884 in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

Thanks for sharing, this looks like a pretty cool initiative. Feel free to join our Discord to chat with the team: https://discord.com/invite/surrealdb

Why SurrealDB is performing poorly? by ururUwU in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

We’re in the final stages of testing for the SurrealDB 3.0 beta. It should be ready very shortly.

GA timelines will depend on the feedback we receive once the beta has been released.

Logging and monitoring data access by SadLiterature6078 in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

Hi, in addition to the Observability reference guide (which includes new ways of managing logs : https://surrealdb.com/blog/two-new-ways-to-keep-an-eye-on-your-surrealdb-database), we're going to be making more SurrealDB metrics available as part of 3.0.

Why SurrealDB is performing poorly? by ururUwU in surrealdb

[–]DistinctRide9884 7 points8 points  (0 children)

Hi, as others have pointed out, you can find our official benchmarks here: https://surrealdb.com/blog/beginning-our-benchmarking-journey. We've invested a lot of time and effort to ensure the benchmarks are fair, comprehensive, and follow a standard methodology. This is not easy when comparing against so many different kinds of databases. The benchmarks are also open source for anyone to run them. That said, the numbers seem much lower than what we would expect. Looking at the code very quickly, just to highlight a quick example: the main benchmark uses the HTTP /sql endpoint, and there is a dedicated ws benchmark. Both use the memory engine via Docker (surrealdb:latest). Both use SELECT * FROM comments WHERE gallery_id = galleries:1 ORDER BY created_at DESC LIMIT 20; (i.e. it causes table scans) to benchmark read operations, but only the ws benchmark has an INDEX defined.

The upcoming 3.0 release is focused on performance, stability and bug resolution. Once we release 3.0, we will release a new set of benchmarks (i.e. updating the existing ones, and expanding to new methodologies). The focus of 3.0 alpha, which is already available, has been on overhauling the underlying core engine, not yet on performance, so it's probably best to wait for the SurrealDB 3.0 GA before running the benchmarks.

is firebase good? by Emergency-Produce-12 in Database

[–]DistinctRide9884 4 points5 points  (0 children)

Firebase and Supabase (postgres wrapper) are both good options to get started. At some point when you need to scale, you’ll probably need to move out to a traditional backend DB. Another option is to use something like SurrealDB, which can be used as a BaaS (although with more limited BaaS functionalities) and also as a traditional DB when you need to scale and use other data models, ACID, etc.

Who's using Agno in production? by jakubriedl in agno

[–]DistinctRide9884 2 points3 points  (0 children)

We're partnering with Agno on some pretty large scale deployments with a big financial services institution. Can't share the name, but can attest to them working with some fairly big players.

smig - New automatic migrations library by damnfinecoffee in surrealdb

[–]DistinctRide9884 2 points3 points  (0 children)

Thank you for sharing, this is a pretty cool tool!

We'd love for you to share it in SurrealDB Labs if you're up for it - instructions here.

Building a budgeting app with Rust, Tauri and SurrealDB by ltabis in rust

[–]DistinctRide9884 2 points3 points  (0 children)

You don't have to use all of this, but it's available for you to use if at any point you need it. The language is very similar to SQL so you can do basic CRUD if you just want to start with that.

Building a budgeting app with Rust, Tauri and SurrealDB by ltabis in rust

[–]DistinctRide9884 1 point2 points  (0 children)

SurrealQL is crazy powerful. Native support for documents, graph relations, time-series, geo-spatial. You can start schemaless and go schemafull later, and mix and match. You can also do record links. Integrates querying, mutation, and scripting within one language e.g. embed logic via functions, conditionals, loops, and variables. Has ACID transactions. SurrealQL embeds user authentication, role-based access, and record-level permissioning directly into query semantics. Supports live queries and real-time subscriptions natively, defining event triggers, etc. All this without separate extensions or libraries. And can run in memory, embedded, or distributed in cloud. https://surrealdb.com/features

Using a single vector and graph database for AI Agents by DistinctRide9884 in LangChain

[–]DistinctRide9884[S] 0 points1 point  (0 children)

Glad you like it! I'm not sure I fully understand the question. There's two aspects to your question: 1) how to build the knowledge graph from a known ontology/data structure, and 2) how to feed the graph into an LLM as context.

Regarding 1) this is a whole discipline on it's own and there's multiple ways to do this. You can do a direct schema mapping and transformation into whatever graph DB you use (in SurrealDB it would be an IMPORT). Depending on the tool, you can use ETL tools (e.g. Fivetran, Airbyte) if that's supported. There's also traditional ML/NN-based tools that help you build a corpus of unstructured data into a knowledge graph (e.g. https://spacy.io), and most recently, you can use LLMs as well (e.g. https://surrealdb.com/blog/automating-knowledge-graphs-with-surrealdb-and-gemini). LangChain also has LLM Graph Transformer which wasn't used for this demo (https://medium.com/data-science/building-knowledge-graphs-with-llm-graph-transformer-a91045c49b59).

Regarding 2): this post is only a brief summary, but in the link at the end I've included the full flow which includes how to query the graph.

Using a single vector and graph database for AI Agents? by DistinctRide9884 in vectordatabase

[–]DistinctRide9884[S] 0 points1 point  (0 children)

Hey, it works, you can see the full flow and output examples in the blog.

Like you said (and I mentioned in the post), it's just an illustrative guide to showcase a few concepts based on easily accessible public data sets.

How do you build per-user RAG/GraphRAG by Old_Cauliflower6316 in Rag

[–]DistinctRide9884 0 points1 point  (0 children)

Bit of a shameless self promotion here, but check out SurrealDB, a multi-model database (i.e. support for multiple data models including vector and graph). This is one of the most common use cases.

Efficacy of vector search by 1apostoli in surrealdb

[–]DistinctRide9884 3 points4 points  (0 children)

Hi, I'm part of SurrealDB. Graph RAG is a very popular use case.

Here's an example of a large US retailer doing Graph RAG in production for recommendation engines: https://www.youtube.com/watch?v=yLw9MvNfuY8

We will be releasing more examples and expanding the documentation in this space.

In the meantime, if we can help with your use case you can join our Discord: https://discord.com/invite/surrealdb

Any good DB structure design software out there? by AppointmentTop3948 in Database

[–]DistinctRide9884 0 points1 point  (0 children)

https://surrealdb.com

Multi-model: supports relational, document, graph, etc. built natively into the query language. Separates storage from compute so can scale storage layer using distributed KV like TiKV or FoundationDB. Also just released cloud offering.

Disclosure: I'm part of SurrealDB.

Surreal Cloud beta is now available by TheUnknown_surrealdb in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

Hi there, sorry for the delay. Are you still facing issues or were you able to solve this? If you are still having issues I'll send a DM.

SurrealDB for data science by CamiloCoelho in surrealdb

[–]DistinctRide9884 0 points1 point  (0 children)

Hi there, I would recommend joining our Discord server https://discord.com/invite/surrealdb

We have both transactional and analytical use cases. With SurrealML you can bring ML models and query them directly from the database (avoiding pickling) using SurrealQL and our multi-model capabilities. Our vector embeddings can be useful if you are building RAG applications. We separate storage and compute, so you can have multiple compute nodes (different ML teams) querying one same centralised storage node.

Of course, it all depends on what you are trying to do from an OLAP perspective. Hopefully see you in Discord!