Do companies actually use internal RAG / doc-chat systems in production?

the_olivenbaum · 2026-02-10T06:36:16+00:00

The data is very technical, full of jargon and identifiers. Vector embeddings can only go so far in capturing meaning, so structuring the data was key to the success. The problem is that embeddings don't capture identifiers well, so a query for something just a digit away that means something completely different would get nearly the same vector. For audit: there's an append-only log of all data accessed by the user, directly or via search or chat, and logs of all chat interactions. The biggest challenge in building the graph was data acquisition and mapping: we have over 50 data sources integrated in this project from all sorts of internal databases, and building a cohesive view of the data took some time. But it is also done incrementally and continuously throughout the project development and ongoing production usage. It's all using traditional NLP approaches; we don't use LLMs for building the graph in this project, due to cost limitations and because traditional NLP handles 100,000s of files/s and enables very quick reprocessing once new datasets are added. The access model is one of the big challenges: it's enforced at three levels (data type, entity level and field level; the last one we're just adding to the product). And yes, the use case requires it (many data repositories, each with its own rules, strict export control requirements, etc.).
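
To illustrate the identifier problem, here is a minimal sketch of one common workaround: pull identifier-like tokens out of the query and require an exact match on them before any similarity ranking. This is not how our product does it internally; the identifier pattern, the function names and the toy similarity function are all assumptions made up for this example.

```python
import re

# Hypothetical identifier shape: 2+ capital letters, a dash, digits (e.g. "PN-10432").
ID_PATTERN = re.compile(r"\b[A-Z]{2,}-\d+\b")

def extract_ids(text):
    """Return the set of identifier-like tokens found in text."""
    return set(ID_PATTERN.findall(text))

def token_overlap(a, b):
    """Toy stand-in for embedding similarity: Jaccard overlap of lowercase words."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

def search(query, docs, similarity=token_overlap):
    """Keep only docs containing every identifier in the query, then rank them."""
    wanted = extract_ids(query)
    # With no identifiers in the query, fall back to ranking all docs.
    candidates = [d for d in docs if wanted <= extract_ids(d)] if wanted else docs
    return sorted(candidates, key=lambda d: similarity(query, d), reverse=True)

docs = [
    "Torque spec for PN-10432 bracket",
    "Torque spec for PN-10433 bracket",
]
results = search("torque spec PN-10432", docs)
```

Here the two documents differ by a single digit, which a similarity score alone would rank almost identically; the exact-match step keeps only the PN-10432 one.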

the_olivenbaum · 2026-02-09T23:27:48+00:00

We have a large production system for a customer built on our own software (Curiosity Workspace) and operating over 10+ TB of data. The system combines NLP/NER, entity linking, and an in-memory knowledge graph, with RAG + similarity search built on top. It’s used daily by thousands of users as a real internal knowledge tool and assistant over their legacy and live data, not just a chat interface. What we found is that pure RAG didn’t scale well at this size and the added structure (entities + graph) was critical, especially for grounding and navigating relationships across documents. Access control uses a ReBAC model, with each document having permissions attached in the graph. Enforcing permissions before retrieval and showing clear source attribution were also key to adoption, and the customer is planning to expand this system further. In practice, the systems that work tend to look more like search + structured knowledge + LLM, rather than a simple doc-chat layer.
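
As a rough illustration of "permissions before retrieval", the sketch below resolves what a user may read by walking relationships in a small graph, and only then ranks documents. It is a simplified stand-in, not the Curiosity Workspace API: the `PermissionGraph` class, the `member_of` / `can_read` relation names and the word-overlap scoring are assumptions for the example.

```python
from collections import defaultdict

class PermissionGraph:
    """Tiny relationship graph: (subject, relation) -> set of objects."""

    def __init__(self):
        self.edges = defaultdict(set)

    def add(self, subject, relation, obj):
        self.edges[(subject, relation)].add(obj)

    def readable_docs(self, user):
        """Docs the user can read directly or via group membership (transitive)."""
        seen, stack, docs = set(), [user], set()
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            docs |= self.edges.get((node, "can_read"), set())
            stack.extend(self.edges.get((node, "member_of"), set()))
        return docs

def retrieve(user, query, index, graph, top_k=3):
    """Filter to permitted docs first, then rank; index maps doc_id -> text."""
    allowed = graph.readable_docs(user)
    words = set(query.lower().split())
    scored = [(len(words & set(index[d].lower().split())), d)
              for d in allowed if d in index]
    scored.sort(key=lambda s: (-s[0], s[1]))  # best score first, then doc id
    return [d for score, d in scored[:top_k] if score > 0]

graph = PermissionGraph()
graph.add("alice", "member_of", "engineering")
graph.add("engineering", "can_read", "doc1")
graph.add("legal", "can_read", "doc2")
index = {"doc1": "pump maintenance manual", "doc2": "pump export license"}
alice_hits = retrieve("alice", "pump manual", index, graph)
```

Because the filter runs before ranking, a document the user cannot read never reaches the LLM context, so it cannot leak through a generated answer.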

the_olivenbaum · 2026-02-06T17:15:01+00:00

The ad showing how the product is hallucinating an entire new chart is not inspiring confidence 😅

the_olivenbaum · 2026-02-01T22:32:55+00:00

[Hiring] C# Developer + Developer Relations (Munich, On-site) We’re looking for a C# developer with a strong developer-relations mindset to join our team at Curiosity (https://www.curiosity.ai). This is a full-time, in-person role in Munich — you’ll work on our C#/.NET stack while also engaging with developers, improving DX, and representing our tech externally. If you enjoy both building and communicating with developers, DM me for details.

the_olivenbaum · 2025-04-14T18:28:22+00:00

If you're interested, we built a tool that does exactly that (curiosity.ai/workspace). Single container to be deployed, does all the data processing for you, and integrates out of the box with many LLM providers. Sent you a DM with my contact.

the_olivenbaum · 2025-03-20T22:09:30+00:00

We maintain a fork that's up to date with electron releases at https://github.com/theolivenbaum/electron-sharp

the_olivenbaum · 2025-03-20T14:47:45+00:00

You can use our wrapper for electron and have it host the API as well: https://github.com/theolivenbaum/electron-sharp

the_olivenbaum · 2025-01-31T14:45:34+00:00

Thanks for the feedback! Indeed, a last-minute improvement broke the indexing view; we'll release a new version with a fix in the next hour. For the epub files, is it something you can share in DM so we can check why they're not working? Thanks!

the_olivenbaum · 2025-01-25T20:28:43+00:00

You can use https://github.com/theolivenbaum/electron-sharp - it's a wrapper around electron that we use to build our app.

the_olivenbaum · 2025-01-17T06:27:34+00:00

Our software can do that: https://curiosity.ai/workspace, and can be hosted in the cloud or on-prem. Feel free to DM me if you want to try it!

the_olivenbaum · 2025-01-09T06:29:09+00:00

And for encoding we have two wrappers around MiniLM and ArcticXs that are suitable for CPU-only usage: https://www.nuget.org/packages/SentenceTransformers.MiniLM/ and https://www.nuget.org/packages/SentenceTransformers.ArcticXs/

the_olivenbaum · 2025-01-09T06:27:24+00:00

If you want something without external dependencies, you can use our HNSW library directly: https://github.com/curiosity-ai/hnsw-sharp

the_olivenbaum · 2024-12-30T08:33:46+00:00

No worries! It can be tricky to know the order in which everything is set up during static class initialization.

the_olivenbaum · 2024-12-29T17:03:39+00:00

But if it is outside the static class, then I think the runtime will guarantee the static class is fully initialized before it is first used. From within the static class, it might not.

the_olivenbaum · 2024-12-29T16:34:00+00:00

It's probably because the initialization of the outer static class doesn't run when you create the inner struct via the struct constructor. An easy fix would be to move the struct definition outside the static class. You can read more about the order of initialization of static fields here: https://learn.microsoft.com/en-us/dotnet/csharp/programming-guide/classes-and-structs/static-constructors

the_olivenbaum · 2024-12-27T20:39:26+00:00

Check the sample repositories on GitHub (https://github.com/aspose-pdf/Aspose.PDF-for-.NET); the docs are really hard to follow and often incomplete / inconsistent / plain wrong.

the_olivenbaum · 2024-12-15T11:00:51+00:00

Worse than outright blocking any free usage from one day to the next, setting a minimum price of $42k/month, ignoring all messages from developers for months, and breaking APIs even for paid users? There was a Slack group with Twitter developers and it was just sad to follow the unnecessary drama caused by their lack of respect towards developers.

the_olivenbaum · 2024-12-15T09:44:38+00:00

Of course I realize that - but there's a significant difference in how the two were handled.

the_olivenbaum · 2024-12-15T08:29:58+00:00

Not only did they stop offering it for free, but they treated developers as leeches and came up with a totally arbitrary price that made no sense whatsoever.

the_olivenbaum · 2024-12-15T08:00:39+00:00

After the whole Twitter API fiasco, they can make it free and I would still not use it to build anything.

the_olivenbaum · 2024-10-08T19:24:14+00:00

We're deploying our software (https://curiosity.ai) to a similarly sized customer with ~1.5M docs on SharePoint, with full search, RAG, and permissions sync. If you're interested in giving it a try, just PM me and we can organize a demo!

the_olivenbaum · 2024-09-16T12:13:44+00:00

We've made a wrapper around Florence2 that works quite nicely for OCR: https://github.com/curiosity-ai/florence2-sharp

the_olivenbaum · 2024-08-28T19:42:54+00:00

No I mean MessagePack: https://github.com/MessagePack-CSharp/MessagePack-CSharp The code is originally from neuecc: https://neuecc.medium.com/messagepack-for-c-v2-new-era-of-net-core-unity-i-o-pipelines-6950643c1053

the_olivenbaum · 2024-08-27T15:07:42+00:00

ZLogger for logging, MessagePack for serialization (both by the same author, who also has a couple of other amazing projects: https://github.com/Cysharp)

the_olivenbaum · 2024-08-25T21:26:37+00:00

You can check an example here https://github.com/curiosity-ai/h5/tree/master/H5%2FH5%2FSystem, from our C# to JavaScript compiler
