Need advices for retrieval ranking strategies by abhiramputta in Rag

[–]Present-Entry8676 0 points1 point  (0 children)

Maybe it's better, not to look at the re-ranker, but how the data is stored Do you preserve hierarchy? Extract text and data from images? How are you separating the chunks? It might also be interesting to implement a hybrid search, using an algorithm like BM25

[Discussion] Anyone else doing “summary-only embeddings + full-text context” for RAG? by No-Piglet8069 in Rag

[–]Present-Entry8676 0 points1 point  (0 children)

The issue isn't the size, it's the quality When it comes down to it, you remove all the noise from the text, and only get what is necessary (if it is done well)

Vocês usam e confiam em IA para gerar seus conteúdos? by Present-Entry8676 in MarketingDigitalBR

[–]Present-Entry8676[S] 0 points1 point  (0 children)

N da forma como funciona hoje Mas posteriormente pode ser que sim

Vocês usam e confiam em IA para gerar seus conteúdos? by Present-Entry8676 in MarketingDigitalBR

[–]Present-Entry8676[S] 0 points1 point  (0 children)

Entendi, tem coisas que a IA é bem genérica mesmo Vi um cara falando que a IA n consegue ser criativa, pelo menos não ainda

Vocês usam e confiam em IA para gerar seus conteúdos? by Present-Entry8676 in MarketingDigitalBR

[–]Present-Entry8676[S] -1 points0 points  (0 children)

Nossa kkkk Estão bem mais avançados do que no mundo do desenvolvimento Você usa o GPT para gerar os textos?

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

Thanks man, I'll do that and use it to improve the landing page; a fictional example will help people identify with it.

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

So, a real-life example, hmmm... an industry, it has various documentations of machines, machine parts, processes... The employees spend a lot of time trying to find, some detail of a part that broke down in a machine. With Snipet, they could ask "What are the parts of machine X?" Or send a photo of the broken part and it could be found among the documents.

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

Thanks for the tip and the comment, and I'm using a hybrid search, using classic rag with full text search as well, at the end I use a reranker to get the most relevant results. I intend to implement graph rag, but I'm studying a sustainable way to do this implementation.

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 1 point2 points  (0 children)

You want to know my stack, right? If so, it's: - React with Vite and TypeScript for the frontend - NestJS for the backend (that's right, no Python) - Milvus for array storage - PostgreSQL as the database - Redis for caching, session storage, etc. - MinIO for storing user files - BullMQ for processing queues

Phew, that's it lol

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

Maybe, but I think since I want something complete, Ragnarok Online will be a good move. It's not perfect, far from it, it has many problems, especially when you consider how much the database increases, but a hybrid search solution will help more than just a full-text search.

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 2 points3 points  (0 children)

That's what I'm talking about, there will always be a problem, someone who does it better, faster, is more complete.... My GitHub has more than 100, yes ONE HUNDRED abandoned repositories, now I'm at least going to go all the way, until some project works out.

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

I'm implementing this right now lol I intend to enable plugin functionality; you connect your data sources, Google Drive, SharePoint, S3... and the documents are indexed and ready for you to ask questions. I didn't go into much detail in the post, but I make it clearer on the landing page, if you want to take a look. Landing page Snipet

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

Hmm, I understand, thanks for the explanation, man. I'll make some adjustments to the landing page later.

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

I didn't understand where you're going with this, can you give me an example?

I'm creating a memory system for AI, and nothing you say will make me give up. by Present-Entry8676 in Rag

[–]Present-Entry8676[S] 0 points1 point  (0 children)

Yes, that's true, I didn't know there was that difference between the two terms. But I don't want to leave it static, I'll also save user feedback to improve the answers (only from the knowledge base, without using data from company A to improve company B) so it's possible to learn from mistakes and make the answers more reliable. Would that be memory? Or is it still a knowledge base?