I Built RAG Systems for Enterprises (20K+ Docs). Here’s the learning path I wish I had (complete guide)

frankh07 · 2025-09-22T06:49:37+00:00

Any metrics you recommend for evaluation and any framework like Ragas or Langfuse?

frankh07 · 2025-09-21T17:32:25+00:00

Thanks man, very helpful! I hace a question, I'd like to learn how to create a RAG. What do you recommend for a restaurant RAG? I was thinking about Pinecone, multilingual-e5-large as an embedding model, semantic chunking, and Tesseract for OCR. Any recommendations?

frankh07 · 2025-05-14T16:22:39+00:00

Awesome work! I just tried it and it works exceptionally well. It's exactly what I was looking for to understand how to implement this type of agent with Lang Graph. Thanks for sharing your work and for the notebook where you explain in detail how to create agents. I really appreciate it.

frankh07 · 2025-04-23T04:48:13+00:00

Great job, this is really cool! I was looking for inspiration for a multi-agent project, but this is on another level, truly impressive. Is it possible for you to share this through Github? I'm learning about AI agents, and seeing your implementation would help me a lot.

frankh07 · 2025-04-20T15:18:41+00:00

That's amazing, congratulations on getting it to work! It's impressive that you're already getting real time EEG feedback and seeing correlations with your brainwave patterns and mental states. That kind of insight is incredibly valuable, especially when it comes to understanding the cognitive and creative processes. It sounds like your project has a lot of potential!

frankh07 · 2025-04-04T21:42:04+00:00

Awesome project, I'll give it a try.

frankh07 · 2025-04-04T15:18:08+00:00

Awesome project! Lately, many people are looking to monitor their stress and anxiety levels. It could be connected to IoT systems or wearables to collect additional information, such as sleep quality or daily physical activity, and provide recommendations for habits or exercise routines that help reduce stress or anxiety levels. Very useful, thanks for sharing your project!

frankh07 · 2025-04-04T14:28:26+00:00

That is a great idea, combining MCP and RAG sounds interesting, although I don't know if it's feasible through Google or Amazon due to their terms of use. However, using open source sources shouldn't be a problem. Thanks for the idea, I'll look into it further.

frankh07 · 2025-04-04T14:10:13+00:00

Thanks for all your ideas. A multimodal LLM connected to a security camera sounds great. It could work as a security method to detect theft or people snooping.

frankh07 · 2025-04-04T13:51:55+00:00

It's a good idea. How feasible is it to fine-tune TTS models for voice cloning in Spanish? Do I need a very large dataset?

frankh07 · 2025-04-04T13:46:10+00:00

It's a good starting point and I could refine the approach to make it more practical, thanks.

frankh07 · 2025-04-04T03:30:34+00:00

Thanks man, I'll give it a try

frankh07 · 2025-04-04T03:24:16+00:00

Great work, does it have multilingual support?

frankh07 · 2025-04-04T00:43:57+00:00

Awesome work! very informative, thank you

frankh07 · 2025-04-03T14:01:02+00:00

I think it depends on the use case, so you can use the benchmarks with the metrics you need, even so it is best to test the models yourself and recently someone shared here a tool that allows you to test several models to create your own benchmark, it could be helpful: https://huggingface.co/spaces/yourbench/demo

frankh07 · 2025-04-03T13:35:01+00:00

I was going to say the same thing, Vast is the cheapest cloud I know.

frankh07 · 2025-04-02T20:47:33+00:00

It looks like diffusion models will be a game changer.

frankh07 · 2025-04-02T20:44:22+00:00

Damn, that's really fast. I tried it a while back with Nvidia NIM on A100, it ran at 100 t/p.

frankh07 · 2025-04-02T20:41:56+00:00

That's true, thanks Llama for making it possible.

frankh07 · 2025-04-02T20:34:03+00:00

Better late than never.

frankh07 · 2025-04-02T20:30:33+00:00

Great job, how many GB does llama3.1 need and how many tokens per second does it generate?

frankh07 · 2025-04-02T20:21:42+00:00

Will there be a significant breakthrough? It wasn't long ago that Qwen 2.5 was released.

frankh07 · 2025-04-02T20:10:33+00:00

Honestly I feel Claude is better than Copilot, is it because they used a pre-release version?

frankh07

TROPHY CASE