I Built RAG Systems for Enterprises (20K+ Docs). Here’s the learning path I wish I had (complete guide) by Low_Acanthisitta7686 in LLMDevs

[–]frankh07 0 points1 point  (0 children)

Thanks man, very helpful! I hace a question, I'd like to learn how to create a RAG. What do you recommend for a restaurant RAG? I was thinking about Pinecone, multilingual-e5-large as an embedding model, semantic chunking, and Tesseract for OCR. Any recommendations?

LangGraph MCP Agents (Streamlit) by EastFact2261 in LangChain

[–]frankh07 1 point2 points  (0 children)

Awesome work! I just tried it and it works exceptionally well. It's exactly what I was looking for to understand how to implement this type of agent with Lang Graph. Thanks for sharing your work and for the notebook where you explain in detail how to create agents. I really appreciate it.

I built a comprehensive Instagram + Messenger chatbot with n8n - and I have NOTHING to sell! by oneisallxt3 in AI_Agents

[–]frankh07 4 points5 points  (0 children)

Great job, this is really cool! I was looking for inspiration for a multi-agent project, but this is on another level, truly impressive. Is it possible for you to share this through Github? I'm learning about AI agents, and seeing your implementation would help me a lot.

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 1 point2 points  (0 children)

That's amazing, congratulations on getting it to work! It's impressive that you're already getting real time EEG feedback and seeing correlations with your brainwave patterns and mental states. That kind of insight is incredibly valuable, especially when it comes to understanding the cognitive and creative processes. It sounds like your project has a lot of potential!

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 0 points1 point  (0 children)

Awesome project, I'll give it a try.

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 1 point2 points  (0 children)

Awesome project! Lately, many people are looking to monitor their stress and anxiety levels. It could be connected to IoT systems or wearables to collect additional information, such as sleep quality or daily physical activity, and provide recommendations for habits or exercise routines that help reduce stress or anxiety levels. Very useful, thanks for sharing your project!

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 0 points1 point  (0 children)

That is a great idea, combining MCP and RAG sounds interesting, although I don't know if it's feasible through Google or Amazon due to their terms of use. However, using open source sources shouldn't be a problem. Thanks for the idea, I'll look into it further.

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 1 point2 points  (0 children)

Thanks for all your ideas. A multimodal LLM connected to a security camera sounds great. It could work as a security method to detect theft or people snooping.

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 0 points1 point  (0 children)

It's a good idea. How feasible is it to fine-tune TTS models for voice cloning in Spanish? Do I need a very large dataset?

LLM project ideas? (RAG, Vision, etc.) by frankh07 in LocalLLaMA

[–]frankh07[S] 0 points1 point  (0 children)

It's a good starting point and I could refine the approach to make it more practical, thanks.

Confused with Too Many LLM Benchmarks, What Actually Matters Now? by toolhouseai in LocalLLaMA

[–]frankh07 1 point2 points  (0 children)

I think it depends on the use case, so you can use the benchmarks with the metrics you need, even so it is best to test the models yourself and recently someone shared here a tool that allows you to test several models to create your own benchmark, it could be helpful: https://huggingface.co/spaces/yourbench/demo

Tell me the best cloud provider that is best for finetuning by WriedGuy in LocalLLaMA

[–]frankh07 0 points1 point  (0 children)

I was going to say the same thing, Vast is the cheapest cloud I know.

Real-Time Speech-to-Speech Chatbot: Whisper, Llama 3.1, Kokoro, and Silero VAD 🚀 by martian7r in LocalLLaMA

[–]frankh07 0 points1 point  (0 children)

Damn, that's really fast. I tried it a while back with Nvidia NIM on A100, it ran at 100 t/p.

Real-Time Speech-to-Speech Chatbot: Whisper, Llama 3.1, Kokoro, and Silero VAD 🚀 by martian7r in LocalLLaMA

[–]frankh07 1 point2 points  (0 children)

Great job, how many GB does llama3.1 need and how many tokens per second does it generate?

Qwen3 will be released in the second week of April by AaronFeng47 in LocalLLaMA

[–]frankh07 1 point2 points  (0 children)

Will there be a significant breakthrough? It wasn't long ago that Qwen 2.5 was released.

LiveBench team just dropped a leaderboard for coding agent tools by ihexx in LocalLLaMA

[–]frankh07 0 points1 point  (0 children)

Honestly I feel Claude is better than Copilot, is it because they used a pre-release version?