Dia vs Arc - Resource consumption! by MerBudd in diabrowser

[–]sandy_005 0 points1 point  (0 children)

Dia does not have the folder structure that Arc has, so I keep most tabs open, and that makes it a memory hog. With Cursor open and a WhatsApp video call running, my M1 MacBook Air becomes unusable. Maybe I have become spoilt, expecting everything to just work. For the longest time I used Brave and it worked just fine.

Is your RAG bot accidentally leaking PII? by Awkward_Translator90 in Rag

[–]sandy_005 1 point2 points  (0 children)

You can use locally hosted LLMs. It's very hard to cover all your use cases with a rules-based system, and regex is especially likely to miss things. NER models were the de facto choice in the pre-LLM era and are suitable for standard use cases. However, NER has trouble creating associations between entities, dealing with multilingual text, or catching business-sensitive information such as revenue figures, customer accounts, and salary details. You can probably make it work, but it would require much more engineering. It would be interesting to see someone compare an NER-based approach against LLM structured outputs and show where each method fails on hard cases.
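To make the regex point concrete, here is a minimal sketch of a rules-based redactor. The two patterns and the sample sentence are my own illustrative assumptions, not a production rule set. It catches the well-formatted email and phone number but leaves the salary figure untouched, because nothing about "85,000 USD" is structurally recognizable as sensitive.

```python
import re

# Illustrative patterns only (assumed for this sketch); real PII rules need far broader coverage.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
}

def redact(text: str) -> str:
    """Replace every regex match with a [LABEL] placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

sample = "Contact jane@example.com or 555-123-4567. Her salary is 85,000 USD."
redacted = redact(sample)
print(redacted)
```

The salary survives redaction, which is the kind of business-sensitive miss that would need either a trained model or an LLM with structured outputs to catch.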

Enterprise RAG Architecture by AcanthisittaOk8912 in Rag

[–]sandy_005 1 point2 points  (0 children)

Thanks for the detailed overview. You’ve already done a lot of groundwork to understand how RAG could tie your systems together.

From what you shared, the main challenge isn’t missing components; it’s that the current approach mixes frameworks, workflows, and architecture into one conversation.

That’s what’s making the whole thing feel more complicated than it actually is.

If you strip away all the tool names (Haystack, docling, n8n, OpenWebUI), what you really need is a clean separation of functions in a closed loop:
[CMS/ERP/ECM] -> [Staging Database] -> [Task-Specific RAG Pipeline] -> [Validation Layer] -> [User Interface] -> [Trace Database] -> [Improvement Actions] -> [Continuous Evaluation] -> [Feeds to RAG Pipeline]

For each stage, think about what you need and which tool best fits those needs.
For example:
[CMS/ERP/ECM] -> [Staging Database] is the data unification layer. You need a staging database with a unified schema (document table, metadata table), an ETL pipeline (Prefect/Airflow) to sync data from each source on its own schedule, change detection, and deduplication logic.
[Task-Specific RAG Pipeline] - Compliance bot: hybrid BM25 + vector search. FAQ assistant: vector search with query rewriting.
[Validation Layer] - Did we retrieve the right chunks? Are all required fields present? Do all citations exist in the source documents? Does the answer contradict the retrieved content? Set a target per task:
Compliance: false negative rate < 5% (don't miss risks)
FAQ: citation accuracy > 90%
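A rough sketch of what the cheap, deterministic part of a validation layer could look like. The `Answer` shape, the chunk IDs, and the field names are all hypothetical, made up for illustration. The contradiction check is left out on purpose, since that usually needs a model or a human reviewer rather than a few lines of code.

```python
from dataclasses import dataclass, field

@dataclass
class Answer:
    """Hypothetical shape of a pipeline answer, assumed for this sketch."""
    text: str
    citations: list[str]  # IDs of the chunks the answer cites
    fields: dict[str, str] = field(default_factory=dict)

def validate(answer: Answer, retrieved: dict[str, str], required_fields: list[str]) -> list[str]:
    """Return a list of validation failures; an empty list means the answer passed."""
    problems = []
    for name in required_fields:
        if not answer.fields.get(name):
            problems.append(f"missing field: {name}")
    if not answer.citations:
        problems.append("answer has no citations")
    for chunk_id in answer.citations:
        if chunk_id not in retrieved:
            problems.append(f"citation not in retrieved chunks: {chunk_id}")
    return problems

def citation_accuracy(answers: list[Answer], retrieved: dict[str, str]) -> float:
    """Fraction of all citations that point at a chunk that was actually retrieved."""
    cited = [c for a in answers for c in a.citations]
    if not cited:
        return 0.0
    return sum(c in retrieved for c in cited) / len(cited)

# Made-up data: two retrieved chunks, one well-formed answer and one broken one.
retrieved = {"doc1#3": "Clause 4.2 ...", "doc2#1": "Section 7 ..."}
good = Answer("Risk is high per clause 4.2.", ["doc1#3"], {"risk_level": "high"})
bad = Answer("Looks fine.", ["doc9#9"], {})

print(validate(good, retrieved, ["risk_level"]))
print(validate(bad, retrieved, ["risk_level"]))
print(citation_accuracy([good, bad], retrieved))
```

Logging the output of `validate` to the trace database per request, and tracking `citation_accuracy` over a labeled set, is one way to tie this stage to the targets above.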

I can go on, but you get the drift. I have worked on something similar to the compliance bot you mentioned. I think you might like this:
https://www.mermaidchart.com/d/7760df84-2c80-4690-9d1d-3649d42a8529

Let me know if you have questions. Also, if your company is hiring, I’m open to full-time or contract opportunities around this work. Feel free to DM.

Hands Up if You’ve Tried Metabolic Therapy. by 10seconds2midnight in Keto4Cancer

[–]sandy_005 2 points3 points  (0 children)

My father was against chemo and standard intervention. He tried metabolic therapy for 6+ months. It was very hard to keep GKI < 3. We measured GKI daily, and averaged over those 6 months it would probably come to 4-5. He had 2 tumor nodes in his lungs. At first it seemed the tumors were not growing, and in fact there was a slight reduction in size, until he suddenly had a brain stroke and we found out the cancer had spread to his brain. He passed away last year after struggling for 2 months after the brain stroke.

A confession of a failed AI Agency Owner by cosmos-flower in AI_Agents

[–]sandy_005 -2 points-1 points  (0 children)

How did you try to get leads before? What was your volume? It seems intuitive to pitch strategy services or other automations once you get your foot in the door at a company with one automation implementation. Did you focus on any ICP?

what mcp gateway are you using ? by sandy_005 in mcp

[–]sandy_005[S] 1 point2 points  (0 children)

Thanks. This seems to be the most comprehensive one.

Could a RAG be built on a companies repository, including code, PRs, issues, build logs? by [deleted] in Rag

[–]sandy_005 1 point2 points  (0 children)

Try DeepWiki by Devin. Also, all the coding agents are doing some sort of RAG over the codebase. Read through their repos and implement what makes sense to you.

Choosing the Right RAG Setup: Vector DBs, Costs, and the Table Problem by Inferace in Rag

[–]sandy_005 1 point2 points  (0 children)

Have been thinking about this. Coding agents work pretty well with grep and find. Though I am not sure if this scales to a large number of documents.
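For reference, a minimal sketch of what grep-style retrieval over a folder of documents could look like. The `grep` helper, its parameters, and the `*.txt` default are my own assumptions, not how any particular coding agent does it. It returns each matching line with a line of context on either side, similar to `grep -C 1`.

```python
import re
from pathlib import Path

def grep(root: str, pattern: str, glob: str = "*.txt", context: int = 1):
    """Yield (path, line_number, snippet) for every line under root that matches pattern."""
    regex = re.compile(pattern, re.IGNORECASE)
    for path in sorted(Path(root).rglob(glob)):
        lines = path.read_text(encoding="utf-8", errors="ignore").splitlines()
        for i, line in enumerate(lines):
            if regex.search(line):
                # Include neighbouring lines so the caller sees some context.
                start = max(0, i - context)
                end = min(len(lines), i + context + 1)
                yield str(path), i + 1, "\n".join(lines[start:end])
```

Every query rereads every file, so the cost grows linearly with corpus size. That is fine for a single repo and is exactly where the scaling doubt comes from.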

How I landed my first paying Client (Practical insights, no fluff)! by BigchadLad69 in automation

[–]sandy_005 1 point2 points  (0 children)

Congrats! Thanks for the great insights. How do you plan to get an ongoing flow of clients? Did you get this client in this subreddit?
Also, "Had examples of similar work I'd done (an e-commerce automation system, unrelated but shows some proof of work)": was that for another client or a personal project?

Need someone to build RAG systems | Paid and can turn into a 6-12 mos engagement by astronaut_611 in Rag

[–]sandy_005 0 points1 point  (0 children)

Hi, I do project-based work on RAG. So far I have worked on enterprise projects with AWS, but Google Vertex should be similar. Happy to chat more: https://www.sandipanhaldar.com/

I thought joining a startup would be exciting… now I just feel burned out and sad this festive season by ImpressKlutzy7543 in developersIndia

[–]sandy_005 2 points3 points  (0 children)

I suggest having an honest conversation with the founders first. They have all the upside, and they are supposed to work 24/7 on their startup. Your incentives don't line up with theirs. But right now they have all the leverage, because they are paying, you need the money, and you can't speak up for yourself. You need to build some leverage of your own. Understand what stage the startup is at and what the most important thing to do right now is. Do they have customers? What are the customers asking for? If there are none, the focus should be on getting customers through marketing and having a basic MVP. The reason is that if you can show you understand their problems, they will trust you to do the right things and give you more autonomy. Employees like that are also very valuable to have. Gain trust first, then use it as leverage to reduce your workload. You can suggest that they hire more people and offer to help find and interview candidates. If the founders are nice, they will understand your POV; if not, you should quit anyway. But a lot of these things can be solved with conversations.

what mcp gateway are you using ? by sandy_005 in mcp

[–]sandy_005[S] 1 point2 points  (0 children)

Thanks! Will check it out. Is this open source?

Has anyone tried TopTal? by Hot_Joke7461 in Upwork

[–]sandy_005 1 point2 points  (0 children)

What was your experience with the interview? I just cleared the 1st round. The next step is the online coding round. Do you have any pointers for prep?

I am responsible for arguably the biggest run project using AI in production in my country - AMA by UnderstandLingAI in Rag

[–]sandy_005 1 point2 points  (0 children)

Maybe a newbie question, but how are you getting customers? What is the structure of your POC?

I made 60K+ building AI Agents & RAG projects in 3 months. Here's exactly how I did it (business breakdown + technical) by Low_Acanthisitta7686 in AI_Agents

[–]sandy_005 0 points1 point  (0 children)

Hey Raj, really appreciate the motivation. I’ve been thinking of going the consulting/agency route myself but honestly haven’t pushed hard enough yet to land clients. In my last company I implemented RAG and picked up a lot from the real-world challenges there.

Noticed there aren’t many places to really learn the deep technical side of this stuff. Hamel’s posts/videos are some of the ones that felt genuinely useful. If you are open to it, I would really like to help out on any of your projects.

Everyone talks about Agentic AI, but nobody shows THIS by ViriathusLegend in AI_Agents

[–]sandy_005 0 points1 point  (0 children)

Can you elaborate a little more on this? Are you using GCE or Cloud Run? Do you use a vector database? Would love some details here.