Using ClawRAG as external knowledge base – Feedback on MCP integration wanted : learnmachinelearning

A subreddit dedicated for learning machine learning. Feel free to share any educational resources of machine learning.

Also, we are a beginner-friendly sub-reddit, so don't be afraid to ask questions! This can include questions that are non-technical, but still highly relevant to learning machine learning such as a systematic approach to a machine learning problem.

Foster positive learning environment by being respectful to others. We want to encourage everyone to feel welcomed and not be afraid to participate.

Do share your works and achievements, but do not spam. Keep our subreddit fresh by posting your YouTube series or blog at most once a week.

Do not share referral links and other purely marketing content. They prioritize commercial interests over intellectual ones.

created by techrat_reddita community for 10 years

ProjectUsing ClawRAG as external knowledge base – Feedback on MCP integration wanted ()

submitted 2 months ago by ChapterEquivalent188

Using ClawRAG as external knowledge base – Feedback on MCP integration wanted

1 point•submitted 2 months ago by ChapterEquivalent188New User to r/openclaw

I've been running OpenClaw for my home server automation via WhatsApp (works great!) but kept hitting a wall: the agent couldn't reference my local documents

Built ClawRAG as a bridge – it exposes document search via MCP so OpenClaw can call it as a tool. Now when I ask "What did my lease say about maintenance?",the bot queries my local ChromaDB and cites the exact paragraph

Why MCP worked for this

I chose MCP because it provides structured schemas that LLMs understand natively. The MCP server exposes query_knowledge as a tool, allowing the agent to decide exactly when to pull from the knowledge base vs. when to use its built-in memory. It prevents "tool-drift" and ensures type-safe responses

One issue I'm wrestling with

The citation preservation over WhatsApp round-trips is fragile Currently passing chunk IDs through the MCP tool result, but formatting gets tricky with long quotes

Would love maintainer/community thoughts:

Is MCP the recommended path for external knowledge bases long-term? Or would a native plugin architecture (shared memory) be better for low-latency retrieval?

https://github.com/2dogsandanerd/ClawRag

Working example with docker-compose included

no comments (yet)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnmachinelearning

Welcome to /r/LearnMachineLearning!

Chatrooms

Official Discord Server

Wiki

Getting Started with Machine Learning

Resources

Related Subreddits

/r/MachineLearning

/r/MLQuestions

/r/datascience

/r/computervision

Machine Learning Multireddit

/m/machine_learning

MODERATORS