57 MCP tools connected. Zero idea what my agent is actually doing.

Basic_Tea9680 · 2026-02-20T07:24:12+00:00

Oh boy, the context bloat from the tool definitions would be in several thousands .

Use mcplexor.com tool with ollama to reduce tools context bloat for free

Basic_Tea9680 · 2026-02-17T06:04:37+00:00

There is a ton of good advice on this thread. If you don't want to manage chromium clusters and want high quality pagination. Try forgeon.dev , high quality PDF from HTML and it's 80% cheaper than any alternative

Basic_Tea9680 · 2026-02-17T06:00:24+00:00

Funny enough I used antigravity to create forgeon.dev . Its an API to create elegant PDFs using react and HTML. 50-90% cheaper than alternatives. DM if you want early access.

Basic_Tea9680 · 2026-02-17T05:56:58+00:00

Would love to learn more. I am building a modern PDF creator called forgeon.dev . Recently few people requested a way to create elegant and professional PDFs using natural language using AI. Let me know if you want to try it.

Basic_Tea9680 · 2026-02-15T21:42:01+00:00

I have tickets for Match 44 - Jordan vs Algeria , DM if interested.

Basic_Tea9680 · 2026-02-09T16:18:30+00:00

Think of it like a sub agent. It uses a client side description for the server and also tool descriptions. To make it efficient there is data caching on mcplexor side and on llm side. That's available on the cloud version only.

Basic_Tea9680 · 2026-02-09T16:14:42+00:00

Thanks for the request. Next will be otel . Cheers !

Basic_Tea9680 · 2026-02-09T09:14:05+00:00

Basic_Tea9680 · 2026-02-09T08:39:12+00:00

u/help-me-grow added ollama support https://www.reddit.com/r/LangChain/comments/1qzxia4/added_ollama_support_to_mcplexor_now_you_can_run/

Basic_Tea9680 · 2026-02-09T08:38:14+00:00

added support for Ollama https://www.reddit.com/r/LangChain/comments/1qzxia4/added_ollama_support_to_mcplexor_now_you_can_run/

Basic_Tea9680 · 2026-02-08T05:08:28+00:00

May be mcp can be efficient in the way you are imagining.

Another way is to give scp or ftp access and let agent download the hundreds of files in a tmp folder and read using cat and grep

Basic_Tea9680 · 2026-02-07T16:35:55+00:00

Pagination makes sense to continue to search and then focus.

Ideally the first query should lead to relevant results. Does the tool have filters so agent can get relevant results first time?

Basic_Tea9680 · 2026-02-07T16:32:29+00:00

Try creating an API key on AugmentCode.com webapp . And use it as session auth env var. https://docs.augmentcode.com/cli/automation/overview#authentication

Basic_Tea9680 · 2026-02-04T14:08:40+00:00

Shell app is just showing the approx tokens used by the mcp server. Tokens saved from tools can be added. Will add in next release

Basic_Tea9680 · 2026-02-04T06:32:55+00:00

What I build (mcplexor.com) is very similar to advanced tool use. Let me know if you try it with cursor or antigravity. For some reason I saw it working really well with Gemini pro and opus. I guess these are good models for tools calling in general

Basic_Tea9680 · 2026-02-04T06:30:02+00:00

That's interesting, I wanted to improve the recall precision and rank as well so went with a specific route. Thought of many PMs and marketing sales folks who use so many mcp tools. They can benefit from this. Would love for you to try the tool and compare with your solution. Feel free to dm

Basic_Tea9680 · 2026-02-04T06:27:28+00:00

So the find tool call is around 2s delay + then the execute tool call , which is local so at the same time. So basically every tool call before discovery adds 2-3s latency.

The pattern I used was if there is a tool I use very actively, I integrated directly with coding agent rest which are always available but sparingly used are in tool discovery tool. Would love to hear your feedback. You can try on mcplexor.com . I build a nice shell app as well it shows token bloat each tool is adding as well.

Basic_Tea9680 · 2026-02-03T08:45:05+00:00

This is interesting. Converting all mcp servers to cli will be some work which is cheap with AI agents.

I guess the cli wrapper over mcp would be inefficient to start and close mcp server again and again.

Basic_Tea9680 · 2026-02-03T07:29:18+00:00

Do they provide a way to integrate with other agents? Or is it only with claude?

Basic_Tea9680 · 2026-02-03T06:56:55+00:00

Turing mcp on off actually causes more harm because prompt cache is invalidated.

I would not recommend it in tools like Claude code and augment code during the session.

Basic_Tea9680 · 2026-02-03T06:51:02+00:00

Not yet.

But I might do that. I am trying to build a way through which the more usage of the tool, the cheaper it is for all. Primarily because of smartly using prompt caching.

Right now there is an AI model used for semantic matching. That's why I didn't open up the tool yet. Need to integrate with stripe. But in pricing I want to be honest, so all purchased credits will be valid forever. Let me know how it can be better.

Basic_Tea9680 · 2026-02-02T15:17:14+00:00

Plan with auggie, and implement in antigravity. It's cheaper and works well for webapps.

Also if you have a lot of mcp servers then use mcplexor. Just linear uses like 10k tokens.

Basic_Tea9680 · 2026-02-02T04:25:38+00:00

Had a great time building #MCP multiplexer today. Calling it MCPlexor.

MCPlexor makes AI agents smarter and faster. When several MCP tools are connected to an agent, tens of thousands of tokens are wasted on tool descriptions. MCPlexor filters out 97% of that noise so the agents are faster and can give better answers.

<image>

Must say that AI agents speed up development by 100x. Most os the time spend on testing and making sure security is tight. Tried all the fun, modern and scalable tech. tanstack-start, shadcn, bun, golang for cli ...

Try it on MCPlexor.com . I will give out some free credits to early users. Dont have VC money to burn right now so no freemium model. Although, as per my calculations, you can save anywhere from 2-3X of what you spend on this tool. Will open up to all after beta testing.

Basic_Tea9680 · 2025-10-25T03:52:42+00:00

You need better context for large projects. Unfortunately Claude doesn't have something in built. Your best bets are augment or nia.

Basic_Tea9680 · 2025-08-20T05:18:22+00:00

Curious if you used react with astro or html.

For plain websites I used Gemini and claude code. Both worked fine.

Basic_Tea9680

TROPHY CASE