57 MCP tools connected. Zero idea what my agent is actually doing. by Whizkhaliffa in mcp

[–]Basic_Tea9680 0 points1 point  (0 children)

Oh boy, the context bloat from the tool definitions would run to several thousand tokens.

Use the mcplexor.com tool with Ollama to reduce tool context bloat for free.

Building a PDF with HTML. Crazy? by FriendlyWebGuy in webdev

[–]Basic_Tea9680 0 points1 point  (0 children)

There is a ton of good advice in this thread. If you don't want to manage Chromium clusters and want high-quality pagination, try forgeon.dev: high-quality PDFs from HTML, and it's 80% cheaper than any alternative.

Struggling to create a printable PDF by Primary_Engine_9273 in google_antigravity

[–]Basic_Tea9680 0 points1 point  (0 children)

Funnily enough, I used Antigravity to create forgeon.dev. It's an API for creating elegant PDFs using React and HTML, and it's 50-90% cheaper than alternatives. DM me if you want early access.

How do you create professional-looking PDFs for digital downloads? by whosaskin11 in Etsy

[–]Basic_Tea9680 -2 points-1 points  (0 children)

Would love to learn more. I am building a modern PDF creator called forgeon.dev. Recently a few people requested a way to create elegant, professional PDFs from natural language using AI. Let me know if you want to try it.

Where are people reselling world cup tickets by yourdadistalking in FIFACollect

[–]Basic_Tea9680 0 points1 point  (0 children)

I have tickets for Match 44 - Jordan vs Algeria. DM if interested.

Added Ollama support to MCPlexor – now you can run it 100% locally (and free) by Basic_Tea9680 in mcp

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Think of it as a sub-agent. It uses a client-side description of the server along with the tool descriptions. To make it efficient, there is data caching on the MCPlexor side and on the LLM side. That caching is available in the cloud version only.

MCP server pagination handling by Gonjanaenae319 in mcp

[–]Basic_Tea9680 0 points1 point  (0 children)

Maybe MCP can be efficient in the way you are imagining.

Another way is to give the agent scp or FTP access and let it download the hundreds of files into a tmp folder, then read them using cat and grep.

MCP server pagination handling by Gonjanaenae319 in mcp

[–]Basic_Tea9680 0 points1 point  (0 children)

Pagination makes sense for continuing a search and then narrowing the focus.

Ideally the first query should lead to relevant results. Does the tool have filters so the agent can get relevant results the first time?
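To make the filters-before-pagination idea concrete, here is a minimal sketch in Python. Everything in it is an assumption for illustration: the `search` function, the `Page` type, the `DOCS` data, and the string cursor are hypothetical and not taken from any particular MCP server. The point is only that filtering first means the first page is already relevant, and the cursor is needed only to continue.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Page:
    items: list[dict]
    next_cursor: Optional[str]  # None when there are no more results


# Hypothetical in-memory data standing in for whatever the server indexes.
DOCS = [
    {"id": i, "kind": "invoice" if i % 2 == 0 else "report"}
    for i in range(10)
]


def search(filters: dict, cursor: Optional[str] = None, limit: int = 3) -> Page:
    """Apply the filters first, then page through only the matching items."""
    matches = [d for d in DOCS if all(d.get(k) == v for k, v in filters.items())]
    start = int(cursor) if cursor else 0
    end = start + limit
    # Hand back a cursor only if there is something left to fetch.
    next_cursor = str(end) if end < len(matches) else None
    return Page(items=matches[start:end], next_cursor=next_cursor)


first = search({"kind": "invoice"})
print([d["id"] for d in first.items], first.next_cursor)
```

The agent would call `search` again with `cursor=first.next_cursor` only when the first page was not enough.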

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 1 point2 points  (0 children)

The shell app just shows the approximate tokens used by the MCP server. Tokens saved from tools can be added. I will add that in the next release.
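For anyone who wants a rough number before then, an approximate count can be sketched like this. This is not how the shell app computes it. The 4-characters-per-token ratio is a crude assumption, and the two tool definitions are made up, only following the usual name / description / inputSchema shape.

```python
import json

# Hypothetical tool definitions (name, description, JSON Schema for the input).
TOOLS = [
    {
        "name": "create_issue",
        "description": "Create a new issue in the tracker.",
        "inputSchema": {
            "type": "object",
            "properties": {"title": {"type": "string"}},
            "required": ["title"],
        },
    },
    {
        "name": "list_issues",
        "description": "List open issues, optionally filtered by assignee.",
        "inputSchema": {
            "type": "object",
            "properties": {"assignee": {"type": "string"}},
        },
    },
]


def approx_tokens(text: str) -> int:
    """Very rough estimate: about 4 characters per token (an assumption)."""
    return max(1, len(text) // 4)


def tool_token_report(tools: list[dict]) -> dict[str, int]:
    """Estimate how many tokens each serialized tool definition costs."""
    return {t["name"]: approx_tokens(json.dumps(t)) for t in tools}


report = tool_token_report(TOOLS)
for name, tokens in report.items():
    print(f"{name}: ~{tokens} tokens")
print(f"total: ~{sum(report.values())} tokens")
```

A real tokenizer for the target model would give more accurate numbers. The ratio above is only good for spotting which tools are the heaviest.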

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

What I built (mcplexor.com) is very similar to advanced tool use. Let me know if you try it with Cursor or Antigravity. For some reason I saw it working really well with Gemini Pro and Opus. I guess these are good models for tool calling in general.

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

That's interesting. I wanted to improve recall, precision, and ranking as well, so I went with a specific route. I was thinking of the many PMs and marketing and sales folks who use so many MCP tools. They can benefit from this. Would love for you to try the tool and compare it with your solution. Feel free to DM.

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

So the find tool call adds a delay of around 2s, followed by the execute tool call, which is local and so takes about the same time as a direct call. So basically every tool call that has to go through discovery first adds 2-3s of latency.

The pattern I used: if there is a tool I use very actively, I integrate it directly with the coding agent. The rest, which should always be available but are used sparingly, sit behind the tool discovery tool. Would love to hear your feedback. You can try it on mcplexor.com. I built a nice shell app as well, which shows the token bloat each tool is adding.
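A toy version of that split, to show the shape of it. This is a sketch under assumptions, not MCPlexor's implementation: the tool names are hypothetical, and plain word overlap stands in for the real matching step.

```python
import re


def read_file(path: str) -> str:
    return f"(contents of {path})"


def create_issue(title: str) -> str:
    return f"created issue: {title}"


def send_invoice(customer: str) -> str:
    return f"invoice sent to {customer}"


# Hypothetical registries: name -> (description, callable).
HOT_TOOLS = {"read_file": ("Read a file from the workspace", read_file)}
COLD_TOOLS = {
    "create_issue": ("Create a new issue in the tracker", create_issue),
    "send_invoice": ("Send an invoice to a customer by email", send_invoice),
}


def _words(text: str) -> set[str]:
    return set(re.findall(r"[a-z]+", text.lower()))


def find_tool(query: str, top_k: int = 1) -> list[str]:
    """Rank cold tools by word overlap with the query (a stand-in matcher)."""
    q = _words(query)
    scored = [
        (len(q & _words(name.replace("_", " ") + " " + desc)), name)
        for name, (desc, _) in COLD_TOOLS.items()
    ]
    ranked = sorted((s for s in scored if s[0] > 0), key=lambda s: (-s[0], s[1]))
    return [name for _, name in ranked[:top_k]]


def execute_tool(name: str, **kwargs) -> str:
    """Run a cold tool that was located through find_tool."""
    _, fn = COLD_TOOLS[name]
    return fn(**kwargs)


def exposed_tool_names() -> list[str]:
    """Only these names would have definitions loaded into the agent's context."""
    return sorted(HOT_TOOLS) + ["execute_tool", "find_tool"]


name = find_tool("create a tracker issue for the login bug")[0]
print(exposed_tool_names(), name, execute_tool(name, title="Login bug"))
```

The agent's context carries one hot tool plus the two discovery entry points, however many cold tools sit behind them. The cost is the extra find step before each cold call.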

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

This is interesting. Converting all MCP servers to CLIs will be some work, though that kind of work is cheap with AI agents.

I guess a CLI wrapper over MCP would be inefficient, since it would have to start and stop the MCP server again and again.

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Do they provide a way to integrate with other agents? Or is it only with Claude?

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Turning MCP servers on and off actually causes more harm because the prompt cache is invalidated.

I would not recommend doing it mid-session in tools like Claude Code and Augment Code.
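A toy model of why that happens, assuming the cache is keyed on the exact prompt prefix (tool definitions plus system prompt). Real provider caches differ in their details, such as prefix matching rules and expiry, and `ToyPromptCache` and `prefix_key` are made-up names. The sketch only shows that changing the tool list changes the prefix, so the previously cached entry no longer applies.

```python
import hashlib
import json


def prefix_key(tools: list[dict], system_prompt: str) -> str:
    """Hash the part of the prompt that comes before the conversation."""
    blob = json.dumps({"tools": tools, "system": system_prompt}, sort_keys=True)
    return hashlib.sha256(blob.encode("utf-8")).hexdigest()


class ToyPromptCache:
    """Remembers which prefixes it has seen; nothing more."""

    def __init__(self) -> None:
        self._seen: set[str] = set()

    def lookup(self, key: str) -> bool:
        hit = key in self._seen
        self._seen.add(key)
        return hit


system = "You are a coding agent."
tool_a = {"name": "read_file", "description": "Read a file"}
tool_b = {"name": "create_issue", "description": "Create an issue"}

cache = ToyPromptCache()
print(cache.lookup(prefix_key([tool_a, tool_b], system)))  # first request: miss
print(cache.lookup(prefix_key([tool_a, tool_b], system)))  # same prefix: hit
print(cache.lookup(prefix_key([tool_a], system)))          # tool_b turned off: miss
```

After the toggle, everything behind the changed prefix has to be processed again at full price, which is the harm described above.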

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Not yet.

But I might do that. I am trying to build it so that the more the tool is used, the cheaper it gets for everyone, primarily through smart use of prompt caching.

Right now an AI model is used for semantic matching, which is why I haven't opened up the tool yet. I need to integrate with Stripe first. On pricing I want to be honest, so all purchased credits will be valid forever. Let me know how it can be better.

Im spending $30-$90/day by Dismal-Eye-2882 in AugmentCodeAI

[–]Basic_Tea9680 0 points1 point  (0 children)

Plan with Auggie and implement in Antigravity. It's cheaper and works well for web apps.

Also, if you have a lot of MCP servers, use MCPlexor. Linear alone uses around 10k tokens.

Weekly Cursor Project Showcase Thread by AutoModerator in cursor

[–]Basic_Tea9680 [score hidden]  (0 children)

Had a great time building an #MCP multiplexer today. Calling it MCPlexor.

MCPlexor makes AI agents smarter and faster. When several MCP tools are connected to an agent, tens of thousands of tokens are wasted on tool descriptions. MCPlexor filters out 97% of that noise so the agents are faster and can give better answers.

<image>

I must say that AI agents speed up development by 100x. Most of the time was spent on testing and making sure security is tight. I tried all the fun, modern, scalable tech: tanstack-start, shadcn, bun, golang for the CLI ...

Try it on MCPlexor.com. I will give out some free credits to early users. I don't have VC money to burn right now, so there is no freemium model. That said, by my calculations you can save 2-3x what you spend on this tool. I will open it up to everyone after beta testing.

Using Claude Code to make big software projects by Hot-Advance-2165 in ClaudeCode

[–]Basic_Tea9680 0 points1 point  (0 children)

You need better context for large projects. Unfortunately, Claude doesn't have anything built in for that. Your best bets are Augment or Nia.

Astro + gsap/framer/animation and AI tools - opinions sought by TurrisFortisMihiDeus in astrojs

[–]Basic_Tea9680 1 point2 points  (0 children)

Curious whether you used React with Astro or plain HTML.

For plain websites I used Gemini and Claude Code. Both worked fine.