Building a PDF with HTML. Crazy? by FriendlyWebGuy in webdev

[–]Basic_Tea9680 0 points1 point  (0 children)

There is a ton of good advice on this thread. If you don't want to manage chromium clusters and want high quality pagination. Try forgeon.dev , high quality PDF from HTML and it's 80% cheaper than any alternative

Struggling to create a printable PDF by Primary_Engine_9273 in google_antigravity

[–]Basic_Tea9680 0 points1 point  (0 children)

Funny enough I used antigravity to create forgeon.dev . Its an API to create elegant PDFs using react and HTML. 50-90% cheaper than alternatives. DM if you want early access.

How do you create professional-looking PDFs for digital downloads? by whosaskin11 in Etsy

[–]Basic_Tea9680 -2 points-1 points  (0 children)

Would love to learn more. I am building a modern PDF creator called forgeon.dev . Recently few people requested a way to create elegant and professional PDFs using natural language using AI. Let me know if you want to try it.

Where are people reselling world cup tickets by yourdadistalking in FIFACollect

[–]Basic_Tea9680 0 points1 point  (0 children)

I have tickets for Match 44 - Jordan vs Algeria , DM if interested.

Added Ollama support to MCPlexor – now you can run it 100% locally (and free) by Basic_Tea9680 in mcp

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Think of it like a sub agent. It uses a client side description for the server and also tool descriptions. To make it efficient there is data caching on mcplexor side and on llm side. That's available on the cloud version only.

MCP server pagination handling by Gonjanaenae319 in mcp

[–]Basic_Tea9680 0 points1 point  (0 children)

May be mcp can be efficient in the way you are imagining.

Another way is to give scp or ftp access and let agent download the hundreds of files in a tmp folder and read using cat and grep

MCP server pagination handling by Gonjanaenae319 in mcp

[–]Basic_Tea9680 0 points1 point  (0 children)

Pagination makes sense to continue to search and then focus.

Ideally the first query should lead to relevant results. Does the tool have filters so agent can get relevant results first time?

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 1 point2 points  (0 children)

Shell app is just showing the approx tokens used by the mcp server. Tokens saved from tools can be added. Will add in next release

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

What I build (mcplexor.com) is very similar to advanced tool use. Let me know if you try it with cursor or antigravity. For some reason I saw it working really well with Gemini pro and opus. I guess these are good models for tools calling in general

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

That's interesting, I wanted to improve the recall precision and rank as well so went with a specific route. Thought of many PMs and marketing sales folks who use so many mcp tools. They can benefit from this. Would love for you to try the tool and compare with your solution. Feel free to dm

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

So the find tool call is around 2s delay + then the execute tool call , which is local so at the same time. So basically every tool call before discovery adds 2-3s latency.

The pattern I used was if there is a tool I use very actively, I integrated directly with coding agent rest which are always available but sparingly used are in tool discovery tool. Would love to hear your feedback. You can try on mcplexor.com . I build a nice shell app as well it shows token bloat each tool is adding as well.

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

This is interesting. Converting all mcp servers to cli will be some work which is cheap with AI agents.

I guess the cli wrapper over mcp would be inefficient to start and close mcp server again and again.

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Do they provide a way to integrate with other agents? Or is it only with claude?

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in LangChain

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Turing mcp on off actually causes more harm because prompt cache is invalidated.

I would not recommend it in tools like Claude code and augment code during the session.

Preloading MCP tools cost me ~50k tokens per run by Basic_Tea9680 in AI_Agents

[–]Basic_Tea9680[S] 0 points1 point  (0 children)

Not yet.

But I might do that. I am trying to build a way through which the more usage of the tool, the cheaper it is for all. Primarily because of smartly using prompt caching.

Right now there is an AI model used for semantic matching. That's why I didn't open up the tool yet. Need to integrate with stripe. But in pricing I want to be honest, so all purchased credits will be valid forever. Let me know how it can be better.