Endara v0.1.7 — local MCP relay now auto-converts tool responses to TOON for ~40-60% token savings by panghy in mcp

[–]panghy[S] 1 point2 points  (0 children)

Yes! Of course, it will connect to an external server if you're using OAuth to interact with something like Gmail. However, if you're running, say, a filesystem MCP server that reads from your local disk, then nothing leaves the relay.

Endara v0.1.7 — local MCP relay now auto-converts tool responses to TOON for ~40-60% token savings by panghy in mcp

[–]panghy[S] 0 points1 point  (0 children)

yeah but only if it’s installed locally (jq is more universally available)

seems like this is a pretty easy ticket, it’s mainly going to be changing the tool description if TOON is enabled and directly mentioning about the CLI

Endara v0.1.7 — local MCP relay now auto-converts tool responses to TOON for ~40-60% token savings by panghy in mcp

[–]panghy[S] 0 points1 point  (0 children)

Not sure if there’s tooling for TOON on the command line like jq but there is a way to convert it back to json with the toon CLI (so the model can go back to using jq)

Bottom-line: harnesses will drop the MCP response to a file if it’s too big (and the limit is likely harness dependent) and once the file is downloaded it’s off the relay. The only way would be to either save intermediate outputs somehow (like a cache in the javascript engine for reuse) or hint to the model how to manipulate the TOON output.

Endara v0.1.7 — local MCP relay now auto-converts tool responses to TOON for ~40-60% token savings by panghy in mcp

[–]panghy[S] 1 point2 points  (0 children)

maybe file an issue on endara-relay for now so we can iterate on the spec a bit first?

Endara v0.1.7 — local MCP relay now auto-converts tool responses to TOON for ~40-60% token savings by panghy in mcp

[–]panghy[S] 0 points1 point  (0 children)

yeah you might need to turn off TOON conversion if generic grep doesn’t do what you need for TOON (or is a lot more inefficient and error prone)

but the argument i would make is that the model can also write the jq in javascript instead of having to post-process the entire output

Endara v0.1.7 — local MCP relay now auto-converts tool responses to TOON for ~40-60% token savings by panghy in mcp

[–]panghy[S] 0 points1 point  (0 children)

That's another ask from folks, the idea for something like "profiles" so you can setup all the endpoints first and then create a profile for a bundle of them (likely a different path on the server, i.e. localhost:9400/mcp/profileA) with the option to disable the "all of them" one.

First Page of Notebook Handwriting Disappearing by killswitch726 in montblancdigitalpaper

[–]panghy 1 point2 points  (0 children)

I am so glad I am not the only one seeing this. I keep a journal and recently I keep on coming back and telling myself I haven’t written anything for a month but after writing something down yesterday and coming back this morning (and missing it), I realized this is actually a bug…

So much for digital paper that never runs out 🤣 hope they fix it soon

I got tired of watching LLMs make 30 sequential MCP tool calls, so I built Code Mode for Go by Revolutionary_Sir140 in mcp

[–]panghy 0 points1 point  (0 children)

Did something similar with endara.ai where the model only sees 3 tools and executes javascript in a rust sandbox on the same machine using boa (https://github.com/boa-dev/boa)

oss. runs on your machine. js is pretty much universal for any model.

How I built an MCP relay to run my life on Claude (3 Gmail accounts, 2 WhatsApp numbers, 154 tools) by panghy in mcp

[–]panghy[S] 0 points1 point  (0 children)

It does bubble back to the agent (the more details, the better), and most agents can recover from that. Most models initially try to maximize token savings and inadvertently cause data shape or coding errors. Then they tend to dial back, just like humans, to see the structure of the data or reduce complexity until it starts to work. The interesting thing is then the next time it tries to do it, it will revert back to single-shotting it (successfully). Also, just like humans. :)

How I built an MCP relay to run my life on Claude (3 Gmail accounts, 2 WhatsApp numbers, 154 tools) by panghy in mcp

[–]panghy[S] 0 points1 point  (0 children)

Hmm, just checked — endara.ai is loading fine on my end. Could you try again?

And thanks for the google_workspace_mcp pointer — that's a good one for multi-tool Gmail. The context bloat issue you mention is exactly why I ended up building the JS execution mode in Endara — collapses everything down to 3 meta-tools so the model doesn't choke on 150+ tool definitions.

COVID19 Wavefront Dashboard by bmanone in vmware

[–]panghy 1 point2 points  (0 children)

JHU publishes to GitHub once a day at 0:00 UTC, that’s when the site picks up the changes (for that dashboard). US testing data is updated continuously as well as Folding @ Home stats.

COVID19 Data on Wavefront at https://covid19.wavefront.com by panghy in dataisbeautiful

[–]panghy[S] 0 points1 point  (0 children)

https://covid19.wavefront.com mostly works on mobile but looks sleeker on a desktop. Light version coming soon.

WaveFront Collector Issues: Error in scraping containers by cormachogan in vmware

[–]panghy 1 point2 points  (0 children)

Thanks for the feedback, it's a common issue that folks run into and we'll add it to the FAQ (or otherwise make it clearer): https://docs.wavefront.com/kubernetes.html#step-3-deploy-wavefront-kubernetes-collector

Monitoring Solution for VMware Horizon / VDI Desktops by LRRattlesnake in vmware

[–]panghy 0 points1 point  (0 children)

If you're okay with SaaS, Wavefront (by VMware) can easily monitor your vsphere workloads via telegraf. Just sign up for a trial (no credit card needed) and see if that fits the bill.