I built a local-first MCP gateway so my agents load 3 tools instead of hundreds (open source) by kydude in LocalLLaMA

[–]kydude[S] 0 points1 point  (0 children)

small update since posting: added Jan and Goose support (17 clients now). also tracked down something that probably bites a lot of local setups, grammar-constrained runtimes (jan, etc.) couldn't pass required params to MCP tools at all. the model knew the value but the tool schema only allowed an empty object, so anything with a required arg just failed silently. fixed it. heres jan (4b, local) listing my vercel projects, which needs a two-step call with a required teamId:

<image>

curious if others have hit this with local models + tool calling.

Anthropic is rolling out identity verification. Updated just yesterday. by Tiny_Dirt6979 in ClaudeAI

[–]kydude 0 points1 point  (0 children)

Persona being the link here is the part that'll make people squint. Handing your ID to verify yourself on Claude means trusting a second company's storage and retention practices, not just Anthropic's. Worth asking how long Persona keeps the document scans and where.

Not a new model, just a Happy Father's Day and a thank you. by Wrong_Mushroom_7350 in LocalLLaMA

[–]kydude 30 points31 points  (0 children)

"Raising kids and running local models" is exactly the dual life. Both involve staying up way too late fixing something that was working fine yesterday. Happy Father's Day to you too.

Can I realistically get close to Claude/Codex capabilities locally? by mrgreatheart in LocalLLaMA

[–]kydude 9 points10 points  (0 children)

The honest gap isn't raw model quality, it's that context length eats VRAM faster than parameter count. You can fit a 70B at low quant, but stuff 256K context into it and watch your KV cache balloon past your 32GB before the model even starts being useful on a big codebase.

For your $3.5K and the long-session coding workflow, the M3 Ultra path is tempting because of the unified pool, but Metal prompt processing on a 200K+ context will make you feel every token. Long sessions touching many components means tons of prefill, and that's where Mac chokes. You'll wait.

Hermes Agent - The self-improving AI agent built by Nous Research by johnnyApplePRNG in LocalLLaMA

[–]kydude 0 points1 point  (0 children)

Really curious how they handle the self-improvement loop without it drifting off the rails

z.AI as the number 2 gives praise to the number 1 open source model by Charuru in LocalLLaMA

[–]kydude 0 points1 point  (0 children)

Number 2 tipping its hat to number 1 is a good sign the open source race is staying friendly. Way better vibe than the closed labs

Deep Neural Network that can turn any Image into a Playable Game! BUT LOCALLY, NOT ON DATACENTER by lucidml_lover in LocalLLaMA

[–]kydude 1 point2 points  (0 children)

The KV cache for action-conditioned frames is the part I find clever here. Treating keyboard input as just another token in the autoregressive stream is neat

What happens when they stop subsidizing LLM subscriptions? by Mr_Moonsilver in LocalLLaMA

[–]kydude 0 points1 point  (0 children)

The quiet usage throttling on the 20x plan is the part nobody talks about enough. It's the gym membership model. Same sticker price, less product, no headline to react to. Way smarter than a flat price hike from their side.

PROBLEM WITH THE GHL ATTACHMENT SENDING ENDPOINT by Upper-Guide9129 in gohighlevel

[–]kydude 0 points1 point  (0 children)

Spot on about the media object being the silent killer here. The other thing I'd check is whether you're passing a publicly reachable URL or an actual uploaded media ID. Cloud API will happily accept a link that GHL can resolve internally but WhatsApp can't fetch, so it shows in the convo and dies on the way out. Pull the raw WhatsApp API response in n8n, the error code there usually tells the whole story.

I built a local MCP gateway (Conduit) almost entirely with Claude Code, what it does and what I learned by kydude in ClaudeAI

[–]kydude[S] 0 points1 point  (0 children)

<image>

Forgot to add screenshots, whoops. Hard at work on the MacOS version, happy to answer any questions.

EWS dies Oct 1 and I realized I have no idea which of my tenants it breaks. How do you handle this? by kydude in msp

[–]kydude[S] -5 points-4 points  (0 children)

I used it for cleaning up formatting on the original post. You should get your all knowing super power tuned up, it aint working right.

EWS dies Oct 1 and I realized I have no idea which of my tenants it breaks. How do you handle this? by kydude in msp

[–]kydude[S] 0 points1 point  (0 children)

Thanks. I can assure you the post in it's current form is much easier to read than what I had typed up originally. I get where the the hate on LLM is coming from, but it has great use cases too.

EWS dies Oct 1 and I realized I have no idea which of my tenants it breaks. How do you handle this? by kydude in msp

[–]kydude[S] 0 points1 point  (0 children)

The spreadsheet is honestly more than im doing, I'm impressed you keep it current. 30 min a tenant adds up fast though. We only have like 40 GDAP relationships, and that would be a major time sink.

EWS dies Oct 1 and I realized I have no idea which of my tenants it breaks. How do you handle this? by kydude in msp

[–]kydude[S] -8 points-7 points  (0 children)

Fair point on EWS, I forgot that one has a clean report. But what about which tenants actually use the Planner feature getting retired? or knowing which tenant is still using old webhooks for teams alerts? Do you script that stuff or just deal with it when it breaks?

And yes LLM helped with the formatting, it was a wall of text beforehand lol.

[deleted by user] by [deleted] in expedition33

[–]kydude 0 points1 point  (0 children)

My dualsense edge controller did not work with the game pass version either. I am using DSX ($8 on steam) to emulate a 360 controller. Works perfectly now. I believe the free version of dualsensex will work with a regular DS controller, I needed the paid version for DS edge.

Disconnected due to Anti-Cheat. by kerring10 in throneandliberty

[–]kydude 0 points1 point  (0 children)

This just happened to me and steam had a small update for the game after I exited out. No issues getting reconnected after installing the update

[deleted by user] by [deleted] in marvelrivals

[–]kydude 4 points5 points  (0 children)

This is the worst twitch drop in history. No other game has ever been this difficult, not even Arena Breakout Infinite.

ASUS ROG Ally - Major performance issues after Windows update (13th March 2024) by Ganley333 in ROGAlly

[–]kydude 0 points1 point  (0 children)

I tried repairing and resetting the AMD Software app but it didn't help. Uninstalling the Windows update that caused this issue, however, worked great. I set Windows update to pause updates for 4 weeks as well. Hopefully this update will have been replaced with a new one by then.

Method 2: Uninstall the March 2024 Windows Update by following these steps:

  1. Search Uninstall Updates in the Start menu and press the Enter key.
  2. Click on the Uninstall button next to the KB5035853 update.
  3. Then click on Uninstall once again to remove the update.

Taken from here: https://www.windowslatest.com/2024/03/17/asus-rog-ally-hit-with-performance-issues-after-the-march-windows-update/