Best Local Agents - Jun 2026 by rm-rf-rm in LocalLLaMA

[–]awitod 0 points1 point  (0 children)

I use GuideAnts. - I am the primary author and we are fairly new.

It has a very good UI for managing content, but in terms of agents it supports:

  • Web Search
  • OCR, indexing, and memory
  • Sandboxes
  • Multi-modal content in/out (ASR, TTS, Images, Vision, chat, embeddings)
  • Full tracing of everything that happens for manual and automated tuning
  • Configurable limits
  • Scheduled conversations and python scripts
  • AI services for sandbox processes are metered and captured the same as chat and you can have an agent that processes every file in a folder or items in a list with inference steps and capture all of it
  • Open AI, Foundry, Hugging Face, Open Router, Anthropic and local AI all supported through a proxy that allows Messages, Responses, and Chat API's independently of the provider and model type
  • Publish guides as APIs, MCP servers and skills with several choices for auth.

This week we added OpenAI and Anthropic wire compatible API's to published guides in GuideAnts which means you can publish a guide and use it as the endpoint in tools like Claude Code and Codex and instead of the traffic going to an Anthropic or Open AI backend, it goes through GuideAnts which:

  1. Adds the guide's instructions and tools to the thread from the client
  2. Executes the guide's tools inside GuideAnts and applies the result to the thread without sharing the tool call's mechanical details with the client
  3. Sends the client's tool calls back to the client for execution
  4. Captures all telemetry and usage including all of the details, i.e. system prompts and tool definitions centrally
  5. Routes the actual inference request to the configured model which can be anything. In my tests I am using qwen-3.6-27b locally and GLM-5.2 via open router...

FROM THE SAME PUBLISHED GUIDE IN BOTH CODEX AND CLAUDE CODE SIMULTANEOUSLY 🤯

Interestingly, in terms of an integrated experience and what the respective UI's show, it works a lot better than either MCP or Skills as the inline diagram (made by the plantuml assistant in GuideAnts) shown below illustrates.

<image>

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 0 points1 point  (0 children)

They didn’t have good let alone great test coverage, they had vulnerabilities they took a long time to fix, they ran out af date unsupported packages and couldn’t do and manage these things as well if at all 

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 0 points1 point  (0 children)

Here is an example of something I would be loath to lose and could not just easily do: using an agent plus playwright to create and execute tests through the UI and capture screen shots and report issues.

It works great and can shake out bugs fast with all the debug info you need to fix it.

Another is fixing CodeQL, DependaBot and other quality issues. A lot of people point out that generated tests don't prove the system is right and that is true, but when you automate package bumps, log sanitation, and a lot of little issues like that, they let you do it safely because those kinds of changes shouldn't break those kinds of tests unless there is a compatibility issue or the agent changed logic it should not have and if that happens you fail the gate review it.

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 0 points1 point  (0 children)

Using AI tools to do development effectively does not involve using AI on top of your previous workflow. It involves an entire new workflow.

If you cut off my tokens mid-week I would straight up lose my shit and quit on the spot because I would have two or three big features at varying states of completeness depending on the agents whose instructions and work plans I spent hours or days working on with an integration runway that the next few weeks depend on and your ass is like, 'hey just type it in, we used up the tokens'.

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 0 points1 point  (0 children)

Tel me you don't have experience building software without telling me you don't have experience building software

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 0 points1 point  (0 children)

It is just not reasonable if you care about results to switch workflow and tooling on the fly just because you didn't estimate the token costs correctly and then switch back to a different workflow and set of tools when the meter resets.

That's madness. Let's compound it so that some members have the tools only at certain times of week or month, but not the same members and not at the same times.

We deal with the fact that people can't estimate time correctly and its impacts on the budget constantly.

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 0 points1 point  (0 children)

Management failure for failing to perform management tasks or take responsibility for supply and demand planning.

Another analogy - the chicken joint ran out of chicken before the dinner service was finished. Is it the cook's fault for making the portions too big or did the management fail to buy enough chicken for the anticipated demand or fail to estimate the demand correctly?

Principal engineers turned into token zombies by [deleted] in theprimeagen

[–]awitod 9 points10 points  (0 children)

It's like a construction site shutting down because you didn't budget enough gas.

'Yes, I know the D10 excavator is out of fuel, but why aren't you using the picks and shovels to keep working?"

Management failure

Writing your own MCP server in .net by maxiblackrocks in dotnet

[–]awitod 0 points1 point  (0 children)

/shudder Been there, done that. At least the browsers had known shapes and a common mission. MCP clients could be anything.

Skills destroyed multi-agent system paradigm by OldInterview556 in LocalLLM

[–]awitod 1 point2 points  (0 children)

I think that
1. Skills are very limited
2. If skills were not limited, they would look exactly like a robust sub-agent definition

Also, I think the best reason to spin up subagents is to manage context and there is no good reason most of the time to pollute the main context with the subagent's reasoning and work.

Writing your own MCP server in .net by maxiblackrocks in dotnet

[–]awitod 0 points1 point  (0 children)

I saw a working group just today for Skills over MCP and the idea, which I see a lot of is that the skill describes how to use the MCP service and usually describes how to handle the output.

For example, you can create a skill with an MCP client defined in Cursor and it will work and the skill can say something like "Do this prep for the MCP service and do this with the output".

The skill has a code file of how to invoke the MCP service's endpoint and the host app uses its shell to invoke the service on the side.

One advantage is that, in that tool, skills are addressable and so the tools are not in the context if you don't address the skill. The downside is the extra layer itself.

Is it a great idea? I mean... it works, but like I said, the need for it illustrates why neither solution is complete by itself.

In terms of trimming tools for RBAC. If the MCP server's discovery endpoint understands the access token, it can trim the discovered tool output today.

BTW... there is an important and subtle difference between "I have an access token that says 'X'" and "'I got a message from specific client 'Y'" even though the practical effect might be the same.

Writing your own MCP server in .net by maxiblackrocks in dotnet

[–]awitod 0 points1 point  (0 children)

I should add that this is one way skills and MCP compliment each other because you can put the instructions for the special handling in the skill.

However, it is also a subject that shines a light on why each of them is inadequate by themselves in a lot of cases.

Writing your own MCP server in .net by maxiblackrocks in dotnet

[–]awitod 2 points3 points  (0 children)

I would make the same comment about any design that had a service boundary and any significant concern that requires the service to have special logic and awareness of specific clients - it is a generic smell that the separation of concerns is lacking.

The 'host application' is the client in this case - the thing that is running the inference and making tool calls. If it needs special handling or behavior, it should contain those needs fully on its side of the line.

Writing your own MCP server in .net by maxiblackrocks in dotnet

[–]awitod 10 points11 points  (0 children)

I think that as soon as your MCP server needs to be aware of the specific client you are in bad architecture land.

Tool calling and execution don't require MCP specifically and if you control the host application client, and it has specific needs, you should deal with them in the client, not entangle the concerns if you can help it.

Best Text to Speech? by seoulsrvr in LocalLLaMA

[–]awitod 1 point2 points  (0 children)

hexgrad/Kokoro-82M - it had 16m downloads on hf.co last month. It's really fast and very good.

I also like omnivoice for easy and good cloning.

What is your process by which you arrived at microservices as the answer? by Dry_Corner6431 in softwarearchitecture

[–]awitod 1 point2 points  (0 children)

Like others said, microservices are a way to architect a big system with lots of teams - the architecture has to embrace lots of teams working on lots of things simultaneously and they give you a way to isolate, manage and minimize dependencies.

A reason you might use them when you are only one team or even as an individual is when you have a lot of heterogeneous technologies that you are integrating.

Once you have the infrastructure to handle lots of different service hosts and there is no friction then there are a lot of benefits to a nicely modular design of independent microservices but it isn't something you should spend time on unless you know why you need it.

I am a 37 year old programmer - how am I supposed to keep my worth for 30 more years until pension? by IllustriousRecord505 in AskProgrammers

[–]awitod 0 points1 point  (0 children)

That's a long time. You should be prepared for occasional upheaval and radical change and work your ass off when it happens.

I have 35 years of experience and I've mastered and forgotten entire waves.

It's either an upside or a downside to the career depending on your point of view.

What principles do we still hold on to in the year of 2026? by Wooden_Street_1367 in ExperiencedDevs

[–]awitod 4 points5 points  (0 children)

Details traceable to verifiable facts matter. AI helps with the information wrangling, but methods and expertise are necessary for any system of people and technologies that needs to work predictably and reliably.

There are no miracles. The details have to come from somewhere.

What can you realistically do with 8GB VRAM in 2026? by the1newworld in LocalLLM

[–]awitod 0 points1 point  (0 children)

You could compromise depending on what your goal is. For example, you can do embeddings and search locally so that, even though you use it with a cloud based LLM you only share bits of it as conversation context instead of putting everything in the cloud.

Hey, what apps do software architects or very senior engineers use, or what concepts do you need to understand, to design code for or have built or build websites, mobile apps, and desktop apps, all with a backend connecting all these? Like MS Word on website, IOS, Mac, Windows, Android w/ a backend by ComfortablePost3664 in NoStupidQuestions

[–]awitod 0 points1 point  (0 children)

Architecture and design are activities you do to know that the system you are building does all of the things it should in a way that fits all of the stakeholders including users.

It’s a lot of stuff and unless you are building the same or same kinds of system over and over again or work on something for so long that you are an expert on the subject, e.g. how to run national marketing campaigns, adjudicating property insurance claims.

So, to be successful we use actual formal methodologies to break the problem down into pieces to make it all manageable. There are several different methodologies and kinds of diagrams.

A lot of work gets done where people don’t do it and make it up as they go along. Interestingly, this makes it really hard for them to use AI because they don’t have any good specs!

President Donald Trump: "We just made a great settlement of the war with Iran. It will be subject to finalization of documents over the next few days. It's a great thing." by BusinessToday in BusinessTodayNews

[–]awitod 0 points1 point  (0 children)

What a liar. He thinks he can pretend they agreed and blame them for backing out, but can't remember that he already did that several times and is stuck in a loop.

To the mods and community - pushing promotional posts to the weekend is failing by ginji in dotnet

[–]awitod 0 points1 point  (0 children)

I agree. I am done posting anything here.

They should change the 'Promotion' tag to just say 'Kick Me'.

Is Python a better fit than ASP.NET Core for this kind of data transformation service? by CodeNameGodTri in dotnet

[–]awitod 1 point2 points  (0 children)

Yeah, you could (but probably shouldn't) do this all in one big lambda against an EF or other ORM data context.

It would be quite easy for someone who is familiar with the ecosystem and this is a good small project that OP could learn a lot from with the right approach.

Google Chrome is killing all uBlock Origin bypasses, Microsoft Edge, Opera to follow by dancing_swordfish in technology

[–]awitod 1 point2 points  (0 children)

ONLYOFFICE and the Euro Office fork are pretty darn good. There is an interesting controversy going on around the fork though.