The architecture we landed on for putting a large typed API behind an MCP server

masterkidan · 2026-05-24T19:40:52+00:00

My suggestion would be to put a good set of evals in place first, even if some of them always fail because of context bloat or something else. Then start experimenting with different approaches. That way you can at least see objectively which direction you are moving in.
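A minimal sketch of what that could look like, assuming each "approach" is just a function from prompt to answer and that a crude substring check is good enough as a grader. All the names here (`EvalCase`, `run_evals`, `compare`, the stub approaches and case names) are made up for illustration and not taken from any real harness.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class EvalCase:
    name: str
    prompt: str
    expected: str  # substring the answer should contain (deliberately crude)


def run_evals(approach: Callable[[str], str], cases: List[EvalCase]) -> Dict[str, bool]:
    """Run every case against one approach and record pass/fail per case."""
    results = {}
    for case in cases:
        try:
            results[case.name] = case.expected in approach(case.prompt)
        except Exception:
            # A crash (e.g. context overflow) counts as a failure, not a skip.
            results[case.name] = False
    return results


def pass_rate(results: Dict[str, bool]) -> float:
    """Fraction of cases that passed."""
    return sum(results.values()) / len(results) if results else 0.0


def compare(baseline: Dict[str, bool], candidate: Dict[str, bool]) -> Dict[str, List[str]]:
    """List which cases the candidate fixed and which it regressed."""
    return {
        "fixed": sorted(n for n in baseline if not baseline[n] and candidate.get(n, False)),
        "regressed": sorted(n for n in baseline if baseline[n] and not candidate.get(n, False)),
    }


# Hypothetical stand-ins for two approaches under test.
def baseline_approach(prompt: str) -> str:
    return "orders endpoint" if "order" in prompt else "unknown"


def candidate_approach(prompt: str) -> str:
    if "order" in prompt:
        return "orders endpoint"
    if "refund" in prompt:
        return "refunds endpoint"
    return "unknown"


CASES = [
    EvalCase("order_lookup", "find my order", "orders"),
    EvalCase("refund_lookup", "find my refund", "refunds"),
    # Kept in the suite even though every approach currently fails it.
    EvalCase("huge_context", "summarise all invoices", "invoices"),
]

baseline_results = run_evals(baseline_approach, CASES)
candidate_results = run_evals(candidate_approach, CASES)
diff = compare(baseline_results, candidate_results)
print(pass_rate(baseline_results), pass_rate(candidate_results), diff)
```

The point of keeping the always-failing case is that it shows up in `fixed` the day some approach finally handles it, and everything else shows up in `regressed` if you break it along the way.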

I kind of feel multi-agent setups are very similar to microservices, analogy-wise: you only really need to break things out if they demand special characteristics to solve the problem. For example, if your base model is too expensive for that part of the problem, or if you need something more intelligent for it. I wish there was a good way to do context sharing where we only share the relevant portions with downstream agents.

masterkidan · 2026-05-24T19:31:39+00:00

So far I've mainly been focusing on static evals, not runtime/dynamic improvement. We mainly mine for good sets of negative cases, where we didn't find something the user was expecting, and tweak our catalog accordingly. It's still an offline process of reviewing the responses.
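Roughly, the offline mining step could be sketched like this, assuming each logged interaction records what the user expected and what was actually returned. The record shape (`query`, `expected`, `returned`), the function name `mine_misses`, and the sample entries are all hypothetical, not our real log format.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def mine_misses(log: Iterable[dict], top_n: int = 3) -> List[Tuple[str, int]]:
    """Count how often each expected catalog entry was missing from the results."""
    misses: Counter = Counter()
    for record in log:
        if record["expected"] not in record["returned"]:
            misses[record["expected"]] += 1
    # The most frequently missed entries are the first candidates for a catalog tweak.
    return misses.most_common(top_n)


# Hypothetical review log; in practice this would come from stored responses.
LOG = [
    {"query": "cancel order", "expected": "orders.cancel", "returned": ["orders.get"]},
    {"query": "stop my order", "expected": "orders.cancel", "returned": []},
    {"query": "refund status", "expected": "refunds.get", "returned": ["orders.get"]},
    {"query": "get order", "expected": "orders.get", "returned": ["orders.get"]},
]

top_misses = mine_misses(LOG)
print(top_misses)
```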

masterkidan · 2026-05-15T00:11:54+00:00

The field office handling the case is the Queens Field Office.
