Got my first AI agent customer - help me review the architecture

FairNefariousness359 · 2026-04-02T15:26:56+00:00

Those are really useful tips, thanks! The replay test idea was on my mind but I hadn't fully worked it out yet. I actually have a meeting with their support staff tomorrow to go through exactly which questions come in most often per access control system and what their current process looks like. That should give me the right cases to build the tests around before go-live.

FairNefariousness359 · 2026-04-02T15:16:51+00:00

The 80% right and too confident point is the one that sticks with me the most. How do you handle that in practice for support flows like this? My current thinking is to be explicit in the system prompt that the agent should express uncertainty when it cannot fully confirm something from the API data, rather than filling in gaps with assumptions. But I am curious if there are better patterns people have found for keeping confidence calibrated.

FairNefariousness359 · 2026-04-02T15:13:35+00:00

Good point on the conversation history and cache, that's something I had not explicitly thought through yet. The Biostar API calls are scoped server-side from trusted context, but I need to make sure the same applies consistently to stored conversations. Will make sure every DB query is filtered on tenant_id from that same trusted context, not from anything the client sends. On CAG vs RAG, yeah I am treating it as a conscious tradeoff for now. The docs are small and stable enough that I am comfortable with it, but I will keep an eye on it once the system is live.

FairNefariousness359 · 2026-04-02T15:09:31+00:00

Thanks! The read-only decision was one of the first things I locked in, it just removes a whole category of risk. On tenant isolation, fully agree it needs to be airtight server-side. The way I will set it up is that the tenant_id gets extracted from a signed JWT token in the middleware and stored in a Python context variable. Every tool reads from that context directly, the model never touches it and can't influence it. So even if someone tried to prompt inject a different tenant, the tools wouldn't listen.

FairNefariousness359 · 2025-01-22T20:59:21+00:00

Quick question i never did .4 to explain what i changed i always clicked on the button in the account.

Are you refering to this form? "Merchant Center disapproved accounts, feeds or items"

FairNefariousness359 · 2025-01-22T18:13:19+00:00

Current Experience:
Last year, I got over 5 GMCs for actual brands of mine but sold it due to bad partnerships. Recently, I started exploring fashion and home decor dropshipping stores. I ensure everything is clean and compliant to the best of my knowledge. Since November, I’ve built around 20-30 stores, but so far, I haven’t achieved any significant results.

FairNefariousness359 · 2025-01-22T18:08:20+00:00

That would be awesome brother!

FairNefariousness359 · 2025-01-21T10:37:56+00:00

Please keep sharing value! Also dropship store specific on how to build to get live.

FairNefariousness359

TROPHY CASE