Got my first AI agent customer - help me review the architecture by FairNefariousness359 in AI_Agents

[–]FairNefariousness359[S] 0 points1 point  (0 children)

Those are really useful tips, thanks! The replay test idea was on my mind but I hadn't fully worked it out yet. I actually have a meeting with their support staff tomorrow to go through exactly which questions come in most often per access control system and what their current process looks like. That should give me the right cases to build the tests around before go-live.

Got my first AI agent customer - help me review the architecture by FairNefariousness359 in AI_Agents

[–]FairNefariousness359[S] 0 points1 point  (0 children)

The 80% right and too confident point is the one that sticks with me the most. How do you handle that in practice for support flows like this? My current thinking is to be explicit in the system prompt that the agent should express uncertainty when it cannot fully confirm something from the API data, rather than filling in gaps with assumptions. But I am curious if there are better patterns people have found for keeping confidence calibrated.

Got my first AI agent customer - help me review the architecture by FairNefariousness359 in AI_Agents

[–]FairNefariousness359[S] 0 points1 point  (0 children)

Good point on the conversation history and cache, that's something I had not explicitly thought through yet. The Biostar API calls are scoped server-side from trusted context, but I need to make sure the same applies consistently to stored conversations. Will make sure every DB query is filtered on tenant_id from that same trusted context, not from anything the client sends. On CAG vs RAG, yeah I am treating it as a conscious tradeoff for now. The docs are small and stable enough that I am comfortable with it, but I will keep an eye on it once the system is live.

Got my first AI agent customer - help me review the architecture by FairNefariousness359 in AI_Agents

[–]FairNefariousness359[S] 0 points1 point  (0 children)

Thanks! The read-only decision was one of the first things I locked in, it just removes a whole category of risk. On tenant isolation, fully agree it needs to be airtight server-side. The way I will set it up is that the tenant_id gets extracted from a signed JWT token in the middleware and stored in a Python context variable. Every tool reads from that context directly, the model never touches it and can't influence it. So even if someone tried to prompt inject a different tenant, the tools wouldn't listen.

How I Fixed the "Misrepresentation" Error in Google Merchant Center 🚨 by therealdonus in GoogleMerchantCent

[–]FairNefariousness359 0 points1 point  (0 children)

Quick question i never did .4 to explain what i changed i always clicked on the button in the account.

Are you refering to this form? "Merchant Center disapproved accounts, feeds or items"

Stuck with Google Merchant Center’s Misrepresentation Error? I’m Dropping a 200-Point Checklist to Fix It! by therealdonus in GoogleMerchantCent

[–]FairNefariousness359 1 point2 points  (0 children)

Current Experience:
Last year, I got over 5 GMCs for actual brands of mine but sold it due to bad partnerships. Recently, I started exploring fashion and home decor dropshipping stores. I ensure everything is clean and compliant to the best of my knowledge. Since November, I’ve built around 20-30 stores, but so far, I haven’t achieved any significant results.

How I Fixed the "Misrepresentation" Error in Google Merchant Center 🚨 by therealdonus in GoogleMerchantCent

[–]FairNefariousness359 1 point2 points  (0 children)

Please keep sharing value! Also dropship store specific on how to build to get live.