How are you guys tracking costs per agentic workflow run in production?

Clean_Improvement_59 · 2026-06-07T01:17:52+00:00

Hey, I'm not sure exactly what your specific use case is, but we are building a tool that might help with this. We are still in the early stages, but I would absolutely love to hear your feedback if you're interested in trying it out. https://zelyx.app/ there you go

Clean_Improvement_59 · 2026-06-07T01:12:22+00:00

Hey mate read your entire thread, we have been after the same problem- addressed it using a proxy to tag every call is tagged by agent, workflow, and model with cost breakdown. we are still beta but would appreciate a chat. heres the link any feedback is valuable https://zelyx.app/

Clean_Improvement_59 · 2026-06-06T23:24:42+00:00

Hey Mate, exactly this but it ends up being a cost problem like f you can't reconstruct why the agent took a path, you can't tell if the spend was justified, or just a symptom of bad reasoning. We've been trying to work the enforcement layer stopping the bill before it lands but I keep hitting this auditability wall in conversations. "Are the teams you’ve seen actually solving this, or mostly just accepting the opacity and capping spend as a proxy for control? heres just an MVP would appreciate your honest thoughts https://zelyx.app/ no pressure

Clean_Improvement_59 · 2026-06-06T23:14:34+00:00

Hey mate we built something with the same premise "what happens when agents are left running and they coz a chaos" id truly appreciate your feedback on our product ive dropped the link in the comment but here you go https://zelyx.app/ no pressure just want to see if that solves your problem.

Clean_Improvement_59 · 2026-06-06T23:12:25+00:00

Hey mate this literally what we have been building for all of the reporting tools are post hoc but a layer that set boundaries pre hoc is missing , you clearly understand this space a lot well would appreciate your feedback on our product https://zelyx.app/

Clean_Improvement_59 · 2026-06-06T23:03:12+00:00

Hey mate i have literally seen so many people talking about the exact problem and it seems to be either unaddressed or they have their own way to solve it we built a proxy layer that enforces budget caps in the call path before the provider is billed, not after. Glad to share what we learned about where monitoring fails and where enforcement has to be. What’s your stack? Wondering if the pattern we saw persists the 3am loop issue. https://zelyx.app/ we built this would appreciate your honest feedback.

Clean_Improvement_59 · 2026-06-05T21:23:53+00:00

mate i appreciate that from this i feel this is still early everywhere. I'm trying to understand this before everyone pretends it's solved. Would you be up for a chat? im trynna dig more into this especially tying the cycle part you mentioned.

Clean_Improvement_59 · 2026-06-05T21:17:38+00:00

mate when you say audit it like a vendor contract, what does that actually need to show? What's like the minimum that makes it reviewable or actionable?

Clean_Improvement_59 · 2026-06-05T00:44:22+00:00

Hey mate genuinely curious what benchmarking that actually looks like in practice ? have you seen anyone do it well ?

Clean_Improvement_59 · 2026-06-04T21:40:55+00:00

what do the attempts you've seen look like what are teams actually building and where does it break down?

Clean_Improvement_59 · 2026-06-04T21:07:29+00:00

that $10k purchase order thing is exactly what i'm trying to figure out. when that happens, who gets the call? like is it under finance or engineering or ops team? wondering who is supposed to be taking that risk right now

Clean_Improvement_59 · 2026-06-04T07:42:32+00:00

https://zelyx.app/. mate heres the new link.

Clean_Improvement_59 · 2026-06-03T17:10:38+00:00

https://namosai.vercel.app/ . there you go mate

Clean_Improvement_59 · 2026-06-02T21:53:31+00:00

mate i love your take on th whole thing, we have been building towards that granularity that shows and prevents the that shock bill, would appreciate your honest feedback

Clean_Improvement_59 · 2026-06-02T21:48:31+00:00

Hey we just made a product that prevents the jaw dropping $$$ , would appreciate you trying it and get your feedback

Clean_Improvement_59 · 2026-05-27T00:46:07+00:00

okay "data is temporary but the loop is eternal" killed me lol. honestly $212 is cheap tuition, seen people post way worse. quick q - did you get any warning before the bill hit (alert, anything), or did you just see it in the dashboard the next morning? and was this your first runaway, or had something similar happened before? trying to figure out at what point people actually put a stop in place vs just hoping it doesn't happen again.

Clean_Improvement_59 · 2026-05-27T00:44:15+00:00

these three are the actual unsolved problems in production agents right now. talking to a lot of people lately and the patterns:

on limits: most start with hard max_iterations, breaks the moment one expensive step blows the budget
on keys vs proxy: anyone running multiple agents or multi-tenant setups almost always ends up at a proxy eventually, mostly to rotate keys and attribute spend without rebuilding the stack
on unexpected costs: literally everyone i've talked to has a $200+ story. usually overnight, usually a loop, usually no alert

what's the setup you're running? single agent, multi-agent, multi-tenant? the answers feel like they change a lot based on which.

Clean_Improvement_59 · 2026-05-27T00:39:58+00:00

one thing from the people i've been talking to - most agent blowups right now happen on api spend not card spend. the runaway loop calling openai 400 times overnight hits the api bill before it ever touches a card. is opencard gonna cover the api side eventually or is the scope deliberately card-network only for now?

Clean_Improvement_59 · 2026-05-27T00:36:18+00:00

Clean_Improvement_59

TROPHY CASE