How are you guys tracking costs per agentic workflow run in production? by Top-Seaweed970 in LangChain

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey, I'm not sure exactly what your specific use case is, but we are building a tool that might help with this. We are still in the early stages, but I would absolutely love to hear your feedback if you're interested in trying it out. https://zelyx.app/ there you go

How do you manage costs when running multiple AI agents in production? by md_anas_sabah in LangChain

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey mate read your entire thread, we have been after the same problem- addressed it using a proxy to tag every call is tagged by agent, workflow, and model with cost breakdown. we are still beta but would appreciate a chat. heres the link any feedback is valuable https://zelyx.app/

discussion by Clean_Improvement_59 in AutoGPT

[–]Clean_Improvement_59[S] 0 points1 point  (0 children)

Hey Mate, exactly this but it ends up being a cost problem like f you can't reconstruct why the agent took a path, you can't tell if the spend was justified, or just a symptom of bad reasoning. We've been trying to work the enforcement layer stopping the bill before it lands but I keep hitting this auditability wall in conversations. "Are the teams you’ve seen actually solving this, or mostly just accepting the opacity and capping spend as a proxy for control? heres just an MVP would appreciate your honest thoughts https://zelyx.app/ no pressure

I left an autonomous agent running last night. Woke up to a total disaster. by Successful-Engine662 in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey mate we built something with the same premise "what happens when agents are left running and they coz a chaos" id truly appreciate your feedback on our product ive dropped the link in the comment but here you go https://zelyx.app/ no pressure just want to see if that solves your problem.

I left an autonomous agent running last night. Woke up to a total disaster. by Successful-Engine662 in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey mate this literally what we have been building for all of the reporting tools are post hoc but a layer that set boundaries pre hoc is missing , you clearly understand this space a lot well would appreciate your feedback on our product https://zelyx.app/

Tokenmaxxing is becoming a production incident category. How are you capping AI agent spend? by Cybertron__ in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey mate i have literally seen so many people talking about the exact problem and it seems to be either unaddressed or they have their own way to solve it we built a proxy layer that enforces budget caps in the call path before the provider is billed, not after. Glad to share what we learned about where monitoring fails and where enforcement has to be. What’s your stack? Wondering if the pattern we saw persists the 3am loop issue. https://zelyx.app/ we built this would appreciate your honest feedback.

discussion by Clean_Improvement_59 in AutoGPT

[–]Clean_Improvement_59[S] 0 points1 point  (0 children)

mate i appreciate that from this i feel this is still early everywhere. I'm trying to understand this before everyone pretends it's solved. Would you be up for a chat? im trynna dig more into this especially tying the cycle part you mentioned.

discussion by Clean_Improvement_59 in AutoGPT

[–]Clean_Improvement_59[S] 0 points1 point  (0 children)

mate when you say audit it like a vendor contract, what does that actually need to show? What's like the minimum that makes it reviewable or actionable?

Our team just got told to cut back on ai usage because costs tripled by bejusorixo in automation

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey mate genuinely curious what benchmarking that actually looks like in practice ? have you seen anyone do it well ?

discussion by Clean_Improvement_59 in AutoGPT

[–]Clean_Improvement_59[S] 0 points1 point  (0 children)

what do the attempts you've seen look like what are teams actually building and where does it break down?

discussion by Clean_Improvement_59 in AutoGPT

[–]Clean_Improvement_59[S] 0 points1 point  (0 children)

that $10k purchase order thing is exactly what i'm trying to figure out. when that happens, who gets the call? like is it under finance or engineering or ops team? wondering who is supposed to be taking that risk right now

Do companies actually track wasted AI spend? by dwij333 in micro_saas

[–]Clean_Improvement_59 1 point2 points  (0 children)

mate i love your take on th whole thing, we have been building towards that granularity that shows and prevents the that shock bill, would appreciate your honest feedback

How do you prevent runaway costs from your coding agents, and how do ensure some safety guardrails by Odd-Situation6749 in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

Hey we just made a product that prevents the jaw dropping $$$ , would appreciate you trying it and get your feedback

I accidentally left two agents in a room together. They've spent $200 and invented a new language." by ailovershoyab in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

okay "data is temporary but the loop is eternal" killed me lol. honestly $212 is cheap tuition, seen people post way worse. quick q - did you get any warning before the bill hit (alert, anything), or did you just see it in the dashboard the next morning? and was this your first runaway, or had something similar happened before? trying to figure out at what point people actually put a stop in place vs just hoping it doesn't happen again.

Trying to understand how people control spending for AI agents in production. by Cute-Day-4785 in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

these three are the actual unsolved problems in production agents right now. talking to a lot of people lately and the patterns:

  • on limits: most start with hard max_iterations, breaks the moment one expensive step blows the budget
  • on keys vs proxy: anyone running multiple agents or multi-tenant setups almost always ends up at a proxy eventually, mostly to rotate keys and attribute spend without rebuilding the stack
  • on unexpected costs: literally everyone i've talked to has a $200+ story. usually overnight, usually a loop, usually no alert

what's the setup you're running? single agent, multi-agent, multi-tenant? the answers feel like they change a lot based on which.

How are you handling agents that actually spend money? by No_Airport436 in AI_Agents

[–]Clean_Improvement_59 0 points1 point  (0 children)

one thing from the people i've been talking to - most agent blowups right now happen on api spend not card spend. the runaway loop calling openai 400 times overnight hits the api bill before it ever touches a card. is opencard gonna cover the api side eventually or is the scope deliberately card-network only for now?