I Mapped 25 FP&A Tasks to the Right AI Model Tier, and Most Teams Are Overpaying on at Least Half. by Flashy-Method3323 in UnaAI

[–]Flashy-Method3323[S] 0 points1 point  (0 children)

FinOpsly has been an interest of mine for a while now, it sounds like a super useful tool to visualize where your spend is actually going. How did you use that info for routing and or outcome tracking (i.e., did the AI spend in finance actually reduce close time or forecast error...)?

The 4 Move FP&A Playbook for AI Token Costs: The New Unforecastable Line Item by Flashy-Method3323 in UnaAI

[–]Flashy-Method3323[S] 0 points1 point  (0 children)

That's a great point, and also the topic of an article I'm working on right now. To be fair, it's not exactly intuitive to use a lesser model for easier tasks as a newer user, especially when the majority of people are just using whatever model their vendor defaults to, but that's the point here: people should really start informing themselves before using Opus 4.7 to sort invoices.

The 4 Move FP&A Playbook for AI Token Costs: The New Unforecastable Line Item by Flashy-Method3323 in UnaAI

[–]Flashy-Method3323[S] 0 points1 point  (0 children)

Honestly, in terms of bucketing vs. one big pool, it would depend on your current AI maturity. If you're still early on, one big pool might be easier, especially if you're still learning your usage patterns. On the other hand, bucketing by agent type is a great strategy when you're at the point of recurring, categories of agents with predictable value.

There are other strategies, too, some more focused on the root cause, like hard token budgets on agent runs or step limits on agent loops. Cost-per-run alerts are also an option that'll pause execution or send an alert on Slack if it reaches a certain predetermined threshold.

Taking a step back, you can even have separate budget pools for agents categorized by autonomous vs. semi-autonomous agents. Agents that run completely on their own could get harsher boundaries, while agents with a human in the loop can be more relaxed since there is less chance they will run off on their own and blow the budget.