Want to become a partner or sponsor, click here to schedule time to talk with the team.

FinOps & Beyond is what engineering, finance, and IT leaders read to understand FinOps, and what it means for operating models, accountability, and spend decisions.

FinOps X wrapped last week in San Diego, and the headline wrote itself. The token is becoming the atomic unit of AI.

That is not just a clever phrase. The Linux Foundation announced its intent to launch the Tokenomics Foundation, a new effort focused on open standards, benchmarks, and best practices for AI infrastructure economics. A week later, at FinOps X, the FinOps Foundation announced Tokenomicon, a dedicated conference for the economics of AI, and released FOCUS 1.4, the next version of the open billing specification.

That matters.

FinOps needed a better way to talk about AI consumption. Tokens are visible. Tokens are measurable. Tokens give the industry a shared unit for comparing usage across models, providers, gateways, applications, and teams. The standards work is real progress, and it is the foundation FinOps needs if AI spend is going to move from scattered experimentation into operating discipline.

But there was a second signal too.

AWS announced the public preview of AWS FinOps Agent, an agentic AI solution for cloud financial management. It can answer cost questions, investigate anomalies, generate recurring reports, surface optimization opportunities, and route work into Jira or Slack (only for AWS, but that’s for another day).

That may end up being the more important signal. The first signal says FinOps is getting a better meter for AI. The second says FinOps work itself is starting to move into agents. The meter matters. But the agent is the bigger shift. And that shift is still missing its trust layer.

The AWS Announcement

AWS FinOps Agent is interesting because it points directly at where the discipline is going.

For years, FinOps has had a familiar operating pattern. A central team looks at dashboards, reviews cost changes, investigates anomalies, builds reports, and tries to route findings back to the teams that own the work. That model is useful, but it is also slow. It depends on people noticing the right signal, interpreting it correctly, finding the owner, explaining the issue, and following up until something happens.

AWS is aiming at that gap.

The agent draws from AWS Cost Explorer, Cost Anomaly Detection, Cost Optimization Hub, and Compute Optimizer. It can investigate a cost anomaly, correlate it with operational context, produce a summary, and send the finding to the team through Jira or Slack. It can answer natural-language questions about cost and usage. It can generate recurring reports in formats people actually use.

That is not just a new interface. It is a different operating model.

FinOps moves from periodic review toward continuous investigation. Cost questions move closer to engineers. Reports move from manually assembled artifacts to scheduled outputs. Anomaly response moves from “someone should look at this” to “the agent already started.”

That is the right direction. It is also early.

Useful Is Not the Same as Trustworthy

This is where the conversation needs to get more honest.

AWS FinOps Agent is not a toy. It is not a random chatbot bolted onto a billing console. It is a serious product from the largest cloud provider in the world, aimed at a real operational problem. It is exactly the kind of thing FinOps teams should be paying attention to.

But the early hands-on experience already shows the gap between agent potential and agent trust.

One hands-on writeup, Trying Out AWS FinOps Agent with Slack Integration, found the setup straightforward, but noted that the Slack integration did not necessarily deliver the full operational detail someone might expect directly in-channel. Another post, FinOps Without the Walk of Doom, was broadly positive, but included a useful caveat: in one environment, the agent said it could not search CloudTrail, while similar CloudTrail-backed investigation worked elsewhere.

That is not a dunk on AWS. That is what real life looks like.

Agents behave differently across accounts, permissions, services, integrations, context files, data availability, and organizational structure. The same prompt can produce a useful answer in one environment and an incomplete or incorrect answer in another. The agent may have the right general capability but lack the right access, the right context, or the right interpretation for the situation in front of it.

That is the part the demo never fully captures.

The hard part is not producing an answer. The hard part is knowing whether the answer is correct, complete, and operationally useful.

I Saw the Same Thing in My Own Work

I have been experimenting with this directly.

I built a FinOps agent using open-source agents with Claude Code as the harness and pointed it at a real client account. This was not a synthetic demo. It was real spend, real account structure, and real questions a FinOps lead would ask.

The agent was useful. It saved time. It could move through data faster than I could manually. It could produce summaries, investigate patterns, and help shape the first pass of analysis.

But it also gave back answers that were not completely accurate.

Not because the model was useless. Not because agents are hype. Not because the whole thing failed. It was wrong because real FinOps work depends on context the agent did not automatically have.

A cloud account is not just a collection of services and charges. It has history. It has commitments. It has credits. It has tagging decisions, ownership patterns, shared platforms, weird exceptions, migration artifacts, account structures, abandoned experiments, and finance assumptions that live outside the billing API.

A person who has worked the account knows when a number looks strange. A person knows when a dashboard is technically correct but practically misleading. A person knows when a recommendation is reasonable in general but wrong for this customer, this workload, this quarter, or this business constraint.

The agent did not know those things until I supplied them, corrected it, or reframed the question.

That is the trust gap.

The Token Meter Will Not Solve This

This is why I think the token conversation and the agent conversation need to be connected carefully.

Tokens matter because they help us measure AI consumption. They tell us something important about the cost of running the agentic layer. The Tokenomics Foundation and FOCUS roadmap are important because FinOps needs a normalized way to understand AI usage and spend.

But tokens do not tell us whether the agent’s work was right.

A token meter can tell you how much the agent consumed. It cannot tell you whether the agent understood an account. It cannot tell you whether the recommendation was valid. In my example, it could not tell me whether a cost anomaly was actually a problem or a business signal. It could not tell me whether a report is safe to send to finance without someone checking the assumptions underneath it.

That distinction matters because the industry is going to be tempted to measure the easy part first and declare victory. How many tokens did the agent use? How much cheaper was the agent than a person? How many reports did it generate? How many tickets did it open?

Those are useful metrics, but they are not the same as trust.

The better question is whether the agent produced a correct outcome.

Did it identify the actual driver of the cost change? Did it route the finding to the right owner? Did it understand the account structure? Did it respect known exceptions? Did it avoid creating noise? Did it produce something the team could act on without redoing the analysis?

That is the the measurement that matters. Cost per correct outcome.

The Trust Layer Is the Next FinOps Problem

AWS already hints at part of the answer. The FinOps Agent supports context files and memory. The documentation and launch materials point to account-to-owner mappings, team definitions, tagging conventions, and review cadences. That is not a small feature. That is the beginning of the trust layer.

Agents need context to do useful work.

They need account ownership. They need tag standards. They need known exceptions. They need business calendars. They need commitment strategy. They need anomaly thresholds that match reality. They need rules for when to open a ticket, when to ask a human, and when to stay quiet.

Without that layer, an agent is just a fast analyst with partial context.

With that layer, it becomes more interesting. But building that layer is not automatic. It is work. This is the part practitioners should pay attention to. The agent does not eliminate the need for FinOps expertise. It changes where that expertise shows up.

Instead of spending all your time pulling reports, you may spend more time defining the context the agent needs to reason correctly. Instead of manually routing every finding, you may design the ownership model that lets routing happen safely. Instead of answering every engineer’s cost question, you may build the account, tag, and business mappings that let the agent answer without making things worse.

That is still FinOps. It is just moving from analysis as output to context as infrastructure.

What This Means Monday Morning

If your organization is looking at AWS FinOps Agent, or any FinOps agent, the first question should not be whether it can generate a report. It probably can.

The better question is whether the report can be trusted. Start there.

Can the agent see the right data? Does it understand the account structure? Does it know which teams own which accounts, services, tags, and workloads? Does it know the difference between a real anomaly and an expected seasonal pattern? Does it know which recommendations are allowed, which are blocked, and which require human review? Can its answers be reconciled back to Cost Explorer, CUR, dashboards, invoices, or whatever your organization treats as the source of truth?

And most importantly, who is responsible for checking it?

That may sound like a cautious way to talk about an exciting product category, but I think it is the opposite. This is how agents become real.

The agent era of FinOps will not be won by the first tool that produces the prettiest summary. It will be won by the systems that make agent output trustworthy enough to enter the operating model.

AWS FinOps Agent is a clear signal that the market is moving there. The early hands-on experience is a clear reminder that we are not done.

The token is becoming measurable.

The agent is becoming useful.

Now FinOps has to build the trust layer between them.

Written with the help of AI. All the ideas expressed are mine and mine alone.

Subscribe

FinOps Directory

Give Me Your Opinion