Cutting Cloud Spend Without Slowing Down
Cloud bills creep, an oversized instance here, a forgotten environment there. FinOps makes that creep visible and reversible without slowing engineering.
Cloud bills don't explode because of one bad decision. They creep, an oversized instance here, a forgotten environment there, an on-demand workload that's been running 24/7 for eight months. By the time finance asks the question, the waste is structural. FinOps is the practice of making that creep visible and reversible without turning every engineer into an accountant.
The good news: most of the savings come from a short list of moves, and they don't require slowing the team down. Often they speed it up, because a tidy cost model is a tidy architecture.
The four levers that actually move the bill
Ignore the long tail. In practice four levers account for the overwhelming majority of realistic savings:
- Rightsizing, matching instance size to actual utilisation. The most common finding in any audit is fleets running at 15% CPU on instances picked by a guess two years ago.
- Commitment discounts, reserved instances or savings plans for the steady-state baseline you'll run regardless. This is free money you're declining by paying on-demand rates for predictable load.
- Spot / preemptible, for fault-tolerant, interruptible work (batch, CI, stateless workers) at a fraction of on-demand.
- Eliminating waste, idle resources, orphaned disks, unattached IPs, dev environments running over the weekend.
Model the savings yourself
The calculator below is deliberately simple but directionally honest. Set a baseline on-demand spend, then dial in how much of your fleet is steady-state (commit-able), how much is interruptible (spot-able), and how much idle capacity you can cut. Watch the optimised bill, and the annual saving, move in real time.
Don't over-commit
Reserved instances and savings plans are a forecast bet. Commit to the baseline you're confident you'll run, over-committing locks you into paying for capacity you no longer use.
Make cost a first-class signal
You can't optimise what nobody sees. The single highest-leverage FinOps investment isn't a tool, it's tagging discipline. Every resource tagged with a team, environment, and service turns an opaque bill into a per-team scoreboard.
# Enforce cost-allocation tags at provision time
locals {
required_tags = {
team = "checkout"
environment = "production"
service = "payment-api"
cost_center = "CC-4471"
}
}
# Policy-as-code rejects any resource missing these tags
# → untagged spend drops to ~0, every dollar is attributable
Cost is an engineering decision
The teams that stay efficient don't run quarterly cost fire-drills, they fold cost into the same place they handle latency and reliability. Showback dashboards, anomaly alerts on spend, and a culture where "what does this cost to run?" is a normal design question. The bill stops creeping when the people who create it can see it.
Key takeaways
- Four levers, rightsizing, commitments, spot, and killing waste, capture most savings.
- Commit only to the steady-state baseline you're confident you'll keep running.
- Tagging discipline turns an opaque bill into a per-team scoreboard.
- Fold cost into design reviews; efficiency is a culture, not a quarterly fire-drill.
Curious what's hiding in your bill?
We run a cost audit across rightsizing, commitments, spot, and waste, then wire up tagging and showback so the savings actually stick.
Run a cost audit