Stop wasting money on agent runs

Vibe Billing scans Claude Code, OpenClaw, and OpenAI-compatible agent logs, shows exactly where money is being burned, then fixes it in one command.

186 developers · $8,566.96 saved · 831 loops killed
$ npx vibe-billing scan
View on GitHub
Find wasted spend in under 30 seconds. No signup.

Live Activity

$8,566.96 saved
claude-haiku-4-5-20251001
1k tokens
389ms
Proxied
claude-haiku-4-5-20251001
1k tokens
497ms
Proxied
claude-sonnet-4-6
32k tokens
1.8s
$0.09
Cache Hit
claude-sonnet-4-6
32k tokens
3.6s
$0.09
Cache Hit
claude-sonnet-4-6
32k tokens
2.4s
$0.09
Cache Hit
claude-sonnet-4-6
30k tokens
2.6s
$0.08
Cache Hit
claude-sonnet-4-6
110k tokens
2.1s
$0.30
Cache Hit
claude-sonnet-4-6
110k tokens
2.5s
$0.30
Cache Hit
claude-sonnet-4-6
110k tokens
1.8s
$0.30
Cache Hit
claude-sonnet-4-6
108k tokens
3.4s
$0.29
Cache Hit
claude-sonnet-4-6
106k tokens
5.8s
$0.29
Cache Hit
claude-sonnet-4-6
30k tokens
5.0s
$0.08
Cache Hit
claude-sonnet-4-6
84k tokens
1.7s
$0.23
Cache Hit
claude-sonnet-4-6
84k tokens
1.3s
$0.23
Cache Hit

From scan to safe mode in 60 seconds

Find the waste first. Route traffic second. Keep the agent under control after that.

1

Scan

Analyze local transcripts and see retry loops, context re-sends, and overkill model usage before you install anything.

npx vibe-billing scan
2

Setup

Run one command to patch configs, verify the connection, and route agent traffic through the firewall.

npx vibe-billing setup
3

Stay in control

Prompt caching, loop detection, budget caps, and smarter routing kick in once traffic is flowing.

Agent → Firewall → LLM

Example Scan Output

Run the scan first. If the waste is real, route traffic through the firewall after that.

Example Scan Output
$ npx vibe-billing scan
Analyzing your agent usage for waste patterns and savings opportunities...
Agent Waste Report
Runs analyzed:184
Retry loops:6
Context re-sends:34
Overkill model usage:51
Total agent spend:$381.42
Estimated wasted spend:$312.76
Fix with:
$ npx vibe-billing setup
Reads local Claude Code and OpenClaw logs first. No signup required to see the waste.

What a 2-hour Claude Code session actually costs

Real numbers from a production coding session with Opus.

Without Firewall
Prompt tokens (repeated context)2.4M
Duplicate full-codebase reads12x
Stuck retry loops3 loops
Cache hit rate0%
Total cost $47.20
With Firewall
Prompt tokens (cached)2.4M
Cache hit rate (auto-injected)90%
Loops killed before waste3 killed
Effective cost after caching$7.80
Total cost $7.80
$39.40 saved per session
That's $591 / month for a developer running 3 sessions per day.