Take control of
your AI spend.

Building in public · Waitlist open · Beta Q3 2026

Retoken is the AI spend control layer we're building for the people whose Claude, ChatGPT, and API bills keep climbing without explanation. Join the waitlist to shape the product. The first 1,000 lock in Legacy Member pricing for life.

No spam. Early-access invites go out in waves as we onboard the first 1,000.

Free to join. See your savings before you pay a cent.

Dashboard

April 2026 · 5 errors detected

Saved

$108.40

prevented + optimised

Recovered

$11.40

cash-equivalent

Expiring

$17.30

in 28 days

Total impact

$137.10

saved · recovered · protected

Recent errors

Routed Opus → Sonnet

Apr 02 · Saved (prevention)

+$32.50

Plan downgrade — Claude

Apr 09 · Optimised (plan-fit)

+$58.20

Credit expiring — Anthropic

Apr 18 · 28 days left

⚠ $17.30

The Real Cost

AI errors aren't just frustrating.
They're expensive.

You don't just lose money on bad outputs. You lose it on the wrong plans, retries, timeouts, and silent failures across every AI tool you use.

$127

Average monthly credit waste

For professionals using 3+ AI tools daily, failed outputs, retries, and plan mismatch can quietly waste 20–35% of monthly spend.

47×

Iterations on a single bad prompt

Without spend visibility, users re-prompt the same failed request repeatedly, burning credits and time on every attempt.

$28K

Quietly expires into nothing

OpenAI, Anthropic, and Groq expire prepaid API credits after 12 months — often with no warning. Documented user losses range from $60 to $28,000.

Feature · Credit Expiry Timer

Stop watching credits
quietly expire into nothing.

Most prepaid API credits on OpenAI, Anthropic, and Groq evaporate after 12 months — silently, with no email, no refund. Retoken tracks every credit balance you have, counts down to expiry, and tells you what to burn down before the timer hits zero.

✓ Live countdown per platform: unified view of every credit pool, ranked by days until expiry.
✓ Burn-down recommendations: practical workloads (eval runs, batch summarisation, embedding refreshes) you can ship before the deadline.
✓ Smart alerts at 60 / 30 / 7 days: no last-minute scramble, no forfeited spend.
✓ Auto-reroute idle spend: if a balance is at risk, Retoken can suggest shifting future workloads to that provider so you actually use what you paid for.

Credit Expiry Tracker · 3 at risk

OpenAI API · Prepaid balance

$184.20

23d

Anthropic API · Workbench credits

$92.40

41d

Groq Cloud · Promo credits

$25.00

86d

Mistral API · Pay-as-you-go

$48.10

no expiry

$301.60 at risk across 3 providers within 90 days. Retoken will recommend a burn-down plan if you don't have workloads queued.
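
For the curious, here is a minimal sketch of how a tracker like the one above could rank balances and total what is at risk. It is an illustration only: `CreditPool`, `at_risk`, and `due_alerts` are hypothetical names, and the figures are the sample balances shown in the tracker.

```python
from dataclasses import dataclass
from typing import Optional

ALERT_DAYS = (60, 30, 7)  # alert thresholds quoted in the feature list
AT_RISK_WINDOW = 90       # the "within 90 days" window from the tracker summary


@dataclass
class CreditPool:
    provider: str
    balance: float
    days_left: Optional[int]  # None means the balance never expires


def at_risk(pools, window=AT_RISK_WINDOW):
    """Pools that expire within the window, soonest first."""
    expiring = [p for p in pools if p.days_left is not None and p.days_left <= window]
    return sorted(expiring, key=lambda p: p.days_left)


def due_alerts(days_left):
    """Alert thresholds already crossed for a pool with this many days left."""
    return [t for t in ALERT_DAYS if days_left <= t]


pools = [
    CreditPool("Mistral API", 48.10, None),
    CreditPool("Groq Cloud", 25.00, 86),
    CreditPool("OpenAI API", 184.20, 23),
    CreditPool("Anthropic API", 92.40, 41),
]

ranked = at_risk(pools)
total_at_risk = round(sum(p.balance for p in ranked), 2)
print([p.provider for p in ranked], total_at_risk)
# ['OpenAI API', 'Anthropic API', 'Groq Cloud'] 301.6
```

With these sample balances, `due_alerts(23)` returns `[60, 30]`, so the OpenAI pool would already have triggered the 60-day and 30-day alerts.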

Under the Hood

Three steps.
Almost zero manual work.

Our engine watches every AI interaction, spots waste, recommends cheaper paths, and generates dispute-ready evidence automatically.

Observe every interaction

Every AI request, response, and cost is logged in real time, not just failures. You finally see where your spend actually goes.

Spot waste and the wrong plan

Retoken classifies failures and checks whether your usage fits your current plan and model, surfacing cheaper, safer options before you overpay.

Surface what's actually claimable

Each claim ships with an evidence pack. Retoken either auto-files where allowed, queues it for your approval, or exports a ready-to-send package.

~200ms

Error detection latency

12+

Waste & error types classified

100%

Audit log coverage

3 modes

Auto-file · approve-first · export

How It Works

From spend to savings
in three steps.

No workflow changes. You keep using your AI tools. Retoken handles logging, savings recommendations, and policy-aware recovery in the background.

Tell us what you use

Once you're off the waitlist, you'll link your AI tools via API key or OAuth in under five minutes. No code changes, no new habits — Retoken reads your usage, costs, and errors immediately.

Waste, errors, and plan fit are flagged

Retoken captures spend patterns and failures, matches each against platform policy, and routes the result to save, recover, or monitor — instead of always auto-filing.

Savings and credits land back

Depending on the platform, Retoken either files the claim for you, prepares it for your approval, or exports a ready-to-send package. You see the full audit trail in your dashboard.

Why Retoken

The AI spend control layer
your tools are missing.

See what your AI bill actually looks like — and where Retoken saves, optimises, and recovers on your behalf.

Charged

SPEND AUDIT · APR 2026
account: you@company.com

Saved & Recovered

Error event | Amount | Saved / Recovered | Status
Routed Opus → Sonnet, quality matched (Apr 02 · ongoing · Anthropic Claude) | −$32.50 | +$32.50 | ✓ Saved (Prevention · auto)
Cached duplicate prompt, reused result (Apr 05 · 02:31pm · OpenAI GPT-4o) | −$11.40 | +$11.40 | ✓ Saved (Prevention · auto)
Plan downgrade, Claude Max → Pro fits usage (Apr 09 · next cycle · Anthropic) | −$58.20 | +$58.20 | ✓ Optimised (Plan-fit · 1-click)
API timeout, 500 server error refunded (Apr 14 · 04:58pm · OpenAI API) | −$11.40 | +$11.40 | ✓ Recovered (Refund · 1 day)
Credit expiry alert, burn-down recommended (Apr 18 · expires in 28 days · Anthropic API) | −$17.30 | +$17.30 | ⏳ Pending (Action required)
Monthly total | −$143.00 | +$125.70 | 1 claim pending · avg 3 day turnaround

Methodology

How we calculate
every number you see.

No black box. Every figure on your dashboard is computed from public platform data and your own usage — and we'll show you the formula.

Plan-fit savings

current_plan_cost − best_fit_plan_cost

For each LLM, we model your last 90 days of usage against every published tier (free, Plus, Pro, Team, API pay-as-you-go). The delta against your current plan is the monthly savings figure.

Source: published platform pricing pages
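
As a rough illustration of the plan-fit formula, the sketch below prices one month of usage on each tier and reports the gap to the cheapest one. Everything here is an assumption for demonstration: the `Tier` fields, the tier names, and the prices are made up and do not reflect any platform's real pricing.

```python
from dataclasses import dataclass


@dataclass
class Tier:
    name: str
    monthly_cost: float      # flat subscription fee (hypothetical figure)
    included_units: float    # usage covered by the flat fee
    overage_per_unit: float  # price per unit beyond the allowance


def cost_on(tier, monthly_usage):
    """What a month of this usage would cost on the given tier."""
    overage = max(0.0, monthly_usage - tier.included_units)
    return tier.monthly_cost + overage * tier.overage_per_unit


def plan_fit_savings(current, tiers, monthly_usage):
    """current_plan_cost - best_fit_plan_cost, never below zero."""
    best = min(tiers, key=lambda t: cost_on(t, monthly_usage))
    saving = cost_on(current, monthly_usage) - cost_on(best, monthly_usage)
    return best.name, round(max(0.0, saving), 2)


tiers = [
    Tier("small", 20.0, 100, 0.50),
    Tier("large", 100.0, 1000, 0.10),
]
# Average monthly usage over the last 90 days (made-up number).
print(plan_fit_savings(tiers[1], tiers, monthly_usage=120))
# ('small', 70.0)
```

If you are already on the best-fit tier, the same call returns a saving of 0.0.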

Model-routing savings

Σ (premium_tokens × Δprice)

When a cheaper model could have answered a prompt at equivalent quality, we count the price difference as savings. Quality equivalence is determined by classification rules tuned on public benchmarks (MMLU, GSM8K, HumanEval).

Source: token pricing + benchmarks
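
A minimal sketch of the routing formula, assuming prices are quoted per million tokens and that a separate quality check has already marked which prompts a cheaper model could handle. The `routable` flag and the prices are hypothetical; the quality-equivalence rules themselves are not modelled here.

```python
def routing_savings(prompts, premium_price, cheaper_price):
    """Sum of premium_tokens x price difference over routable prompts.

    Prices are per million tokens (an assumption for this sketch).
    """
    delta = premium_price - cheaper_price
    total = sum(p["tokens"] * delta / 1_000_000 for p in prompts if p["routable"])
    return round(total, 2)


prompts = [
    {"tokens": 400_000, "routable": True},
    {"tokens": 250_000, "routable": False},  # needs the premium model
    {"tokens": 600_000, "routable": True},
]
print(routing_savings(prompts, premium_price=15.0, cheaper_price=3.0))
# 12.0
```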

Prompt-waste prevention

retry_tokens + over_spec_tokens

Tokens spent on duplicate retries, oversized context windows, and known-failure prompt patterns. We count the spend that would have happened if Retoken had not intervened.

Source: your own usage logs
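
One way this could look in practice: the sketch below treats every repeat of an already-seen prompt as a retry, adds the over-spec tokens, and converts the token count to spend. The duplicate rule, the per-million price, and the function names are assumptions made for illustration, not the production logic.

```python
def retry_tokens(requests):
    """Tokens spent re-sending a prompt that was already sent once."""
    seen = set()
    wasted = 0
    for prompt_hash, tokens in requests:
        if prompt_hash in seen:
            wasted += tokens
        seen.add(prompt_hash)
    return wasted


def prompt_waste(requests, over_spec_tokens, price_per_million):
    """Return (wasted tokens, wasted spend) for one usage log."""
    wasted = retry_tokens(requests) + over_spec_tokens
    return wasted, round(wasted * price_per_million / 1_000_000, 2)


# (prompt hash, tokens used) pairs from a made-up usage log.
requests = [("a1", 50_000), ("a1", 50_000), ("b2", 30_000), ("a1", 50_000)]
print(prompt_waste(requests, over_spec_tokens=100_000, price_per_million=5.0))
# (200000, 1.0)
```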

Credit-expiry exposure

balance × (1 − burn_rate × days_left)

For each prepaid balance, we project burn against historical usage. Anything not burnt by the platform's expiry window is flagged as at-risk — not as recovered savings, just as exposure.

Source: provider Ts & Cs (12-month expiry)
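
A small sketch of the exposure formula under two assumptions that the formula itself does not state: `burn_rate` is read as the fraction of the balance you use per day, and the result is clamped so it never drops below zero or exceeds the balance.

```python
def expiry_exposure(balance, burn_rate, days_left):
    """balance x (1 - burn_rate x days_left), clamped to [0, balance].

    burn_rate is assumed to be the fraction of the balance used per day.
    """
    remaining_fraction = 1 - burn_rate * days_left
    return round(balance * max(0.0, min(1.0, remaining_fraction)), 2)


print(expiry_exposure(100.0, 0.01, 40))  # 60.0: 40% gets used, 60% is exposed
print(expiry_exposure(100.0, 0.05, 40))  # 0.0: usage clears the balance in time
print(expiry_exposure(100.0, 0.0, 40))   # 100.0: no usage, whole balance exposed
```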

Recovered credits

Σ successful_claims (post-platform approval)

Only counted after the platform has approved the credit or refund. Pending claims appear separately. We never count "likely" or "eligible" credits in the recovered total.

Source: platform credit API receipts
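
To make the counting rule concrete, here is a minimal sketch that keeps approved and pending claims in separate totals and ignores everything else. The status labels (`approved`, `pending`, `eligible`) and the amounts are hypothetical.

```python
def recovered_totals(claims):
    """Return (recovered, pending). Only approved claims count as recovered."""
    recovered = sum(c["amount"] for c in claims if c["status"] == "approved")
    pending = sum(c["amount"] for c in claims if c["status"] == "pending")
    return round(recovered, 2), round(pending, 2)


claims = [
    {"amount": 11.40, "status": "approved"},
    {"amount": 17.30, "status": "pending"},
    {"amount": 5.00, "status": "eligible"},  # never counted in either total
]
print(recovered_totals(claims))
# (11.4, 17.3)
```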

Total monthly impact

savings + optimised + recovered

The headline figure on your dashboard. Prevention and optimisation typically do the heavy lifting; recovery is the smaller, harder-won piece. We show all three separately so you always know which lever moved.

Source: sum of the above
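
The headline figure is a plain sum, sketched below with made-up inputs. The function name and the returned dictionary shape are assumptions. The three components stay visible next to the total so you can see which lever moved.

```python
def total_monthly_impact(savings, optimised, recovered):
    """savings + optimised + recovered, with each component kept visible."""
    return {
        "savings": savings,
        "optimised": optimised,
        "recovered": recovered,
        "total": round(savings + optimised + recovered, 2),
    }


print(total_monthly_impact(savings=40.0, optimised=50.0, recovered=10.0)["total"])
# 100.0
```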

Our savings logic will be open-sourced under MIT before public launch. If a number on your dashboard doesn't match your own audit, email hello@re-token.com and we'll walk you through the math.

Pricing

Simple, transparent pricing.
Founder pricing locked for life.

First 1,000 users become Legacy Members: 30-day free trial, Pro locked at $19/mo for life, founder Slack access, and priority on new features.

Starter

Free

For individuals exploring their AI spend.

✓ Up to 2 platforms connected
✓ Spend & waste monitoring
✓ Basic savings insights
✓ 10 disputes per month
– Policy-aware auto-filing
– Priority claim processing

Join Waitlist

What early testers are saying.

"I had no idea how much I was losing to failed prompts. Retoken flagged $84 in claimable credits in the first week alone."

James R.

Indie developer · 6 AI tools daily

"The JSON error classification is genuinely impressive. It caught a Midjourney generation failure I'd completely written off."

Sarah K.

AI product manager · Series A startup

"Finally — someone built the tool I've been manually doing in a spreadsheet for 8 months. Should've existed years ago."

Marcus T.

Freelance engineer · Claude & GPT-4 power user

FAQ

Questions we get asked.

How does Retoken connect to my AI tools?

When the beta opens, you'll connect via API key or OAuth — the same way any developer tool integrates. We'll observe your usage and minimal prompt/response metadata to power savings insights and recovery. We never act outside what each platform's policy allows, and we never resell your data.

Is my data safe?

Yes. We only store metadata about failed outputs — timestamps, error codes, credit amounts. Prompt content and output text are hashed and never stored in plaintext. All data is encrypted at rest and in transit.
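
As an illustration of what "hashed, never stored in plaintext" can mean, the sketch below keeps only metadata plus a keyed SHA-256 digest of the prompt and output. The choice of HMAC-SHA256, the field names, and the demo key are assumptions for this example, not a description of the actual storage layer.

```python
import hashlib
import hmac


def fingerprint(text: str, key: bytes) -> str:
    """Keyed SHA-256 digest, so the raw text never needs to be stored."""
    return hmac.new(key, text.encode("utf-8"), hashlib.sha256).hexdigest()


def to_stored_record(event: dict, key: bytes) -> dict:
    """Keep metadata and digests only; drop the prompt and output text."""
    return {
        "timestamp": event["timestamp"],
        "error_code": event["error_code"],
        "credit_amount": event["credit_amount"],
        "prompt_hash": fingerprint(event["prompt"], key),
        "output_hash": fingerprint(event["output"], key),
    }


key = b"demo-key-not-for-production"  # hypothetical; a real key would be a managed secret
event = {
    "timestamp": "2026-04-14T16:58:00Z",
    "error_code": 500,
    "credit_amount": 11.40,
    "prompt": "Summarise this contract",
    "output": "",
}
record = to_stored_record(event, key)
print(sorted(record))
```

The same prompt always produces the same digest under the same key, which is enough to spot duplicate prompts without keeping the text itself.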

What can Retoken actually recover from each platform?

Most platforms (OpenAI ChatGPT, Anthropic Claude Pro/Max, Google Gemini, Perplexity, Cursor, GitHub Copilot) treat used credits as non-refundable. We don't pretend otherwise. Where recovery does work — API server errors, expiring unused credits, EU/UK/Turkey 14-day cancellation rights, published failed-job policies on Midjourney and Runway, and self-serve refund flows on DeepSeek and OpenRouter — Retoken automates it. Everywhere else, the bigger savings come from prevention and plan-fit, which is where most of your money actually leaks.

I'm in Australia — does this work for me?

Yes, and it's a particular strength. Australian Consumer Law gives you statutory rights to remedies for AI services that fail to perform as advertised — rights no platform's terms of service can override. Retoken is built in Melbourne and flags ACL-eligible claims explicitly, preparing them in the format Australian users actually need.

Is this just a refund tool?

No. Retoken is an AI spend control layer. It prevents waste with prompt linting, smart routing, and budgets, optimises plans and models per LLM, and then recovers credits where genuine failures still happen. Refunds are the last step, not the whole product.

How does policy-aware recovery work?

Every platform is different, so Retoken offers three modes per integration: auto-file where the platform clearly allows it, approve-first where you click to send, and manual export where automation is risky or unsupported. We never bot support channels blindly.
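
A minimal sketch of how the three modes could be chosen per integration. The two policy flags are hypothetical stand-ins for whatever each platform's terms actually allow.

```python
from enum import Enum


class Mode(Enum):
    AUTO_FILE = "auto-file"
    APPROVE_FIRST = "approve-first"
    EXPORT = "export"


def pick_mode(allows_automation: bool, has_submission_channel: bool) -> Mode:
    """Choose the most cautious mode the platform's policy permits."""
    if allows_automation and has_submission_channel:
        return Mode.AUTO_FILE
    if has_submission_channel:
        return Mode.APPROVE_FIRST  # you click to send
    return Mode.EXPORT             # ready-to-send package only


print(pick_mode(True, True).value)    # auto-file
print(pick_mode(False, True).value)   # approve-first
print(pick_mode(False, False).value)  # export
```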

Do I need to change how I use my AI tools?

No. Retoken works in the background. You use your tools exactly as you do today. The only difference is you start seeing what's wasted, what's recoverable, and which plan or model actually fits your usage.

Which platforms are supported at launch?

At launch: OpenAI (ChatGPT + API), Anthropic (Claude), Google (Gemini), Midjourney, and Replit. We're adding Perplexity, Grok, Meta AI, DeepSeek, and Hugging Face in the months following launch. Legacy Members shape the priority order.