tokenRoute
Eval gate
AI routing intelligence

Model routing, backed by benchmark evidence.

TokenRoute compares prompt workloads across local, private, and BYOK cloud models, then turns the evidence into route decisions and paid benchmark intelligence.

Benchmark route story
prompt to route decision
live flow
prompt: I was charged twice after upgrading my plan yesterday.
promptcloudlocalprivatejudge 1judge 2policy
1 policy
Recommended for this benchmark

The result includes recommendation, fallback order, score evidence, latency, and confidence.

route_decision:
{
  model: "qwen-2.5-7b",
  score: 94, latency: "1.28s",
  fallback: ["llama3.2"]
}
Prompt in
Contract captured
Route
Candidates run
Judge
Evidence scored
Result
Policy exported
Benchmark
Prompt contract against selected models
Score
Quality, latency, cost, and failure evidence
Intelligence
Paid plans include aggregate benchmark snapshots
What TokenRoute is

A benchmark intelligence layer for AI model selection.

Not another generic gateway. TokenRoute decides what your gateway, app, or agent should route to.

Benchmark packs

Repeatable runs across candidate models with score, latency, cost, failures, and raw output evidence.

Route-decision API

Apps and agents ask which model to use for a workload under quality, latency, and cost constraints.

Policy exports

Push advisory fallback order and thresholds into gateways such as LiteLLM or your own router.

Benchmark intelligence

Paid plans include aggregate model/category intelligence from TokenRoute-owned benchmark corpuses.

Integration-first

Bring your models. Keep your stack.

TokenRoute is the glue layer between model catalogs, local connectors, gateways, CI, and production apps.

Connector surface
available and planned integration paths
Local/private
Ollama and private endpoints through the Local Connector Agent
BYOK cloud
OpenRouter, OpenAI, Anthropic, and provider adapters
Gateways
LiteLLM-style policy export first, live routing later
Engineering
JSON/CSV packs, CI gates, and route-decision API
Why it matters

Make model changes defensible.

No blind routing

Every recommendation links back to benchmark evidence.

BYOK friendly

Use local, private, and cloud models without TokenRoute carrying inference spend.

Intelligence included

Paid plans get benchmark intelligence without customer prompt reuse.

Pricing

Adoption-first pricing for model-routing teams.

Bring your own provider keys. Paid plans include benchmark intelligence, route-decision usage, and evidence history; provider inference remains BYOK/local.

Free

$0 USD
per month
No card required
Free workspace access

Useful trial for validating a real workload before committing.

Included
10 benchmark packs / month
1,000 route API calls / month
1 project and 1 user
30-day history
3 models per run
25 prompts per run
BYOK and local/private models
Start free

Builder

Recommended
$7.50 USD
per month
Two months off
$90 USD billed annually

Low-friction plan for builders, small apps, and client experiments.

Included
75 benchmark packs / month
10,000 route API calls / month
3 projects and 2 users
90-day history
5 models per run
50 prompts per run
Benchmark intelligence snapshots
Choose Builder

Team

$24.17 USD
per month
Two months off
$290 USD billed annually

Shared benchmark evidence for early AI product and platform teams.

Included
300 benchmark packs / month
50,000 route API calls / month
10 projects and 5 users
1-year history
Shared benchmark packs
Broader intelligence and trend history
Launch-gate reports and priority support
Choose Team
Annual prices are billed upfront and equal 10 paid months.
Provider inference remains BYOK/local; TokenRoute does not resell tokens.
Benchmark intelligence is aggregate by default; customer prompts stay tenant-private.
Benchmark Pack Add-on 100
$5 USD/month / $50 USD/year
Route API Add-on 100k
$2 USD/month / $20 USD/year
Ready for the build path.

Start with a benchmark pack, inspect the evidence, then use paid benchmark intelligence to choose model routes with more confidence.