local-first AI dispatch — a serverless router that sends each request to the cheapest capable model, your infra + your keys.
prompt
│ embed @ edge (Workers AI · bge-base-en)
▼
classify ──▶ local_code · local_search · local_summarize · direct · claude_reason
│
└─ low-margin ─▶ forward to origin granite4 (the accurate path)
BYOK — your API keys + data + inference stay on your infra. We only ever return a routing decision.