Added Llama 4 to routable models
Meta's Llama 4 is now available through /v1/auto/route for text, with both instruction-tuned and reasoning variants.
Every model, capability, and platform change from the GreatRouter team — published as it ships.
Meta's Llama 4 is now available through /v1/auto/route for text, with both instruction-tuned and reasoning variants.
The dashboard now breaks down spend by model, provider, and task type with daily, weekly, and monthly views. Export to CSV is one click.
Volume discounts, committed-use pricing, SSO, audit logs, and a dedicated engineering contact. Email contact@greatrouterai.com for details.
ASR is now a first-class routed task. The classifier detects audio inputs and routes to the best transcription model for the language and audio profile.
Per-model error rates now flow into the scoring engine in real time. The router avoids degraded models automatically and surfaces the cause in your activity log.
OpenAI's GPT-5 is available via /v1/auto/route and /v1/chat/completions. Pin a specific model id or let the router pick based on task and budget.
Set a default optimization mode, exclude specific providers, and prefer specific models — per organization, with API and dashboard management.
The budget_dollars parameter now hard-stops the router from considering models above the cap. Works across text, image, video, and audio requests.
New EU data residency option on enterprise plans. Billing metadata and request logs are stored in Frankfurt; provider calls remain closest-region.
Black Forest Labs' flagship image model is now in the registry. Supports photorealistic, stylized, and brand-LoRA workflows through the same endpoint.
Configure auto-recharge with configurable thresholds and top-up amounts. Wallet events are now visible in the activity log.
POST /v1/auto/suggest returns the top-ranked models for a task without executing the request. Useful for high-stakes flows that need approval before inference.
Anthropic's Sonnet 4 is in the registry with both text and vision modalities. Available via auto-route and the OpenAI-compatible endpoint.
GreatStudios now uses GreatRouter as its default inference layer. Every creative tool — image, video, music, chat — routes through the same intelligent endpoint.
Image, video, music, and code prompts under 120 characters are auto-expanded using a small reasoning model before being sent to the upstream provider.
Each entry here also ships to the newsletter. Subscribe from the blog or follow the public changelog RSS feed (coming soon).