One API. Every Model.

1,516 routable models across 104 inference providers (3,109 models and 792 catalog entries in the full registry). Images, videos, music, audio, editing, multimodal, text, semantic search, classification, and translation — all through a single endpoint.

Suggestion: Add capability filters like web_search or vision in your API payload — the router narrows to matching models automatically.

Generative AI

Representative model lines below are drawn from the live registry and change as the catalog grows.

Core AI

Text Generation

Claude Fable 5, Claude Opus 4.6, Claude Opus 4.7, Claude Opus 4.8, Claude Sonnet 4, and more — chat, code, reason, create.

From $0.10 / 1M tokens

Classification

Distilbert Sst 2 Int8, Resnet 50 — content moderation, sentiment analysis, categorization.

From $0.0000025 / request

Semantic Search & Translation

Bge Large En V1.5, Embeddinggemma 300m for semantic search and similarity. Indictrans2 En Indic 1B for 200+ language translation.

From $0.008 / 1M tokens

API Features

Cost Optimization

Set optimization: "price-optimized" or a budget_dollars cap per request, and GreatRouter automatically picks the cheapest capable model. No manual price comparisons — the routing engine scores options in real time across 1516 models from 792 providers.

Every request includes the actual cost in the response metadata. Track spending by model, provider, or task type from your dashboard. Auto-recharge keeps your balance topped up so you never hit a service interruption.

Budget per Request

Pass budget_dollars and the router excludes any model whose estimated cost exceeds the cap. Works for text, image, video, and audio requests.

Price Tiers

Filter by price tier — Economy, Standard, Balanced, Premium, or Flagship — to narrow the model pool before routing. Each tier represents a pricing percentile within its category, so Balanced models sit near the average cost.

Fine-Grained Parameters

Control routing with precise parameters. Everything is optional — use as much or as little control as you need.

task_typetext_generation, text_to_image, text_to_video, music_generation, etc.
content_modeauto, generate, edit — controls whether to create new content or modify existing
capabilitiesArray — vision, reasoning, function_calling, web_search, realtime, lora, batch
providerPin to a specific provider — openai, anthropic, google, meta, etc.
budget_dollarsMaximum estimated cost in USD for this request. The router excludes models above the cap.
maxCosteconomy, standard, balanced, premium, flagship — filters by model pricing percentile within its category

Intent Classification

GreatRouter reads your prompt and automatically detects the intent — chat, image generation, transcription, translation, and more.

Content Mode

generate creates new content from a prompt. edit modifies existing content. auto lets the router decide based on context.

Tools and Functions

Web Search

Models with web search capability can access real-time information from the internet. Route to models like GPT-4o with search for up-to-date answers.

{"model": "router",
 "capabilities": ["web_search"],
 "messages": [...]}

Function Calling

Send tool definitions and get structured JSON responses. Works with GPT-4o, Claude, Gemini, Mistral, and other models that support function calling.

{"model": "router",
 "capabilities": ["function_calling"],
 "tools": [...],
 "messages": [...]}

Vision

Send images alongside text prompts. Route to multimodal models that can analyze, describe, and reason about visual content.

{"model": "router",
 "capabilities": ["vision"],
 "messages": [
   {"role": "user",
    "content": [
      {"type": "image_url",
       "image_url": {"url": "..."}},
      {"type": "text",
       "text": "Describe this"}]}]}

Reasoning

Route to models optimized for complex reasoning — math, logic, analysis. Includes o1, DeepSeek R1, and other chain-of-thought models.

{"model": "router",
 "capabilities": ["reasoning"],
 "messages": [...]}

Capabilities

EditingModify existing images, video, and text with content_mode edit
CodingGenerate, review, and debug code across 39 code-capable models
RealtimeStreaming responses for chat and transcription with low time-to-first-token on supported models
LoRAFine-tuned image generation with custom LoRA adapters
BatchSubmit batches of requests for asynchronous processing
VisionImage understanding, OCR, document analysis, and visual Q&A

Providers

OpenAI GPT-5.4, Whisper Large V3 Turbo, GPT Image 2 API, TTS-1
Anthropic Claude Fable 5
Google Nano Banana, Veo 3, Gemini 3 Flash, Embeddinggemma 300m, Gemini 3.1 Flash TTS
Meta M2m100 1.2b, Llama 3.3 70b Instruct Fp8 Fast, Llama 4 Scout 17B Vision VQA
Mistral Mistral Small 3.1 24b Instruct, Mistral Small 3.1 24B Vision
DeepSeek DeepSeek V4 Pro
xAI Grok Imagine Image, Grok Imagine Video, Grok TTS, Grok 4.20 0309 Non Reasoning, Grok Imagine Image
Alibaba Hh1 T2v, Wan 2.6 Image, Qwen3 Max, Happy Horse
Qwen Qwq 32b, Qwen3 Embedding 0.6b
Microsoft Mai Image 2.5 Text to Image, Mai Image 2.5, Resnet 50, Speecht5 Tts
NVIDIA Nemotron 3 Nano Omni, Cosmos 3 Super Image to Video, Cosmos 3 Super, Canary 1b Flash
Stability AI Stable Diffusion Xl Base 1.0
Black Forest Labs Flux 1 Schnell
Runway Gen 4.5, Stable Diffusion V1 5 Img2img
Suno Suno V4.5 All, Bark Small
Deepgram Nova 3, Aura 1
ByteDance Seedance 2 Image to Video, Seedream 4.0, Seedance 2.0
Recraft Recraftv4 1 Pro Vector
MiniMax Speech 2.8 Hd, Music 2.6, MiniMax M3, Hailuo 2.3
PixVerse V6
Vidu Q3 Turbo
AssemblyAI Universal 3 Pro

By task type

Image Generation 221+ models $0.004 – $0.12 per image
Image Editing 563+ models $0.004 – $0.08 per image
Image-to-Image 364+ models $0.004 – $0.10 per image
Video Generation 320+ models $0.05 – $0.50 per second
Image-to-Video 183+ models $0.08 – $0.60 per second
Music Generation 73+ models $0.08 – $0.20 per track
Text-to-Speech 64+ models $0.015 – $0.03 per 1K chars
Vision (Image-to-Text) 40 models $0.02 – $0.08 per image
Background Removal 364+ models $0.002 – $0.01 per image
Style Transfer 364+ models $0.004 – $0.06 per image
Chat / Text 171+ models $0.10 – $15.00 / 1M tokens
Speech-to-Text 20 models $0.0043 – $0.036 per minute
Semantic Search 18 models $0.008 – $0.10 / 1M tokens
Classification 8 models $0.0000025 – $0.0001 / req
Translation 32 models $0.008 – $0.50 / 1M tokens