One API. Every Model.

1,516 routable models across 104 inference providers (3,109 models and 792 catalog entries in the full registry). Images, videos, music, audio, editing, multimodal, text, semantic search, classification, and translation — all through a single endpoint.

Suggestion: Add capability filters like web_search or vision in your API payload — the router narrows to matching models automatically.

Generative AI

Representative model lines below are drawn from the live registry and change as the catalog grows.

Image Generation

Nano Banana, Flux 1 Schnell, Flux 2 Flex, Flux 2 Klein 4b, Seedream 4.0, and more — generate photorealistic, artistic, or stylized images from text prompts. Supports LoRA fine-tuning for consistent brand styles.

From $0.004 per image

Text-to-Image · Style Transfer · LoRA

Video Generation

Veo 3, Seedance 2 Image to Video, Seedance 2 Reference to Video, PixVerse V6, and Vidu Q3 Turbo: create cinematic video from text or image prompts. Supports camera motion control, character consistency, and scene transitions.

From $0.05 per second

Text-to-Video · Image-to-Video · Motion Control

Music & Audio

Suno V4.5 All, Suno V5, and Music 2.6 for full-song and instrumental generation. Nova 3 and Whisper Large V3 Turbo for transcription. Speech 2.8 Hd and Speech 2.8 Turbo for natural speech synthesis.

From $0.0043 per minute

Text-to-Music · Speech-to-Text · Text-to-Speech

Image Editing

Background removal, inpainting, outpainting, upscaling, and style transfer with models such as GPT Image 2 API, Fibo Edit [Erase by Text], and Fibo Edit [Relight]. Edit existing images with text prompts: change objects, extend canvases, or enhance resolution.

From $0.004 per edit

Background Removal · Inpainting · Upscaling

Multimodal Generation

Vision, image-guided generation, and mixed text-and-image inputs. Combine prompts with reference images, video frames, or audio for complex creative workflows.

From $0.01 per request

Vision · Image+Text · Reference Media

Text (LLM)

Claude Fable 5, Claude Opus 4.6, Claude Opus 4.7, Claude Opus 4.8, Claude Sonnet 4, and more — chat, reasoning, code generation, instruction following, and creative writing. Long-context models support windows up to 2M tokens.

From $0.10 / 1M tokens

Chat & Completion · Code Generation · Reasoning

Core AI

Text Generation

Claude Fable 5, Claude Opus 4.6, Claude Opus 4.7, Claude Opus 4.8, Claude Sonnet 4, and more — chat, code, reason, create.

From $0.10 / 1M tokens

Classification

Distilbert Sst 2 Int8, Resnet 50 — content moderation, sentiment analysis, categorization.

From $0.0000025 / request

Semantic Search & Translation

Bge Large En V1.5 and Embeddinggemma 300m for semantic search and similarity. Indictrans2 En Indic 1B for 200+ language translation.

From $0.008 / 1M tokens

API Features

Cost Optimization

Set optimization: "price-optimized" or a budget_dollars cap per request, and GreatRouter automatically picks the cheapest capable model. No manual price comparisons: the routing engine scores options in real time across 1,516 routable models from 104 inference providers.

Every request includes the actual cost in the response metadata. Track spending by model, provider, or task type from your dashboard. Auto-recharge keeps your balance topped up so you never hit a service interruption.
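If you also want to tally spend on the client side, the per-request cost can be summed by model or provider. The sketch below is illustrative only: the `metadata`, `model`, `provider`, and `cost_usd` field names are assumptions, since the exact shape of the response metadata is not specified here.

```python
from collections import defaultdict


def spend_by(responses: list[dict], key: str) -> dict[str, float]:
    """Sum the reported cost of each response, grouped by a metadata field."""
    totals: dict[str, float] = defaultdict(float)
    for response in responses:
        meta = response["metadata"]  # assumed location of the cost metadata
        totals[meta[key]] += meta["cost_usd"]  # assumed field name
    return dict(totals)


# Sample responses in the assumed shape; names and values are illustrative only.
responses = [
    {"metadata": {"model": "model-a", "provider": "provider-x", "cost_usd": 0.25}},
    {"metadata": {"model": "model-b", "provider": "provider-x", "cost_usd": 0.5}},
    {"metadata": {"model": "model-a", "provider": "provider-y", "cost_usd": 0.125}},
]
print(spend_by(responses, "model"))
print(spend_by(responses, "provider"))
```

Check the real field names against an actual response before relying on a tally like this.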

Budget per Request

Pass budget_dollars and the router excludes any model whose estimated cost exceeds the cap. Works for text, image, video, and audio requests.
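The budget filter described above can be pictured as a two-step selection: drop every candidate whose estimated cost exceeds the cap, then take the cheapest one left. This is a minimal sketch, not GreatRouter's routing code; the `Candidate` type, its field names, and the sample prices are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Candidate:
    """Hypothetical catalog entry; the field names are assumptions."""
    model: str
    estimated_cost_usd: float


def pick_cheapest(
    candidates: list[Candidate],
    budget_dollars: Optional[float] = None,
) -> Optional[Candidate]:
    """Drop candidates above the cap, then return the cheapest survivor."""
    allowed = [
        c for c in candidates
        if budget_dollars is None or c.estimated_cost_usd <= budget_dollars
    ]
    # min() raises on an empty sequence, so handle "nothing fits" explicitly.
    return min(allowed, key=lambda c: c.estimated_cost_usd) if allowed else None


pool = [
    Candidate("model-a", 0.12),
    Candidate("model-b", 0.004),
    Candidate("model-c", 0.03),
]
print(pick_cheapest(pool, budget_dollars=0.05))   # cheapest model within the cap
print(pick_cheapest(pool, budget_dollars=0.001))  # None: every model exceeds the cap
```

A real router would also weigh capability and quality; this sketch isolates only the cost cap.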

Price Tiers

Filter by price tier (Economy, Standard, Balanced, Premium, or Flagship) to narrow the model pool before routing. Each tier corresponds to a pricing percentile band within its category, so Balanced models sit near the category's median cost.
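One way to picture percentile-based tiers is to rank a model's price against every price in its category and split the ranking into five bands. The sketch assumes five equal 20-point bands, which is a guess; the actual cut-offs are not stated here, and the `price_tier` function and sample prices are hypothetical.

```python
from bisect import bisect_left

TIERS = ["economy", "standard", "balanced", "premium", "flagship"]


def price_tier(price: float, category_prices: list[float]) -> str:
    """Map a price to a tier by its percentile rank within its category.

    Assumes five equal bands; the real tier boundaries may differ.
    """
    ranked = sorted(category_prices)
    cheaper = bisect_left(ranked, price)  # number of strictly cheaper models
    band = cheaper * 5 // len(ranked)     # integer math avoids float rounding
    return TIERS[min(band, 4)]


# Illustrative per-image prices for one category.
prices = [0.004, 0.01, 0.02, 0.03, 0.04, 0.05, 0.06, 0.08, 0.10, 0.12]
for p in (0.004, 0.05, 0.12):
    print(p, price_tier(p, prices))
```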

Fine-Grained Parameters

Control routing with precise parameters. Everything is optional — use as much or as little control as you need.

task_type: text_generation, text_to_image, text_to_video, music_generation, etc.

content_mode: auto, generate, or edit. Controls whether to create new content or modify existing content.

capabilities: array of vision, reasoning, function_calling, web_search, realtime, lora, batch

provider: pin to a specific provider, such as openai, anthropic, google, or meta

budget_dollars: maximum estimated cost in USD for this request. The router excludes models above the cap.

maxCost: economy, standard, balanced, premium, or flagship. Filters by model pricing percentile within its category.
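Because every parameter is optional, a small helper can assemble the request body and leave out anything you did not set. The sketch below assumes the routing parameters sit at the top level of the body next to "model" and "messages", as in the capability examples on this page; `build_payload` itself is a hypothetical helper, not part of any SDK.

```python
from typing import Any, Optional

CONTENT_MODES = {"auto", "generate", "edit"}
PRICE_TIERS = {"economy", "standard", "balanced", "premium", "flagship"}


def build_payload(
    messages: list[dict[str, Any]],
    *,
    task_type: Optional[str] = None,
    content_mode: Optional[str] = None,
    capabilities: Optional[list[str]] = None,
    provider: Optional[str] = None,
    budget_dollars: Optional[float] = None,
    max_cost: Optional[str] = None,
) -> dict[str, Any]:
    """Assemble a request body, omitting every routing parameter left unset."""
    if content_mode is not None and content_mode not in CONTENT_MODES:
        raise ValueError(f"content_mode must be one of {sorted(CONTENT_MODES)}")
    if max_cost is not None and max_cost not in PRICE_TIERS:
        raise ValueError(f"maxCost must be one of {sorted(PRICE_TIERS)}")
    if budget_dollars is not None and budget_dollars <= 0:
        raise ValueError("budget_dollars must be positive")

    optional = {
        "task_type": task_type,
        "content_mode": content_mode,
        "capabilities": capabilities,
        "provider": provider,
        "budget_dollars": budget_dollars,
        "maxCost": max_cost,  # spelled as in the parameter list above
    }
    payload: dict[str, Any] = {"model": "router", "messages": messages}
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


request = build_payload(
    [{"role": "user", "content": "Summarize today's AI news."}],
    capabilities=["web_search"],
    budget_dollars=0.05,
)
print(request)
```

Validating the enumerated values locally catches typos before a request is sent; capability and provider names are passed through unchecked because their full lists are not given here.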

Intent Classification

GreatRouter reads your prompt and automatically detects the intent — chat, image generation, transcription, translation, and more.

Content Mode

generate creates new content from a prompt. edit modifies existing content. auto lets the router decide based on context.

Tools and Functions

Web Search

Models with web search capability can access real-time information from the internet. Route to models like GPT-4o with search for up-to-date answers.

{"model": "router",
 "capabilities": ["web_search"],
 "messages": [...]}

Function Calling

Send tool definitions and get structured JSON responses. Works with GPT-4o, Claude, Gemini, Mistral, and other models that support function calling.

{"model": "router",
 "capabilities": ["function_calling"],
 "tools": [...],
 "messages": [...]}
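As an illustration of what the "tools" array might contain, the sketch below defines one tool and dispatches a tool call to a local function. It assumes the widely used OpenAI-style tool schema and tool-call shape (a function name plus JSON-encoded arguments); this page does not confirm that exact format, and `get_weather` is a made-up stand-in function.

```python
import json


def get_weather(city: str) -> dict:
    """Stand-in local function the model may ask us to call."""
    return {"city": city, "forecast": "sunny"}


# Tool definition in the assumed OpenAI-style schema.
TOOLS = [{
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current forecast for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

payload = {
    "model": "router",
    "capabilities": ["function_calling"],
    "tools": TOOLS,
    "messages": [{"role": "user", "content": "What's the weather in Oslo?"}],
}

REGISTRY = {"get_weather": get_weather}


def run_tool_call(tool_call: dict) -> dict:
    """Execute one tool call; arguments arrive as a JSON-encoded string."""
    fn = REGISTRY[tool_call["function"]["name"]]
    return fn(**json.loads(tool_call["function"]["arguments"]))


# Simulated tool call in the assumed response shape (no network request is made).
example_call = {"function": {"name": "get_weather", "arguments": '{"city": "Oslo"}'}}
print(run_tool_call(example_call))
```

Looking functions up in an explicit registry means the model can only trigger code you have chosen to expose.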

Vision

Send images alongside text prompts. Route to multimodal models that can analyze, describe, and reason about visual content.

{"model": "router",
 "capabilities": ["vision"],
 "messages": [
   {"role": "user",
    "content": [
      {"type": "image_url",
       "image_url": {"url": "..."}},
      {"type": "text",
       "text": "Describe this"}]}]}
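For local files, one common approach is to embed the image as a base64 data URL in the "url" field. Whether this endpoint accepts data URLs as well as hosted URLs is an assumption to verify; `image_part` and `vision_payload` are hypothetical helpers that simply mirror the payload structure shown above.

```python
import base64


def image_part(data: bytes, mime_type: str = "image/png") -> dict:
    """Wrap raw image bytes as an image_url content part using a data URL."""
    encoded = base64.b64encode(data).decode("ascii")
    return {
        "type": "image_url",
        "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
    }


def vision_payload(data: bytes, question: str, mime_type: str = "image/png") -> dict:
    """Build a vision request with one image followed by one text prompt."""
    return {
        "model": "router",
        "capabilities": ["vision"],
        "messages": [{
            "role": "user",
            "content": [
                image_part(data, mime_type),
                {"type": "text", "text": question},
            ],
        }],
    }


# Placeholder bytes stand in for real file contents,
# e.g. pathlib.Path("photo.png").read_bytes().
body = vision_payload(b"abc", "Describe this")
print(body["messages"][0]["content"][0]["image_url"]["url"])
```

Base64 inflates the payload by roughly a third, so a hosted URL is usually the better choice for large images.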

Reasoning

Route to models optimized for complex reasoning — math, logic, analysis. Includes o1, DeepSeek R1, and other chain-of-thought models.

{"model": "router",
 "capabilities": ["reasoning"],
 "messages": [...]}

Capabilities

Editing: Modify existing images, video, and text with content_mode edit

Coding: Generate, review, and debug code across 39 code-capable models

Realtime: Streaming responses for chat and transcription with low time-to-first-token on supported models

LoRA: Fine-tuned image generation with custom LoRA adapters

Batch: Submit batches of requests for asynchronous processing

Vision: Image understanding, OCR, document analysis, and visual Q&A

Providers

OpenAI: GPT-5.4, Whisper Large V3 Turbo, GPT Image 2 API, TTS-1

Anthropic: Claude Fable 5

Google: Nano Banana, Veo 3, Gemini 3 Flash, Embeddinggemma 300m, Gemini 3.1 Flash TTS

Meta: M2m100 1.2b, Llama 3.3 70b Instruct Fp8 Fast, Llama 4 Scout 17B Vision VQA

Mistral: Mistral Small 3.1 24b Instruct, Mistral Small 3.1 24B Vision

DeepSeek: DeepSeek V4 Pro

xAI: Grok Imagine Image, Grok Imagine Video, Grok TTS, Grok 4.20 0309 Non Reasoning

Alibaba: Hh1 T2v, Wan 2.6 Image, Qwen3 Max, Happy Horse

Qwen: Qwq 32b, Qwen3 Embedding 0.6b

Microsoft: Mai Image 2.5 Text to Image, Mai Image 2.5, Resnet 50, Speecht5 Tts

NVIDIA: Nemotron 3 Nano Omni, Cosmos 3 Super Image to Video, Cosmos 3 Super, Canary 1b Flash

Stability AI: Stable Diffusion Xl Base 1.0

Black Forest Labs: Flux 1 Schnell

Runway: Gen 4.5, Stable Diffusion V1 5 Img2img

Suno: Suno V4.5 All, Bark Small

Deepgram: Nova 3, Aura 1

ByteDance: Seedance 2 Image to Video, Seedream 4.0, Seedance 2.0

Recraft: Recraftv4 1 Pro Vector

MiniMax: Speech 2.8 Hd, Music 2.6, MiniMax M3, Hailuo 2.3

PixVerse: V6

Vidu: Q3 Turbo

AssemblyAI: Universal 3 Pro

By task type

Image Generation: 221+ models, $0.004 – $0.12 per image

Image Editing: 563+ models, $0.004 – $0.08 per image

Image-to-Image: 364+ models, $0.004 – $0.10 per image

Video Generation: 320+ models, $0.05 – $0.50 per second

Image-to-Video: 183+ models, $0.08 – $0.60 per second

Music Generation: 73+ models, $0.08 – $0.20 per track

Text-to-Speech: 64+ models, $0.015 – $0.03 per 1K chars

Vision (Image-to-Text): 40 models, $0.02 – $0.08 per image

Background Removal: 364+ models, $0.002 – $0.01 per image

Style Transfer: 364+ models, $0.004 – $0.06 per image

Chat / Text: 171+ models, $0.10 – $15.00 / 1M tokens

Speech-to-Text: 20 models, $0.0043 – $0.036 per minute

Semantic Search: 18 models, $0.008 – $0.10 / 1M tokens

Classification: 8 models, $0.0000025 – $0.0001 / request

Translation: 32 models, $0.008 – $0.50 / 1M tokens
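To turn these ranges into a rough per-job estimate, multiply the number of billed units by the low and high ends of the range. The sketch below copies three ranges from the list above; the `estimate_cost` helper and the task keys are hypothetical, and the actual charge depends on which model the router selects.

```python
# (low, high) prices in USD per billed unit, copied from the list above.
PRICE_RANGES = {
    "video_generation": (0.05, 0.50),    # per second
    "image_generation": (0.004, 0.12),   # per image
    "speech_to_text": (0.0043, 0.036),   # per minute
}


def estimate_cost(task: str, units: float) -> tuple[float, float]:
    """Return the (lowest, highest) expected cost for a job of `units` size."""
    low, high = PRICE_RANGES[task]
    # Round to micro-dollars to hide floating-point noise.
    return round(low * units, 6), round(high * units, 6)


print(estimate_cost("video_generation", 8))    # (0.4, 4.0): an 8-second clip
print(estimate_cost("image_generation", 100))  # cost range for 100 images
```

The spread between the two ends is wide, which is where a budget_dollars cap or a price tier helps narrow the outcome.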