WorldRouter

Quickstart

WorldRouter is a unified OpenAI-compatible endpoint for top AI models. This page walks you through the four things you need to start using it: an API key, the base URL, a connection test, and the model IDs you can call.

Get an API key

  1. Open the API Keys dashboard.
  2. Sign in or create a WorldRouter account.
  3. Click Create API key, name it, and copy the generated key (format: sk-...).
  4. Add credits on the Credits page so the key can make paid calls.
Warning:

Keys are shown only once. Store them in a password manager or server-side secret store. Never commit them to source control or paste them into client-side code.
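
One way to follow the warning above is to keep the key out of the code entirely and read it from the environment at runtime. The sketch below assumes an environment variable named WORLDROUTER_API_KEY; that name is a hypothetical choice for illustration, not something WorldRouter prescribes.

```python
import os

# Hypothetical variable name chosen for this sketch; use whatever your
# secret store or deployment tooling exposes.
API_KEY_ENV_VAR = "WORLDROUTER_API_KEY"


def load_api_key(env=os.environ):
    """Return the API key from the environment, or raise if it is missing."""
    key = env.get(API_KEY_ENV_VAR, "").strip()
    if not key:
        raise RuntimeError(f"Set {API_KEY_ENV_VAR} before running this script.")
    return key
```

Passing the mapping as a parameter keeps the function easy to exercise without touching the real environment.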

Base URL

Every WorldRouter request goes to the same OpenAI-compatible endpoint. Use this value for the base_url / baseURL field in any SDK or the target URL for direct HTTP calls:

https://inference-api.worldrouter.ai/v1

The URL above is resolved for the environment you are signed in to, so you can copy and paste it without further editing.
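
For direct HTTP calls, the target URL is the base URL followed by the endpoint path. A minimal sketch of that join is shown here; the helper name endpoint_url is made up for this example.

```python
BASE_URL = "https://inference-api.worldrouter.ai/v1"


def endpoint_url(path, base_url=BASE_URL):
    """Join the base URL and an API path with exactly one slash between them."""
    return base_url.rstrip("/") + "/" + path.lstrip("/")


print(endpoint_url("/chat/completions"))
```

Normalizing the slashes avoids accidental double slashes when the base URL is copied with a trailing "/".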

Verify the connection

Run the snippet below, replacing your_api_key with your key. A successful call returns a chat completion JSON payload, which confirms that the key, the base URL, and your network connection all work:

curl https://inference-api.worldrouter.ai/v1/chat/completions \
  -H "Authorization: Bearer your_api_key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gpt-5.4",
    "messages": [{ "role": "user", "content": "Hello" }]
  }'
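
The same request can be sketched in Python using only the standard library. This is an illustrative equivalent of the curl call, not an official client; the function names build_chat_request and send are invented for this example.

```python
import json
import urllib.request

BASE_URL = "https://inference-api.worldrouter.ai/v1"


def build_chat_request(api_key, model, prompt):
    """Build (but do not send) the same POST request as the curl example."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        BASE_URL + "/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

To run the check, call send(build_chat_request(key, "gpt-5.4", "Hello")) with your real key. Note that urllib raises urllib.error.HTTPError for non-2xx responses.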

A successful response looks like the following. The choices[0].message.content field holds the model's reply, and the usage block shows the tokens billed against your credits:

200 OK
{
  "id": "chatcmpl-9f2abc123",
  "object": "chat.completion",
  "created": 1716240000,
  "model": "gpt-5.4",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "Hello! How can I help you today?"
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 9,
    "completion_tokens": 11,
    "total_tokens": 20
  }
}
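
As a sketch of how a script might read this payload, the example below pulls out the reply text and the token counts. It uses a trimmed copy of the sample response; the helper name summarize is hypothetical.

```python
import json

# The sample response body from above, trimmed to the fields read here.
SAMPLE_BODY = """
{
  "model": "gpt-5.4",
  "choices": [
    {
      "index": 0,
      "message": {"role": "assistant", "content": "Hello! How can I help you today?"},
      "finish_reason": "stop"
    }
  ],
  "usage": {"prompt_tokens": 9, "completion_tokens": 11, "total_tokens": 20}
}
"""


def summarize(body):
    """Return the reply text and the token counts from a chat completion body."""
    payload = json.loads(body)
    usage = payload["usage"]
    return {
        "reply": payload["choices"][0]["message"]["content"],
        "prompt_tokens": usage["prompt_tokens"],
        "completion_tokens": usage["completion_tokens"],
        "total_tokens": usage["total_tokens"],
    }


summary = summarize(SAMPLE_BODY)
print(summary["reply"])         # Hello! How can I help you today?
print(summary["total_tokens"])  # 20
```
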
Tip:

A 401 means the key is wrong or missing. A 404 on /chat/completions almost always means the base URL is missing the /v1 suffix; copy it from the block above.
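
The tip above can be turned into a small diagnostic helper. This is only a sketch covering the two status codes mentioned here; the function name diagnose and the message wording are assumptions, and other error codes are left to the API reference.

```python
def diagnose(status, base_url):
    """Map an HTTP status code to the troubleshooting hint from the tip above."""
    if status == 200:
        return "ok"
    if status == 401:
        return "check the API key: it is wrong or missing"
    # A 404 on /chat/completions usually points at a base URL without /v1.
    if status == 404 and not base_url.rstrip("/").endswith("/v1"):
        return "append /v1 to the base URL"
    return f"unexpected status {status}; see the API reference"


print(diagnose(404, "https://inference-api.worldrouter.ai"))
```

With urllib, the status of a failed call is available as the code attribute of the raised urllib.error.HTTPError.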

Supported models

You can use the model IDs below to test the connection: drop any one of them into the model field of the verification request above to check whether that model is currently callable on your account. The full catalog with live pricing lives on the Models page.

All prices are in Credits / 1M.

Vendor     Model ID                         Input      Output     Cache read   Cache write
Anthropic  claude-opus-4-7                  350        1,750      35           437.5
Anthropic  claude-opus-4-6                  350        1,750      35           437.5
Anthropic  claude-sonnet-4-6                210        1,050      21           262.5
Anthropic  claude-haiku-4-5                 70         350        7            87.5
OpenAI     gpt-5.5                          700        3,150      70           700
OpenAI     gpt-5.4                          350        1,575      35           350
OpenAI     gpt-5.4-mini                     52.5       315        5.25         52.5
OpenAI     gpt-oss-120b                     3.9        19         3.9          3.9
OpenAI     gpt-oss-20b                      3          14         3            3
Google     gemini-3.1-pro-preview           280        1,260      28           280
Google     gemini-3.1-flash-lite-preview    17.5       105        1.75         17.5
Google     gemma-4-31b-it                   13         38         13           13
Google     gemma-4-26b-a4b-it               13         40         13           13
Google     gemma-3-27b-it                   8          16         8            8
Google     gemma-3-12b-it                   4          13         4            4
Alibaba    qwen3.5-plus                     40.11      240.8      4.011        50.1375
Alibaba    qwen3.5-flash                    12.04      120.4      1.204        15.05
Alibaba    qwen3.6-plus                     77.07      462.14     7.707        96.3375
Alibaba    qwen3-coder-plus                 200.76     2,006.97   20.076       250.95
Alibaba    qwen3.5-flash-02-23              10         40         1            12.5
Alibaba    qwen3-235b-a22b-2507             10         60         10           10
Alibaba    qwen3.5-397b-a17b                39         234        19.5         39
Alibaba    qwen3.5-9b                       10         15         10           10
Alibaba    qwen3.5-35b-a3b                  25         200        25           25
Alibaba    qwen3.5-27b                      30         240        30           30
Alibaba    qwen3.5-plus-02-15               50         300        5            62.5
Alibaba    qwen3-vl-235b-a22b-instruct      20         88         11           20
Alibaba    qwen3-coder-next                 14         80         9            14
Alibaba    qwen3-next-80b-a3b-instruct      9          110        9            9
Alibaba    qwen3-coder                      22         180        22           22
Alibaba    qwen3-30b-a3b-instruct-2507      9          30         9            9
Alibaba    qwen3-32b                        8          24         4            8
Moonshot   kimi-k2.6                        66.5       280        11.2         66.5
Moonshot   kimi-k2.5                        42         210        7            42
MiniMax    MiniMax-M2.7                     21         84         4.2          26.25
MiniMax    MiniMax-M2.5                     21         84         4.2          26.25
Zhipu      glm-5.1                          112        352        20.8         112
Zhipu      glm-5                            70         224        14           70
Zhipu      glm-5-turbo                      120        400        24           120
Zhipu      glm-4.5-air                      13         85         2.5          13
Zhipu      glm-4.7                          40         175        8            40
Zhipu      glm-4.6                          39         190        39           39
Zhipu      glm-4.7-flash                    6          40         1            6
DeepSeek   deepseek-v4-pro                  121.8      243.6      10.15        121.8
DeepSeek   deepseek-v4-flash                9.8        19.6       1.96         9.8
DeepSeek   deepseek-v3.2                    19.6       29.4       1.96         19.6
DeepSeek   deepseek-chat-v3-0324            20         77         13.5         20
DeepSeek   deepseek-chat-v3.1               15         75         15           15
DeepSeek   deepseek-v3.1-terminus           21         79         13           21
DeepSeek   deepseek-v3.2-exp                27         41         27           27
xAI        grok-4.1-fast                    20         50         5            20
xAI        grok-4-fast                      20         50         5            20
xAI        grok-4.20                        400        1,200      40           400
StepFun    step-3.5-flash                   10         30         10           10
Xiaomi     mimo-v2-pro                      200        600        40           200
Xiaomi     mimo-v2-flash                    9          29         4.5          9
Xiaomi     mimo-v2.5-pro                    200        600        40           200
Xiaomi     mimo-v2-omni                     40         200        8            40
Xiaomi     mimo-v2.5                        80         400        16           80
Mistral    mistral-nemo                     2          4          2            2
Mistral    mistral-small-3.2-24b-instruct   7.5        20         7.5          7.5
Meta       llama-3.1-8b-instruct            2          5          2            2
Meta       llama-3.1-70b-instruct           40         40         40           40
Meta       llama-4-maverick                 15         60         15           15
Meta       llama-3.3-70b-instruct           10         32         10           10
NVIDIA     nemotron-3-nano-30b-a3b          5          20         5            5
NVIDIA     nemotron-3-super-120b-a12b       9          45         9            9
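
To make the pricing columns concrete, here is a rough estimate of what the sample verification call might cost. It assumes "1M" means one million tokens, that billing is linear in token count, and that no cache pricing applies; the prices are the gpt-5.4 input and output values read from the table above. Treat the result as an approximation rather than a billing statement.

```python
# Credits per 1M tokens for gpt-5.4, as read from the table above.
GPT_5_4_INPUT_PRICE = 350
GPT_5_4_OUTPUT_PRICE = 1575


def estimate_credits(prompt_tokens, completion_tokens, input_price, output_price):
    """Estimate credits for one call, assuming linear per-token pricing."""
    total = prompt_tokens * input_price + completion_tokens * output_price
    return total / 1_000_000


# Token counts from the sample response: 9 prompt tokens, 11 completion tokens.
cost = estimate_credits(9, 11, GPT_5_4_INPUT_PRICE, GPT_5_4_OUTPUT_PRICE)
print(cost)
```

Under these assumptions the sample call works out to roughly 0.02 credits, with the output tokens accounting for most of it.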

Supported tools

WorldRouter works with any client that accepts a custom OpenAI-compatible base URL. Pick the integration path that matches how you use AI, and follow its setup guide:

  • CLI agents: terminal agents including Codex, Claude Code, and OpenCode.
  • VS Code extensions: the first-party Codex plugin and other extensions that read a custom OpenAI endpoint.
  • Direct API: any application, SDK, or script that speaks the OpenAI /chat/completions protocol. See the API reference for full details.
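
As an example of the direct API path, a short script could repeat the verification request for several model IDs to see which ones are callable on an account. The sketch below uses only the standard library; the names http_status and probe_models are invented here, and treating only HTTP 200 as "callable" is an assumption.

```python
import json
import urllib.error
import urllib.request

BASE_URL = "https://inference-api.worldrouter.ai/v1"


def http_status(api_key, model):
    """POST a one-message chat completion and return the HTTP status code."""
    body = {"model": model, "messages": [{"role": "user", "content": "Hello"}]}
    request = urllib.request.Request(
        BASE_URL + "/chat/completions",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    try:
        with urllib.request.urlopen(request, timeout=30) as response:
            return response.status
    except urllib.error.HTTPError as error:
        # urllib raises for non-2xx responses; the code is still useful.
        return error.code


def probe_models(api_key, model_ids, get_status=http_status):
    """Return {model_id: bool}, where True means the call returned HTTP 200."""
    return {model: get_status(api_key, model) == 200 for model in model_ids}
```

Each probe is a paid call, so keep the list of model IDs short. The get_status parameter exists so the loop can be exercised with a stub instead of real network traffic.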

What’s next

  • Base URL for this environment: https://inference-api.worldrouter.ai/v1.
  • Manage keys and view usage on the API Keys dashboard.
  • Browse the full Models catalog with live pricing.
  • Read the API reference for authentication, error codes, and privacy.