Model pricing

When you use Octavus-managed API keys, the provider cost of each request is passed through at the rates below. Prices are synced automatically from provider APIs. With BYOK (Bring Your Own Keys), Octavus does not charge provider costs.
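As a rough illustration of how per-1M-token rates translate into the cost of a single request, the sketch below uses the GPT-4.1 row from the OpenAI table ($2.00 input, $8.00 output, $0.50 cache read). The function name and parameters are hypothetical, and the assumption that cached input tokens are billed at the cache-read rate instead of the input rate is not confirmed by this page.

```python
from decimal import Decimal

PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens, output_tokens, cached_tokens=0, *,
                 input_rate, output_rate, cache_read_rate=None):
    """Estimate the provider cost of one request in USD.

    Rates are USD per 1M tokens, passed as strings to avoid float rounding.
    Assumption (not confirmed by the pricing page): cached input tokens are
    billed at the cache-read rate, all other input tokens at the input rate.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")

    in_rate = Decimal(input_rate)
    # Fall back to the normal input rate when the table shows "-" for cache reads.
    cache_rate = in_rate if cache_read_rate is None else Decimal(cache_read_rate)

    uncached_tokens = input_tokens - cached_tokens
    total = (uncached_tokens * in_rate
             + cached_tokens * cache_rate
             + output_tokens * Decimal(output_rate))
    return total / PER_MILLION


# GPT-4.1 rates from the table: 10K input tokens (4K of them cached), 2K output.
cost = request_cost(10_000, 2_000, cached_tokens=4_000,
                    input_rate="2.00", output_rate="8.00",
                    cache_read_rate="0.50")
print(f"${cost}")
```

`Decimal` is used here so that small per-token amounts do not accumulate floating-point error when many requests are summed.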

OpenAI(72 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
computer-use-preview	$3.00	$12.00	-	-	-
OpenAI: GPT-3.5 Turbo	$0.50	$1.50	-	-	16K
OpenAI: GPT-3.5 Turbo (older v0613)	$1.00	$2.00	-	-	4K
OpenAI: GPT-3.5 Turbo 16k	$3.00	$4.00	-	-	16K
OpenAI: GPT-3.5 Turbo Instruct	$1.50	$2.00	-	-	4K
OpenAI: GPT-4	$30.00	$60.00	-	-	8K
OpenAI: GPT-4 Turbo	$10.00	$30.00	-	-	128K
OpenAI: GPT-4 Turbo Preview	$10.00	$30.00	-	-	128K
OpenAI: GPT-4.1	$2.00	$8.00	$0.50	-	1M
OpenAI: GPT-4.1 Mini	$0.40	$1.60	$0.10	-	1M
OpenAI: GPT-4.1 Nano	$0.10	$0.40	$0.025	-	1M
OpenAI: GPT-4o	$2.50	$10.00	$1.25	-	128K
OpenAI: GPT-4o (2024-05-13)	$5.00	$15.00	-	-	128K
OpenAI: GPT-4o (2024-08-06)	$2.50	$10.00	$1.25	-	128K
OpenAI: GPT-4o (2024-11-20)	$2.50	$10.00	$1.25	-	128K
OpenAI: GPT-4o-mini	$0.15	$0.60	$0.075	-	128K
OpenAI: GPT-4o-mini (2024-07-18)	$0.15	$0.60	$0.075	-	128K
OpenAI: GPT-4o-mini Search Preview	$0.15	$0.60	-	-	128K
OpenAI: GPT-4o Search Preview	$2.50	$10.00	-	-	128K
OpenAI: GPT-5	$1.25	$10.00	$0.125	-	400K
OpenAI: GPT-5 Chat	$1.25	$10.00	$0.125	-	128K
OpenAI: GPT-5 Codex	$1.25	$10.00	$0.125	-	400K
OpenAI: GPT-5 Image	$10.00	$10.00	$1.25	-	400K
OpenAI: GPT-5 Image Mini	$2.50	$2.00	$0.25	-	400K
OpenAI: GPT-5 Mini	$0.25	$2.00	$0.025	-	400K
OpenAI: GPT-5 Nano	$0.05	$0.40	$0.005	-	400K
OpenAI: GPT-5 Pro	$15.00	$120.00	-	-	400K
OpenAI: GPT-5.1	$1.25	$10.00	$0.125	-	400K
OpenAI: GPT-5.1 Chat	$1.25	$10.00	$0.125	-	128K
OpenAI: GPT-5.1-Codex	$1.25	$10.00	$0.125	-	400K
OpenAI: GPT-5.1-Codex-Max	$1.25	$10.00	$0.125	-	400K
OpenAI: GPT-5.1-Codex-Mini	$0.25	$2.00	$0.025	-	400K
OpenAI: GPT-5.2	$1.75	$14.00	$0.175	-	400K
OpenAI: GPT-5.2 Chat	$1.75	$14.00	$0.175	-	128K
OpenAI: GPT-5.2-Codex	$1.75	$14.00	$0.175	-	400K
OpenAI: GPT-5.2 Pro	$21.00	$168.00	-	-	400K
OpenAI: GPT-5.3 Chat	$1.75	$14.00	$0.175	-	128K
OpenAI: GPT-5.3-Codex	$1.75	$14.00	$0.175	-	400K
OpenAI: GPT-5.4	$2.50 ($5.00 >272K)	$15.00 ($22.50 >272K)	$0.25 ($0.50 >272K)	-	1M
OpenAI: GPT-5.4 Image 2	$8.00	$15.00	$2.00	-	272K
OpenAI: GPT-5.4 Mini	$0.75	$4.50	$0.075	-	400K
OpenAI: GPT-5.4 Nano	$0.20	$1.25	$0.02	-	400K
OpenAI: GPT-5.4 Pro	$30.00 ($60.00 >272K)	$180.00 ($270.00 >272K)	-	-	1M
OpenAI: GPT-5.5	$5.00 ($10.00 >272K)	$30.00 ($45.00 >272K)	$0.50 ($1.00 >272K)	-	1M
OpenAI: GPT-5.5 Pro	$30.00 ($60.00 >272K)	$180.00 ($270.00 >272K)	-	-	1M
gpt-5.6	$5.00 ($10.00 >272K)	$30.00 ($45.00 >272K)	$0.50 ($1.00 >272K)	-	-
OpenAI: GPT-5.6 Luna	$1.00 ($2.00 >272K)	$6.00 ($9.00 >272K)	$0.10 ($0.20 >272K)	-	1M
OpenAI: GPT-5.6 Luna Pro	$1.00	$6.00	$0.10	-	1M
OpenAI: GPT-5.6 Sol	$5.00 ($10.00 >272K)	$30.00 ($45.00 >272K)	$0.50 ($1.00 >272K)	-	1M
OpenAI: GPT-5.6 Sol Pro	$5.00	$30.00	$0.50	-	1M
OpenAI: GPT-5.6 Terra	$2.50 ($5.00 >272K)	$15.00 ($22.50 >272K)	$0.25 ($0.50 >272K)	-	1M
OpenAI: GPT-5.6 Terra Pro	$2.50	$15.00	$0.25	-	1M
OpenAI: GPT Audio	$2.50	$10.00	-	-	128K
OpenAI: GPT Audio Mini	$0.60	$2.40	-	-	128K
OpenAI: GPT Chat Latest	$5.00	$30.00	$0.50	-	400K
gpt-image-1	$5.00	$40.00	$1.25	-	-
gpt-image-1-mini	$2.50	$8.00	$0.25	-	-
gpt-image-1.5	$5.00	$10.00	$1.25	-	-
OpenAI: gpt-oss-120b	$0.037	$0.17	-	-	131K
OpenAI: gpt-oss-20b	$0.03	$0.13	$0.03	-	131K
OpenAI: gpt-oss-safeguard-20b	$0.075	$0.30	$0.0375	-	131K
OpenAI: o1	$15.00	$60.00	$7.50	-	200K
o1-mini	$1.10	$4.40	$0.55	-	-
OpenAI: o1-pro	$150.00	$600.00	-	-	200K
OpenAI: o3	$2.00	$8.00	$0.50	-	200K
OpenAI: o3 Deep Research	$10.00	$40.00	$2.50	-	200K
OpenAI: o3 Mini	$1.10	$4.40	$0.55	-	200K
OpenAI: o3 Mini High	$1.10	$4.40	$0.55	-	200K
OpenAI: o3 Pro	$20.00	$80.00	-	-	200K
OpenAI: o4 Mini	$1.10	$4.40	$0.275	-	200K
OpenAI: o4 Mini Deep Research	$2.00	$8.00	$0.50	-	200K
OpenAI: o4 Mini High	$1.10	$4.40	$0.275	-	200K
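Some rows above, such as OpenAI: GPT-5.4, list two prices per column with a ">272K" threshold. The sketch below shows one way such tiered rates might be applied. It rests on two assumptions that this page does not confirm: that "272K" means 272,000 prompt tokens, and that once a prompt exceeds the threshold the higher rate applies to every token in the request, not only to the excess. The class and function names are hypothetical.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass(frozen=True)
class TieredRate:
    """A per-1M-token rate with a higher long-context tier."""
    base: Decimal
    long_context: Decimal

    def pick(self, prompt_tokens: int, threshold: int) -> Decimal:
        # Assumption: the tier is chosen by prompt size alone.
        return self.long_context if prompt_tokens > threshold else self.base


# GPT-5.4 row from the table; threshold assumed to be 272,000 tokens.
THRESHOLD = 272_000
INPUT = TieredRate(Decimal("2.50"), Decimal("5.00"))
OUTPUT = TieredRate(Decimal("15.00"), Decimal("22.50"))


def tiered_cost(prompt_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a request under the whole-request tier assumption."""
    in_rate = INPUT.pick(prompt_tokens, THRESHOLD)
    out_rate = OUTPUT.pick(prompt_tokens, THRESHOLD)
    total = prompt_tokens * in_rate + output_tokens * out_rate
    return total / Decimal(1_000_000)


short_prompt = tiered_cost(100_000, 1_000)  # below the threshold: base rates
long_prompt = tiered_cost(300_000, 1_000)   # above the threshold: long-context rates
print(short_prompt, long_prompt)
```

If the provider instead bills only the tokens beyond the threshold at the higher rate, `tiered_cost` would need to split the prompt into two segments; check the provider's own pricing documentation before relying on either interpretation.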

Anthropic(15 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Anthropic: Claude 3 Haiku	$0.25	$1.25	$0.03	-	200K
Anthropic: Claude Fable 5	$10.00	$50.00	$1.00	-	1M
Anthropic: Claude Haiku 4.5	$1.00	$5.00	$0.10	-	200K
Anthropic: Claude Opus 4	$15.00	$75.00	$1.50	-	200K
Anthropic: Claude Opus 4.1	$15.00	$75.00	$1.50	-	200K
Anthropic: Claude Opus 4.5	$5.00	$25.00	$0.50	-	200K
Anthropic: Claude Opus 4.6	$5.00	$25.00	$0.50	-	1M
Anthropic: Claude Opus 4.7	$5.00	$25.00	$0.50	-	1M
Anthropic: Claude Opus 4.8	$5.00	$25.00	$0.50	-	1M
Anthropic: Claude Opus 4.7 (Fast)	$30.00	$150.00	$3.00	-	1M
Anthropic: Claude Opus 4.8 (Fast)	$10.00	$50.00	$1.00	-	1M
Anthropic: Claude Sonnet 4	$3.00	$15.00	$0.30	-	1M
Anthropic: Claude Sonnet 4.5	$3.00	$15.00	$0.30	-	1M
Anthropic: Claude Sonnet 4.6	$3.00	$15.00	$0.30	-	1M
Anthropic: Claude Sonnet 5	$3.00	$15.00	$0.30	-	1M

Google(27 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
gemini-2.0-flash	$0.10	$0.40	$0.025	-	-
gemini-2.0-flash-lite	$0.075	$0.30	-	-	-
Google: Gemini 2.5 Flash	$0.30	$2.50	$0.03	$2.50	1M
Google: Nano Banana (Gemini 2.5 Flash Image)	$0.30	$30.00	$0.03	$2.50	33K
Google: Gemini 2.5 Flash Lite	$0.10	$0.40	$0.01	$0.40	1M
Google: Gemini 2.5 Pro	$1.25 ($2.50 >200K)	$10.00 ($15.00 >200K)	$0.125 ($0.25 >200K)	$10.00	1M
Google: Gemini 2.5 Pro Preview 06-05	$1.25	$10.00	$0.125	$10.00	1M
Google: Gemini 2.5 Pro Preview 05-06	$1.25	$10.00	$0.125	$10.00	1M
Google: Gemini 3 Flash Preview	$0.50	$3.00	$0.05	$3.00	1M
Google: Nano Banana Pro (Gemini 3 Pro Image)	$2.00	$12.00	$0.20	$12.00	66K
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)	$2.00	$12.00	$0.20	$12.00	66K
Google: Nano Banana 2 (Gemini 3.1 Flash Image)	$0.50	$60.00	-	-	131K
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)	$0.50	$3.00	-	-	131K
Google: Gemini 3.1 Flash Lite	$0.25	$1.50	$0.025	$1.50	1M
Google: Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)	$0.25	$1.50	-	-	66K
Google: Gemini 3.1 Flash Lite Preview	$0.25	$1.50	$0.025	$1.50	1M
Google: Gemini 3.1 Pro Preview	$2.00 ($4.00 >200K)	$12.00 ($18.00 >200K)	$0.20 ($0.40 >200K)	$12.00	1M
Google: Gemini 3.1 Pro Preview Custom Tools	$2.00	$12.00	$0.20	$12.00	1M
Google: Gemini 3.5 Flash	$1.50	$9.00	$0.15	$9.00	1M
Google: Gemma 2 27B	$0.65	$0.65	-	-	8K
Google: Gemma 3 12B	$0.05	$0.15	-	-	131K
Google: Gemma 3 27B	$0.08	$0.45	$0.04	-	131K
Google: Gemma 3 4B	$0.05	$0.10	-	-	131K
Google: Gemma 3n 4B	$0.06	$0.12	-	-	33K
Google: Gemma 4 26B A4B	$0.10	$0.30	-	-	262K
Google: Gemma 4 31B	$0.22	$0.55	$0.12	-	262K
imagen-4.0-generate-001	$0.04	$0.04	-	-	-

~anthropic(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Anthropic: Claude Fable Latest	$10.00	$50.00	$1.00	-	1M
Anthropic Claude Haiku Latest	$1.00	$5.00	$0.10	-	200K
Anthropic: Claude Opus Latest	$5.00	$25.00	$0.50	-	1M
Anthropic Claude Sonnet Latest	$2.00	$10.00	$0.20	-	1M

~google(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Google Gemini Flash Latest	$1.50	$9.00	$0.15	$9.00	1M
Google Gemini Pro Latest	$2.00	$12.00	$0.20	$12.00	1M

~moonshotai(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
MoonshotAI Kimi Latest	$0.66	$3.41	$0.15	-	262K

~openai(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
OpenAI GPT Latest	$5.00	$30.00	$0.50	-	1M
OpenAI GPT Mini Latest	$0.75	$4.50	$0.075	-	400K

~x-ai(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
xAI: Grok Latest	$2.00	$6.00	$0.50	-	500K

ai21(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
AI21: Jamba Large 1.7	$2.00	$8.00	-	-	256K

aion-labs(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
AionLabs: Aion-2.0	$0.80	$1.60	$0.20	-	131K
AionLabs: Aion-3.0	$3.00	$6.00	$0.75	-	131K
AionLabs: Aion-3.0-Mini	$0.70	$1.40	$0.18	-	131K
AionLabs: Aion-RP 1.0 (8B)	$0.80	$1.60	-	-	33K

allenai(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
AllenAI: Olmo 3 32B Think	$0.15	$0.50	-	-	66K

Amazon(5 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Amazon: Nova 2 Lite	$0.30	$2.50	-	-	1M
Amazon: Nova Lite 1.0	$0.06	$0.24	-	-	300K
Amazon: Nova Micro 1.0	$0.035	$0.14	-	-	128K
Amazon: Nova Premier 1.0	$2.50	$12.50	$0.625	-	1M
Amazon: Nova Pro 1.0	$0.80	$3.20	-	-	300K

anthracite-org(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Magnum v4 72B	$3.00	$5.00	-	-	33K

arcee-ai(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Arcee AI: Trinity Large Thinking	$0.25	$0.80	$0.06	-	262K
Arcee AI: Virtuoso Large	$0.75	$1.20	-	-	131K

baidu(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Baidu: ERNIE 4.5 VL 424B A47B	$0.42	$1.25	-	-	131K

bytedance(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
ByteDance: UI-TARS 7B	$0.10	$0.20	$0.10	-	128K

bytedance-seed(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
ByteDance Seed: Seed 1.6	$0.25	$2.00	-	-	262K
ByteDance Seed: Seed 1.6 Flash	$0.075	$0.30	-	-	262K
ByteDance Seed: Seed-2.0-Lite	$0.25	$2.00	-	-	262K
ByteDance Seed: Seed-2.0-Mini	$0.10	$0.40	-	-	262K

cognitivecomputations(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Venice: Uncensored	$0.20	$0.90	-	-	128K

Cohere(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Cohere: Command A	$2.50	$10.00	-	-	256K
Cohere: Command R (08-2024)	$0.15	$0.60	-	-	128K
Cohere: Command R+ (08-2024)	$2.50	$10.00	-	-	128K
Cohere: Command R7B (12-2024)	$0.0375	$0.15	-	-	128K

deepcogito(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Deep Cogito: Cogito v2.1 671B	$1.25	$1.25	-	-	128K

DeepSeek(11 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
DeepSeek: DeepSeek V3	$0.2002	$0.8001	-	-	131K
DeepSeek: DeepSeek V3 0324	$0.27	$1.12	$0.135	-	164K
DeepSeek: DeepSeek V3.1	$0.25	$0.95	$0.13	-	164K
DeepSeek: R1	$0.70	$2.50	-	-	164K
DeepSeek: R1 0528	$0.50	$2.15	$0.35	-	164K
DeepSeek: R1 Distill Llama 70B	$0.80	$0.80	-	-	128K
DeepSeek: DeepSeek V3.1 Terminus	$0.27	$1.00	$0.135	-	131K
DeepSeek: DeepSeek V3.2	$0.269	$0.40	$0.1345	-	164K
DeepSeek: DeepSeek V3.2 Exp	$0.27	$0.41	-	-	164K
DeepSeek: DeepSeek V4 Flash	$0.098	$0.196	$0.02	-	1M
DeepSeek: DeepSeek V4 Pro	$0.435	$0.87	$0.0036	-	1M

gryphe(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
MythoMax 13B	$0.06	$0.06	-	-	4K

ibm-granite(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
IBM: Granite 4.0 Micro	$0.017	$0.112	-	-	131K
IBM: Granite 4.1 8B	$0.05	$0.10	$0.05	-	131K

inception(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Inception: Mercury 2	$0.25	$0.75	$0.025	-	128K

inclusionai(3 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
inclusionAI: Ling-2.6-1T	$0.075	$0.625	$0.015	-	262K
inclusionAI: Ling-2.6-flash	$0.01	$0.03	$0.002	-	262K
inclusionAI: Ring-2.6-1T	$0.075	$0.625	$0.015	-	262K

inflection(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Inflection: Inflection 3 Pi	$2.50	$10.00	-	-	8K
Inflection: Inflection 3 Productivity	$2.50	$10.00	-	-	8K

kwaipilot(3 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Kwaipilot: KAT-Coder-Air V2.5	$0.15	$0.60	$0.03	-	256K
Kwaipilot: KAT-Coder-Pro V2	$0.30	$1.20	$0.06	-	256K
Kwaipilot: KAT-Coder-Pro V2.5	$0.74	$2.96	$0.15	-	256K

mancer(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Mancer: Weaver (alpha)	$0.75	$1.00	-	-	8K

Meta(9 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Meta: Llama 3.1 70B Instruct	$0.40	$0.40	-	-	131K
Meta: Llama 3.1 8B Instruct	$0.05	$0.08	$0.025	-	131K
Meta: Llama 3.2 11B Vision Instruct	$0.345	$0.345	-	-	131K
Meta: Llama 3.2 1B Instruct	$0.027	$0.201	-	-	131K
Meta: Llama 3.2 3B Instruct	$0.05	$0.33	-	-	131K
Meta: Llama 3.3 70B Instruct	$0.13	$0.40	-	-	131K
Meta: Llama 4 Maverick	$0.20	$0.80	-	-	1M
Meta: Llama 4 Scout	$0.10	$0.30	-	-	10M
Meta: Llama Guard 4 12B	$0.18	$0.18	-	-	164K

microsoft(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Microsoft: Phi 4	$0.07	$0.14	-	-	16K
WizardLM-2 8x22B	$0.62	$0.62	-	-	66K

MiniMax(8 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
MiniMax: MiniMax-01	$0.20	$1.10	-	-	1M
MiniMax: MiniMax M1	$0.55	$2.20	-	-	1M
MiniMax: MiniMax M2	$0.255	$1.02	-	-	205K
MiniMax: MiniMax M2-her	$0.30	$1.20	$0.03	-	66K
MiniMax: MiniMax M2.1	$0.30	$1.20	$0.03	-	205K
MiniMax: MiniMax M2.5	$0.15	$0.90	$0.05	-	205K
MiniMax: MiniMax M2.7	$0.30	$1.20	$0.06	-	205K
MiniMax: MiniMax M3	$0.30	$1.20	$0.06	-	1M

Mistral(19 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Mistral: Codestral 2508	$0.30	$0.90	$0.03	-	256K
Mistral: Devstral 2 2512	$0.40	$2.00	$0.04	-	262K
Mistral: Ministral 3 14B 2512	$0.20	$0.20	$0.02	-	262K
Mistral: Ministral 3 3B 2512	$0.10	$0.10	$0.01	-	131K
Mistral: Ministral 3 8B 2512	$0.15	$0.15	$0.015	-	262K
Mistral Large	$2.00	$6.00	$0.20	-	128K
Mistral Large 2407	$2.00	$6.00	$0.20	-	131K
Mistral: Mistral Large 3 2512	$0.50	$1.50	$0.05	-	262K
Mistral: Mistral Medium 3	$0.40	$2.00	$0.04	-	131K
Mistral: Mistral Medium 3.5	$1.50	$7.50	-	-	262K
Mistral: Mistral Medium 3.1	$0.40	$2.00	$0.04	-	131K
Mistral: Mistral Nemo	$0.02	$0.04	-	-	131K
Mistral: Saba	$0.20	$0.60	$0.02	-	33K
Mistral: Mistral Small 3	$0.05	$0.08	-	-	33K
Mistral: Mistral Small 4	$0.15	$0.60	$0.015	-	262K
Mistral: Mistral Small 3.1 24B	$0.351	$0.555	-	-	128K
Mistral: Mistral Small 3.2 24B	$0.10	$0.30	$0.01	-	131K
Mistral: Mixtral 8x22B Instruct	$2.00	$6.00	$0.20	-	66K
Mistral: Voxtral Small 24B 2507	$0.10	$0.30	$0.01	-	32K

Moonshot(6 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
MoonshotAI: Kimi K2 0711	$0.57	$2.30	-	-	131K
MoonshotAI: Kimi K2 0905	$0.60	$2.50	-	-	262K
MoonshotAI: Kimi K2 Thinking	$0.60	$2.50	-	-	262K
MoonshotAI: Kimi K2.5	$0.57	$2.85	$0.095	-	262K
MoonshotAI: Kimi K2.6	$0.66	$3.41	$0.144	-	262K
MoonshotAI: Kimi K2.7 Code	$0.719	$3.49	$0.149	-	262K

morph(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Morph: Morph V3 Fast	$0.80	$1.20	-	-	82K
Morph: Morph V3 Large	$0.90	$1.90	-	-	262K

nex-agi(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Nex AGI: Nex-N2-Mini	$0.025	$0.10	$0.0025	-	262K
Nex AGI: Nex-N2-Pro	$0.25	$1.00	$0.025	-	262K

nousresearch(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Nous: Hermes 3 405B Instruct	$1.00	$1.00	-	-	131K
Nous: Hermes 3 70B Instruct	$0.70	$0.70	-	-	131K
Nous: Hermes 4 405B	$1.00	$3.00	-	-	131K
Nous: Hermes 4 70B	$0.13	$0.40	-	-	131K

NVIDIA(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5	$0.40	$0.40	-	-	131K
NVIDIA: Nemotron 3 Nano 30B A3B	$0.05	$0.20	-	-	262K
NVIDIA: Nemotron 3 Super	$0.21	$0.455	$0.06	-	1M
NVIDIA: Nemotron 3 Ultra	$0.60	$3.60	$0.20	-	1M

openrouter(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Auto Router	Varies	Varies	-	-	2M
Body Builder (beta)	Varies	Varies	-	-	128K
OpenRouter: Fusion	Varies	Varies	-	-	1M
Pareto Code Router	Varies	Varies	-	-	2M

perceptron(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Perceptron: Perceptron Mk1	$0.15	$1.50	-	-	33K

perplexity(5 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Perplexity: Sonar	$1.00	$1.00	-	-	127K
Perplexity: Sonar Deep Research	$2.00	$8.00	-	$3.00	128K
Perplexity: Sonar Pro	$3.00	$15.00	-	-	200K
Perplexity: Sonar Pro Search	$3.00	$15.00	-	-	200K
Perplexity: Sonar Reasoning Pro	$2.00	$8.00	-	-	128K

poolside(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Poolside: Laguna M.1	$0.20	$0.40	$0.10	-	262K
Poolside: Laguna XS 2.1	$0.06	$0.12	$0.03	-	262K

Qwen(47 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Qwen2.5 72B Instruct	$0.36	$0.40	-	-	131K
Qwen: Qwen2.5 7B Instruct	$0.04	$0.10	-	-	131K
Qwen2.5 Coder 32B Instruct	$0.66	$1.00	-	-	128K
Qwen: Qwen-Plus	$0.26	$0.78	$0.052	-	1M
Qwen: Qwen Plus 0728	$0.26	$0.78	-	-	1M
Qwen: Qwen Plus 0728 (thinking)	$0.26	$0.78	-	-	1M
Qwen: Qwen2.5 VL 72B Instruct	$0.80	$1.00	$0.40	-	131K
Qwen: Qwen3 14B	$0.12	$0.24	-	-	132K
Qwen: Qwen3 235B A22B	$0.455	$1.82	-	-	131K
Qwen: Qwen3 235B A22B Instruct 2507	$0.09	$0.55	-	-	262K
Qwen: Qwen3 235B A22B Thinking 2507	$0.1495	$1.50	-	-	262K
Qwen: Qwen3 30B A3B	$0.12	$0.50	-	-	131K
Qwen: Qwen3 30B A3B Instruct 2507	$0.10	$0.30	-	-	262K
Qwen: Qwen3 30B A3B Thinking 2507	$0.13	$1.56	-	-	131K
Qwen: Qwen3 32B	$0.08	$0.28	-	-	131K
Qwen: Qwen3 8B	$0.117	$0.455	-	-	131K
Qwen: Qwen3 Coder 480B A35B	$0.30	$1.00	$0.10	-	1M
Qwen: Qwen3 Coder 30B A3B Instruct	$0.07	$0.27	-	-	160K
Qwen: Qwen3 Coder Flash	$0.195	$0.975	$0.039	-	1M
Qwen: Qwen3 Coder Next	$0.11	$0.80	$0.07	-	262K
Qwen: Qwen3 Coder Plus	$0.65	$3.25	$0.13	-	1M
Qwen: Qwen3 Max	$0.78	$3.90	$0.156	-	262K
Qwen: Qwen3 Max Thinking	$0.78	$3.90	-	-	262K
Qwen: Qwen3 Next 80B A3B Instruct	$0.10	$1.10	$0.07	-	262K
Qwen: Qwen3 Next 80B A3B Thinking	$0.0975	$0.78	-	-	262K
Qwen: Qwen3 VL 235B A22B Instruct	$0.21	$1.90	$0.10	-	131K
Qwen: Qwen3 VL 235B A22B Thinking	$0.26	$2.60	-	-	131K
Qwen: Qwen3 VL 30B A3B Instruct	$0.13	$0.52	-	-	262K
Qwen: Qwen3 VL 30B A3B Thinking	$0.13	$1.56	-	-	131K
Qwen: Qwen3 VL 32B Instruct	$0.104	$0.416	-	-	262K
Qwen: Qwen3 VL 8B Instruct	$0.117	$0.455	-	-	256K
Qwen: Qwen3 VL 8B Thinking	$0.117	$1.36	-	-	256K
Qwen: Qwen3.5-122B-A10B	$0.26	$2.08	-	-	262K
Qwen: Qwen3.5-27B	$0.195	$1.56	-	-	262K
Qwen: Qwen3.5-35B-A3B	$0.14	$1.00	-	-	262K
Qwen: Qwen3.5 397B A17B	$0.45	$3.00	$0.225	-	262K
Qwen: Qwen3.5-9B	$0.10	$0.15	-	-	262K
Qwen: Qwen3.5-Flash	$0.065	$0.26	-	-	1M
Qwen: Qwen3.5 Plus 2026-02-15	$0.26	$1.56	-	-	1M
Qwen: Qwen3.5 Plus 2026-04-20	$0.30	$1.80	-	-	1M
Qwen: Qwen3.6 27B	$0.45	$2.70	-	-	262K
Qwen: Qwen3.6 35B A3B	$0.14	$1.00	-	-	262K
Qwen: Qwen3.6 Flash	$0.1875	$1.13	-	-	1M
Qwen: Qwen3.6 Max Preview	$1.04	$6.24	-	-	262K
Qwen: Qwen3.6 Plus	$0.325	$1.95	-	-	1M
Qwen: Qwen3.7 Max	$1.48	$4.42	$0.295	-	1M
Qwen: Qwen3.7 Plus	$0.32	$1.28	$0.064	-	1M

rekaai(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Reka Edge	$0.10	$0.10	-	-	16K
Reka Flash 3	$0.10	$0.20	-	-	66K

relace(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Relace: Relace Apply 3	$0.85	$1.25	-	-	256K
Relace: Relace Search	$1.00	$3.00	-	-	256K

sakana(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Sakana: Fugu Ultra	$5.00	$30.00	$0.50	-	1M

sao10k(3 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Sao10K: Llama 3 8B Lunaris	$0.04	$0.05	-	-	8K
Sao10K: Llama 3.1 Euryale 70B v2.2	$0.85	$0.85	-	-	131K
Sao10K: Llama 3.3 Euryale 70B	$0.65	$0.75	-	-	131K

stepfun(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
StepFun: Step 3.5 Flash	$0.10	$0.30	-	-	262K
StepFun: Step 3.7 Flash	$0.20	$1.15	$0.04	-	256K

tencent(3 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Tencent: Hunyuan A13B Instruct	$0.14	$0.57	-	-	131K
Tencent: Hy3	$0.20	$0.80	$0.05	-	262K
Tencent: Hy3 preview	$0.063	$0.21	$0.021	-	262K

thedrummer(4 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
TheDrummer: Cydonia 24B V4.1	$0.30	$0.50	$0.15	-	131K
TheDrummer: Rocinante 12B	$0.25	$0.50	-	-	66K
TheDrummer: Skyfall 36B V2	$0.55	$0.80	$0.25	-	33K
TheDrummer: UnslopNemo 12B	$0.40	$0.40	-	-	33K

undi95(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
ReMM SLERP 13B	$0.45	$0.65	-	-	6K

upstage(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Upstage: Solar Pro 3	$0.15	$0.60	$0.015	-	128K

writer(1 model)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Writer: Palmyra X5	$0.60	$6.00	-	-	1M

xAI(5 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
xAI: Grok 4.20	$1.25	$2.50	$0.20	-	2M
xAI: Grok 4.20 Multi-Agent	$1.25	$2.50	$0.20	-	2M
xAI: Grok 4.3	$1.25	$2.50	$0.20	-	1M
xAI: Grok 4.5	$2.00	$6.00	$0.50	-	500K
xAI: Grok Build 0.1	$1.00	$2.00	$0.20	-	256K

xiaomi(2 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Xiaomi: MiMo-V2.5	$0.14	$0.28	$0.0028	-	1M
Xiaomi: MiMo-V2.5-Pro	$0.435	$0.87	$0.0036	-	1M

Z.ai(12 models)

Model	Input / 1M tokens	Output / 1M tokens	Cache Read / 1M	Reasoning / 1M	Context
Z.ai: GLM 4.5	$0.60	$2.20	$0.11	-	131K
Z.ai: GLM 4.5 Air	$0.13	$0.85	$0.025	-	131K
Z.ai: GLM 4.5V	$0.60	$1.80	$0.11	-	66K
Z.ai: GLM 4.6	$0.50	$2.00	$0.10	-	203K
Z.ai: GLM 4.6V	$0.30	$0.90	$0.055	-	131K
Z.ai: GLM 4.7	$0.40	$1.75	$0.08	-	203K
Z.ai: GLM 4.7 Flash	$0.06	$0.40	$0.01	-	203K
Z.ai: GLM 5	$0.95	$3.15	$0.19	-	203K
Z.ai: GLM 5 Turbo	$1.20	$4.00	$0.24	-	203K
Z.ai: GLM 5.1	$0.966	$3.04	$0.1794	-	203K
Z.ai: GLM 5.2	$0.9478	$2.98	$0.176	-	1M
Z.ai: GLM 5V Turbo	$1.20	$4.00	$0.24	-	203K