API Pricing
Every major LLM provider in one table. Sortable, filterable. Prices per 1 million tokens.
| Model ▲▼ | Provider ▲▼ | Input / 1M ▲ | Output / 1M ▲▼ | Context ▲▼ |
|---|---|---|---|---|
| GPT-OSS 120B | OpenAI | $0.039 | $0.190 | 131K |
| GPT-5 Nano | OpenAI | $0.050 | $0.400 | 128K |
| Qwen3.5-9B | Alibaba | $0.050 | $0.150 | 262K |
| Qwen3.5-Omni Flash | Alibaba | $0.065 | $0.260 | 262K |
| GPT-OSS 20B | OpenAI | $0.075 | $0.300 | 131K |
| Gemini 2.0 Flash-Lite | $0.075 | $0.300 | 1.0M | |
| Llama 4 Scout | Meta | $0.080 | $0.300 | 1.0M |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1.0M |
| Gemini 2.5 Flash-Lite | $0.100 | $0.400 | 1.0M | |
| Gemini 2.0 Flash | $0.100 | $0.400 | 1.0M | |
| Devstral Small | Mistral | $0.100 | $0.300 | 256K |
| Mistral Small 3.2 | Mistral | $0.100 | $0.300 | 128K |
| Gemma 4 26B A4B | $0.130 | $0.400 | 262K | |
| Gemma 4 31B | $0.140 | $0.400 | 262K | |
| GPT-4o Mini | OpenAI | $0.150 | $0.600 | 128K |
| Mistral Small 4 | Mistral | $0.150 | $0.600 | 256K |
| Ministral 3 8B | Mistral | $0.150 | $0.150 | 262K |
| Command R | Cohere | $0.150 | $0.600 | 128K |
| Command R 08 2024 | Cohere | $0.150 | $0.600 | 128K |
| Command R7b 12 2024 | Cohere | $0.150 | $0.038 | 128K |
| Llama 3.3 70B | Meta | $0.180 | $0.180 | 131K |
| GPT-5.4 Nano | OpenAI | $0.200 | $1.25 | 400K |
| Grok 4.1 Fast | xAI | $0.200 | $0.500 | 2.0M |
| Grok 4.1 Fast Reasoning | xAI | $0.200 | $0.500 | 2.0M |
| Grok Code Fast | xAI | $0.200 | $1.50 | 256K |
| Ministral 3 14B | Mistral | $0.200 | $0.200 | 262K |
| Jamba 1.5 | AI21 | $0.200 | $0.400 | 256K |
| Jamba 1.5 Mini | AI21 | $0.200 | $0.400 | 256K |
| Jamba 1.5 Mini@001 | AI21 | $0.200 | $0.400 | 256K |
| Jamba Mini 1.6 | AI21 | $0.200 | $0.400 | 256K |
| Jamba Mini 1.7 | AI21 | $0.200 | $0.400 | 256K |
| GPT-5 Mini | OpenAI | $0.250 | $2.00 | 400K |
| Gemini 3.1 Flash-Lite | $0.250 | $1.50 | 1.0M | |
| Claude 3 Haiku 20240307 | Anthropic | $0.250 | $1.25 | 200K |
| Llama 4 Maverick | Meta | $0.270 | $0.850 | 1.0M |
| Qwen3.6-Plus | Alibaba | $0.276 | $1.65 | 1.0M |
| DeepSeek V3.2 (Chat) | DeepSeek | $0.280 | $0.420 | 128K |
| DeepSeek V3.2 (Reasoner) | DeepSeek | $0.280 | $0.420 | 128K |
| DeepSeek Reasoner | DeepSeek | $0.280 | $0.420 | 131K |
| Gemini 2.5 Flash | $0.300 | $2.50 | 1.0M | |
| Grok 3 Mini | xAI | $0.300 | $0.500 | 131K |
| Codestral | Mistral | $0.300 | $0.900 | 256K |
| Qwen 3.5 27B | Alibaba | $0.300 | $2.40 | 128K |
| Nova 2.0 Lite | Amazon | $0.300 | $2.50 | 128K |
| Nemotron 3 Super 120B | NVIDIA | $0.300 | $0.800 | 1.0M |
| MiniMax M2.5 | MiniMax | $0.300 | $1.20 | 128K |
| Command Light | Cohere | $0.300 | $0.600 | 4K |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 | 1.0M |
| Mistral Medium | Mistral | $0.400 | $2.00 | 131K |
| Devstral | Mistral | $0.400 | $2.00 | 256K |
| Qwen3.5-Omni Plus | Alibaba | $0.400 | $4.80 | 262K |
| Gemini 3 Flash | $0.500 | $3.00 | 1.0M | |
| Gemini 3 Flash Reasoning | $0.500 | $3.00 | 1.0M | |
| Mistral Large 3 | Mistral | $0.500 | $1.50 | 262K |
| Magistral Small | Mistral | $0.500 | $1.50 | 40K |
| DeepSeek R1 | DeepSeek | $0.550 | $2.19 | 128K |
| Qwen 3.5 397B | Alibaba | $0.600 | $3.60 | 128K |
| Kimi K2.5 | Moonshot | $0.600 | $3.00 | 262K |
| Kimi K2 Thinking | Moonshot | $0.600 | $2.50 | 262K |
| GPT-5.4 Mini | OpenAI | $0.750 | $4.50 | 400K |
| Gemini 3.1 Flash Live | $0.750 | $4.50 | 1.0M | |
| Claude Haiku 3.5 | Anthropic | $0.800 | $4.00 | 200K |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 | 200K |
| Claude 4.5 Haiku Reasoning | Anthropic | $1.00 | $5.00 | 200K |
| Gemini 3.1 Flash TTS | $1.00 | $20.00 | 32K | |
| GLM-5 | Zhipu | $1.00 | $3.20 | 128K |
| MiMo-V2-Pro | Xiaomi | $1.00 | $3.00 | 1.0M |
| Claude Haiku 4 5 20251001 | Anthropic | $1.00 | $5.00 | 200K |
| Claude Haiku 4 5 | Anthropic | $1.00 | $5.00 | 200K |
| o4 Mini | OpenAI | $1.10 | $4.40 | 200K |
| o3 Mini | OpenAI | $1.10 | $4.40 | 200K |
| Kimi K2 Thinking Turbo | Moonshot | $1.15 | $8.00 | 262K |
| GLM-5 Turbo | Zhipu | $1.20 | $4.00 | 200K |
| GPT-5.1 | OpenAI | $1.25 | $10.00 | 400K |
| GPT-5 | OpenAI | $1.25 | $10.00 | 400K |
| GPT-5 Medium | OpenAI | $1.25 | $10.00 | 400K |
| Gemini 2.5 Pro | $1.25 | $10.00 | 1.0M | |
| Nova 2.0 Pro Reasoning | Amazon | $1.25 | $10.00 | 128K |
| GPT-5.2 | OpenAI | $1.75 | $14.00 | 400K |
| GPT-5.3 Codex | OpenAI | $1.75 | $14.00 | 400K |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1.0M |
| o3 | OpenAI | $2.00 | $8.00 | 200K |
| o4 Mini Deep Research | OpenAI | $2.00 | $8.00 | 200K |
| Gemini 3.1 Pro | $2.00 | $12.00 | 1.0M | |
| Gemini 3 Pro | $2.00 | $12.00 | 1.0M | |
| Grok 4.20 | xAI | $2.00 | $6.00 | 2.0M |
| Grok 2 | xAI | $2.00 | $10.00 | 131K |
| Magistral Medium | Mistral | $2.00 | $5.00 | 40K |
| Jamba 1.5 Large | AI21 | $2.00 | $8.00 | 256K |
| Jamba 1.5 Large@001 | AI21 | $2.00 | $8.00 | 256K |
| Jamba Large 1.6 | AI21 | $2.00 | $8.00 | 256K |
| Jamba Large 1.7 | AI21 | $2.00 | $8.00 | 256K |
| GPT-5.4 | OpenAI | $2.50 | $15.00 | 1.1M |
| GPT-4o | OpenAI | $2.50 | $10.00 | 128K |
| Command A | Cohere | $2.50 | $10.00 | 128K |
| Command A 03 2025 | Cohere | $2.50 | $10.00 | 256K |
| Command R Plus | Cohere | $2.50 | $10.00 | 128K |
| Command R Plus 08 2024 | Cohere | $2.50 | $10.00 | 128K |
| Claude Sonnet 4.6 Adaptive | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 200K |
| Claude 3.7 Sonnet | Anthropic | $3.00 | $15.00 | 200K |
| Grok 4 | xAI | $3.00 | $15.00 | 2.0M |
| Grok 3 | xAI | $3.00 | $15.00 | 131K |
| Sonar Pro | Perplexity | $3.00 | $15.00 | 128K |
| Claude 3 7 Sonnet 20250219 | Anthropic | $3.00 | $15.00 | 200K |
| Claude 4 Sonnet 20250514 | Anthropic | $3.00 | $15.00 | 1.0M |
| Claude Sonnet 4 5 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 4 5 20250929 | Anthropic | $3.00 | $15.00 | 200K |
| Claude Sonnet 4 6 | Anthropic | $3.00 | $15.00 | 1.0M |
| Claude Sonnet 4 20250514 | Anthropic | $3.00 | $15.00 | 1.0M |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 | 1.0M |
| Claude Opus 4.6 Adaptive | Anthropic | $5.00 | $25.00 | 200K |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 | 200K |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 | 200K |
| MAI-Image-2 | Microsoft | $5.00 | $33.00 | 32K |
| Claude Opus 4 5 20251101 | Anthropic | $5.00 | $25.00 | 200K |
| Claude Opus 4 5 | Anthropic | $5.00 | $25.00 | 200K |
| Claude Opus 4 6 | Anthropic | $5.00 | $25.00 | 1.0M |
| Claude Opus 4 6 20260205 | Anthropic | $5.00 | $25.00 | 1.0M |
| Claude Opus 4 7 | Anthropic | $5.00 | $25.00 | 1.0M |
| Claude Opus 4 7 20260416 | Anthropic | $5.00 | $25.00 | 1.0M |
| o3 Deep Research | OpenAI | $10.00 | $40.00 | 200K |
| o1 | OpenAI | $15.00 | $60.00 | 200K |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 | 200K |
| Claude 3 Opus 20240229 | Anthropic | $15.00 | $75.00 | 200K |
| Claude 4 Opus 20250514 | Anthropic | $15.00 | $75.00 | 200K |
| Claude Opus 4 1 | Anthropic | $15.00 | $75.00 | 200K |
| Claude Opus 4 1 20250805 | Anthropic | $15.00 | $75.00 | 200K |
| Claude Opus 4 20250514 | Anthropic | $15.00 | $75.00 | 200K |
| Voxtral TTS | Mistral | $16.00 | $< 0.01 | 128K |
| o3-pro | OpenAI | $20.00 | $80.00 | 200K |
| Claude Mythos Preview | Anthropic | $25.00 | $125 | 1.0M |
| GPT-5.4 Pro | OpenAI | $30.00 | $180 | 1.1M |
Pricing as of March 2026. Open-source model prices reflect hosted API providers. Always verify with official pages. Value = quality index / input cost per 1M tokens (higher is better).
Speed & quality data by Artificial AnalysisHow to Compare LLM API Pricing
- 1
Browse the pricing table
400+ models are listed with input and output pricing per million tokens, plus context window sizes and benchmark scores.
- 2
Sort and filter
Click any column header to sort by price, context window, or benchmark score. Use the provider filter to focus on specific vendors.
- 3
Evaluate price vs. performance
Quality and speed metrics from Artificial Analysis help you weigh cost against actual model capability.
Why Use This Pricing Comparison
- 400+ models from 15+ providers in a single sortable table — no tab switching
- Enriched with quality index and speed benchmarks from Artificial Analysis
- Provider-colored badges for quick visual scanning across vendors
- Context window and max output token data alongside pricing
- Data sourced from official provider docs and LiteLLM open-source project
Common Use Cases
Vendor selection
Compare pricing across all major providers before committing to an API integration. Sort by cost to find the cheapest option.
Cost optimization
Find cheaper alternatives to your current model by sorting on price and checking if the quality benchmark is close enough.
Technical evaluation
Use the quality index and speed metrics to shortlist models, then compare their pricing to make a final decision.
Model comparison
Use the compare tool to see side-by-side pricing and specs for any two models.
Related Tools
Frequently Asked Questions
Common questions about LLM API pricing