Cheapest LLM Models in 2026
All 135+ LLM API models ranked from cheapest to most expensive by average token price. Find the most budget-friendly AI model for your project.
| Rank | Model | Provider | Input/1M | Output/1M | Avg/1M | Context |
|---|---|---|---|---|---|---|
| #1 | Gemma 4 12B | $0 | $0 | $0.00 | 262K | |
| #2 | DiffusionGemma 26B | $0 | $0 | $0.00 | 262K | |
| #3 | Qwen3.5-9B | Alibaba | $0.05 | $0.15 | $0.10 | 262K |
| #4 | GPT-OSS 120B | OpenAI | $0.039 | $0.19 | $0.11 | 131K |
| #5 | Ministral 3 8B | Mistral | $0.15 | $0.15 | $0.15 | 262K |
| #6 | Pixtral 12B | Mistral | $0.15 | $0.15 | $0.15 | 128K |
| #7 | Qwen3.5-Omni Flash | Alibaba | $0.065 | $0.26 | $0.16 | 262K |
| #8 | Hunyuan HY3 Preview | Tencent | $0.066 | $0.26 | $0.16 | 262K |
| #9 | Gemma 4 26B A4B | $0.06 | $0.3 | $0.18 | 262K | |
| #10 | Llama 3.3 70B | Meta | $0.18 | $0.18 | $0.18 | 131K |
| #11 | GPT-OSS 20B (Bedrock) | OpenAI | $0.07 | $0.3 | $0.18 | 16K |
| #12 | GPT-OSS 20B | OpenAI | $0.075 | $0.3 | $0.19 | 131K |
| #13 | Gemini 2.0 Flash-Lite | $0.075 | $0.3 | $0.19 | 1.0M | |
| #14 | Llama 4 Scout | Meta | $0.08 | $0.3 | $0.19 | 1.0M |
| #15 | Devstral Small | Mistral | $0.1 | $0.3 | $0.20 | 256K |
| #16 | Mistral Small 3.2 | Mistral | $0.1 | $0.3 | $0.20 | 128K |
| #17 | Ministral 3 14B | Mistral | $0.2 | $0.2 | $0.20 | 262K |
| #18 | DeepSeek V4-Flash | DeepSeek | $0.14 | $0.28 | $0.21 | 1.0M |
| #19 | GPT-5 Nano | OpenAI | $0.05 | $0.4 | $0.23 | 128K |
| #20 | GLM-4.7-flash | Zhipu | $0.07 | $0.4 | $0.24 | 200K |
| #21 | Gemma 4 31B | $0.12 | $0.36 | $0.24 | 262K | |
| #22 | GPT-4.1 Nano | OpenAI | $0.1 | $0.4 | $0.25 | 1.0M |
| #23 | Gemini 2.5 Flash-Lite | $0.1 | $0.4 | $0.25 | 1.0M | |
| #24 | Gemini 2.0 Flash | $0.1 | $0.4 | $0.25 | 1.0M | |
| #25 | Grok 4.1 Fast | xAI | $0.2 | $0.5 | $0.35 | 2.0M |
| #26 | Grok 4.1 Fast Reasoning | xAI | $0.2 | $0.5 | $0.35 | 2.0M |
| #27 | DeepSeek V3.2 (Chat) | DeepSeek | $0.28 | $0.42 | $0.35 | 128K |
| #28 | DeepSeek V3.2 (Reasoner) | DeepSeek | $0.28 | $0.42 | $0.35 | 128K |
| #29 | GPT-OSS 120B (Bedrock) | OpenAI | $0.15 | $0.6 | $0.38 | 16K |
| #30 | GPT-4o Mini | OpenAI | $0.15 | $0.6 | $0.38 | 128K |
| #31 | Mistral Small 4 | Mistral | $0.15 | $0.6 | $0.38 | 256K |
| #32 | Grok 3 Mini | xAI | $0.3 | $0.5 | $0.40 | 131K |
| #33 | Qwen3-Next-80B-A3B-Thinking | Alibaba | $0.0975 | $0.78 | $0.44 | 262K |
| #34 | Qwen3 Coder Next | Alibaba | $0.11 | $0.8 | $0.46 | 262K |
| #35 | Mercury 2 | Inception Labs | $0.25 | $0.75 | $0.50 | 128K |
| #36 | Nemotron 3 Super 120B | NVIDIA | $0.3 | $0.8 | $0.55 | 1.0M |
| #37 | Llama 4 Maverick | Meta | $0.27 | $0.85 | $0.56 | 1.0M |
| #38 | Codestral | Mistral | $0.3 | $0.9 | $0.60 | 256K |
| #39 | DeepSeek V4-Pro | DeepSeek | $0.435 | $0.87 | $0.65 | 1.0M |
| #40 | MiMo-V2.5-Pro | Xiaomi | $0.435 | $0.87 | $0.65 | 1.0M |
| #41 | GPT-5.4 Nano | OpenAI | $0.2 | $1.25 | $0.72 | 400K |
| #42 | MiniMax M2.7 | MiniMax | $0.3 | $1.2 | $0.75 | 205K |
| #43 | MiniMax M2.5 | MiniMax | $0.3 | $1.2 | $0.75 | 128K |
| #44 | Grok Code Fast | xAI | $0.2 | $1.5 | $0.85 | 256K |
| #45 | Gemini 3.1 Flash-Lite | $0.25 | $1.5 | $0.88 | 1.0M | |
| #46 | Qwen3.6-Plus | Alibaba | $0.276 | $1.651 | $0.96 | 1.0M |
| #47 | GPT-4.1 Mini | OpenAI | $0.4 | $1.6 | $1.00 | 1.0M |
| #48 | Mistral Large 3 | Mistral | $0.5 | $1.5 | $1.00 | 262K |
| #49 | Magistral Small | Mistral | $0.5 | $1.5 | $1.00 | 40K |
| #50 | Qwen3.7 Plus | Alibaba | $0.4 | $1.6 | $1.00 | 1.0M |
| #51 | GPT-5 Mini | OpenAI | $0.25 | $2 | $1.13 | 400K |
| #52 | Mistral Medium | Mistral | $0.4 | $2 | $1.20 | 131K |
| #53 | Devstral | Mistral | $0.4 | $2 | $1.20 | 256K |
| #54 | Qwen 3.5 27B | Alibaba | $0.3 | $2.4 | $1.35 | 128K |
| #55 | DeepSeek R1 | DeepSeek | $0.55 | $2.19 | $1.37 | 128K |
| #56 | Gemini 2.5 Flash | $0.3 | $2.5 | $1.40 | 1.0M | |
| #57 | Nova 2.0 Lite | Amazon | $0.3 | $2.5 | $1.40 | 1.0M |
| #58 | GLM-4.7 | Zhipu | $0.6 | $2.2 | $1.40 | 200K |
| #59 | Grok Build 0.1 | xAI | $1 | $2 | $1.50 | 256K |
| #60 | Nemotron 3 Ultra 550B | NVIDIA | $0.5 | $2.5 | $1.50 | 1.0M |
| #61 | MiniMax M3 | MiniMax | $0.6 | $2.4 | $1.50 | 1.0M |
| #62 | Kimi K2 Thinking | Moonshot | $0.6 | $2.5 | $1.55 | 262K |
| #63 | QwQ-Plus | Alibaba | $0.8 | $2.4 | $1.60 | 131K |
| #64 | Gemini 3 Flash | $0.5 | $3 | $1.75 | 1.0M | |
| #65 | Gemini 3 Flash Reasoning | $0.5 | $3 | $1.75 | 1.0M | |
| #66 | Kimi K2.5 | Moonshot | $0.6 | $3 | $1.80 | 262K |
| #67 | Grok 4.3 | xAI | $1.25 | $2.5 | $1.88 | 1.0M |
| #68 | MiMo-V2-Pro | Xiaomi | $1 | $3 | $2.00 | 1.0M |
| #69 | Qwen 3.5 397B | Alibaba | $0.6 | $3.6 | $2.10 | 128K |
| #70 | GLM-5 | Zhipu | $1 | $3.2 | $2.10 | 128K |
| #71 | Grok 3 Mini Fast | xAI | $0.6 | $4 | $2.30 | 131K |
| #72 | Claude Haiku 3.5 | Anthropic | $0.8 | $4 | $2.40 | 200K |
| #73 | Kimi K2.6 | Moonshot | $0.95 | $4 | $2.48 | 262K |
| #74 | Qwen3.5-Omni Plus | Alibaba | $0.4 | $4.8 | $2.60 | 262K |
| #75 | GLM-5 Turbo | Zhipu | $1.2 | $4 | $2.60 | 200K |
| #76 | GPT-5.4 Mini | OpenAI | $0.75 | $4.5 | $2.63 | 400K |
| #77 | Gemini 3.1 Flash Live | $0.75 | $4.5 | $2.63 | 1.0M | |
| #78 | o4 Mini | OpenAI | $1.1 | $4.4 | $2.75 | 200K |
| #79 | o3 Mini | OpenAI | $1.1 | $4.4 | $2.75 | 200K |
| #80 | GLM-5.1 | Zhipu | $1.4 | $4.4 | $2.90 | 200K |
| #81 | Claude Haiku 4.5 | Anthropic | $1 | $5 | $3.00 | 200K |
| #82 | Claude 4.5 Haiku Reasoning | Anthropic | $1 | $5 | $3.00 | 200K |
| #83 | Magistral Medium | Mistral | $2 | $5 | $3.50 | 40K |
| #84 | Grok 4.20 | xAI | $2 | $6 | $4.00 | 2.0M |
| #85 | Pixtral Large | Mistral | $2 | $6 | $4.00 | 128K |
| #86 | Mistral Medium 3.5 | Mistral | $1.5 | $7.5 | $4.50 | 256K |
| #87 | Qwen3.6-Max-Preview | Alibaba | $1.3 | $7.8 | $4.55 | 262K |
| #88 | Kimi K2 Thinking Turbo | Moonshot | $1.15 | $8 | $4.58 | 262K |
| #89 | GPT-4.1 | OpenAI | $2 | $8 | $5.00 | 1.0M |
| #90 | o3 | OpenAI | $2 | $8 | $5.00 | 200K |
| #91 | o4 Mini Deep Research | OpenAI | $2 | $8 | $5.00 | 200K |
| #92 | Qwen3.7 Max | Alibaba | $2.5 | $7.5 | $5.00 | 1.0M |
| #93 | Gemini 3.5 Flash | $1.5 | $9 | $5.25 | 1.0M | |
| #94 | GPT-5.1 | OpenAI | $1.25 | $10 | $5.63 | 400K |
| #95 | GPT-5 | OpenAI | $1.25 | $10 | $5.63 | 400K |
| #96 | GPT-5 Medium | OpenAI | $1.25 | $10 | $5.63 | 400K |
| #97 | Gemini 2.5 Pro | $1.25 | $10 | $5.63 | 1.0M | |
| #98 | Nova 2.0 Pro Reasoning | Amazon | $1.25 | $10 | $5.63 | 128K |
| #99 | Grok 2 | xAI | $2 | $10 | $6.00 | 131K |
| #100 | GPT-4o | OpenAI | $2.5 | $10 | $6.25 | 128K |
| #101 | Command A+ | Cohere | $2.5 | $10 | $6.25 | 128K |
| #102 | Command A | Cohere | $2.5 | $10 | $6.25 | 128K |
| #103 | Gemini 3.1 Pro | $2 | $12 | $7.00 | 1.0M | |
| #104 | Gemini 3 Pro | $2 | $12 | $7.00 | 1.0M | |
| #105 | GPT-5.2 | OpenAI | $1.75 | $14 | $7.88 | 400K |
| #106 | GPT-5.3 Codex | OpenAI | $1.75 | $14 | $7.88 | 400K |
| #107 | Voxtral TTS | Mistral | $16 | $0 | $8.00 | 128K |
| #108 | GPT-5.4 | OpenAI | $2.5 | $15 | $8.75 | 1.1M |
| #109 | Claude Sonnet 4.6 Adaptive | Anthropic | $3 | $15 | $9.00 | 200K |
| #110 | Claude Sonnet 4.6 | Anthropic | $3 | $15 | $9.00 | 200K |
| #111 | Claude Sonnet 4.5 | Anthropic | $3 | $15 | $9.00 | 200K |
| #112 | Claude Sonnet 4 | Anthropic | $3 | $15 | $9.00 | 200K |
| #113 | Claude 3.7 Sonnet | Anthropic | $3 | $15 | $9.00 | 200K |
| #114 | Grok 4 | xAI | $3 | $15 | $9.00 | 2.0M |
| #115 | Grok 3 | xAI | $3 | $15 | $9.00 | 131K |
| #116 | Sonar Pro | Perplexity | $3 | $15 | $9.00 | 128K |
| #117 | Gemini 3.1 Flash TTS | $1 | $20 | $10.50 | 32K | |
| #118 | Claude Opus 4.8 | Anthropic | $5 | $25 | $15.00 | 1.0M |
| #119 | Claude Opus 4.7 | Anthropic | $5 | $25 | $15.00 | 1.0M |
| #120 | Claude Opus 4.6 Adaptive | Anthropic | $5 | $25 | $15.00 | 200K |
| #121 | Claude Opus 4.6 | Anthropic | $5 | $25 | $15.00 | 200K |
| #122 | Claude Opus 4.5 | Anthropic | $5 | $25 | $15.00 | 200K |
| #123 | Grok 3 Fast | xAI | $5 | $25 | $15.00 | 131K |
| #124 | GPT-5.5 | OpenAI | $5 | $30 | $17.50 | 1.1M |
| #125 | MAI-Image-2 | Microsoft | $5 | $33 | $19.00 | 32K |
| #126 | o3 Deep Research | OpenAI | $10 | $40 | $25.00 | 200K |
| #127 | Claude Fable 5 | Anthropic | $10 | $50 | $30.00 | 1.0M |
| #128 | Claude Mythos 5 | Anthropic | $10 | $50 | $30.00 | 1.0M |
| #129 | o1 | OpenAI | $15 | $60 | $37.50 | 200K |
| #130 | Claude Opus 4.1 | Anthropic | $15 | $75 | $45.00 | 200K |
| #131 | o3-pro | OpenAI | $20 | $80 | $50.00 | 200K |
| #132 | Claude Mythos Preview | Anthropic | $25 | $125 | $75.00 | 1.0M |
| #133 | GPT-5.5 Pro | OpenAI | $30 | $180 | $105.00 | 1.1M |
| #134 | GPT-5.4 Pro | OpenAI | $30 | $180 | $105.00 | 1.1M |
| #135 | o1 Pro | OpenAI | $150 | $600 | $375.00 | 200K |
How We Rank the Cheapest LLMs
Models are ranked by their average price per 1 million tokens, calculated as (input price + output price) / 2. This gives a balanced view of overall cost since most workloads use both input and output tokens.
Keep in mind that the cheapest model isn't always the best choice. Consider quality benchmarks, context window size, and output speed when making your decision. Use our leaderboard to compare quality alongside cost.