Cheapest LLM Models in 2026
All 102 LLM API models in the table below are ranked from cheapest to most expensive by average token price. Find the most budget-friendly AI model for your project.
| Rank | Model | Provider | Input/1M | Output/1M | Avg/1M | Context |
|---|---|---|---|---|---|---|
| #1 | Qwen3.5-9B | Alibaba | $0.05 | $0.15 | $0.10 | 262K |
| #2 | GPT-OSS 120B | OpenAI | $0.039 | $0.19 | $0.11 | 131K |
| #3 | Ministral 3 8B | Mistral | $0.15 | $0.15 | $0.15 | 262K |
| #4 | Qwen3.5-Omni Flash | Alibaba | $0.065 | $0.26 | $0.16 | 262K |
| #5 | Llama 3.3 70B | Meta | $0.18 | $0.18 | $0.18 | 131K |
| #6 | GPT-OSS 20B | OpenAI | $0.075 | $0.3 | $0.19 | 131K |
| #7 | Gemini 2.0 Flash-Lite | Google | $0.075 | $0.3 | $0.19 | 1.0M |
| #8 | Llama 4 Scout | Meta | $0.08 | $0.3 | $0.19 | 1.0M |
| #9 | Devstral Small | Mistral | $0.1 | $0.3 | $0.20 | 256K |
| #10 | Mistral Small 3.2 | Mistral | $0.1 | $0.3 | $0.20 | 128K |
| #11 | Ministral 3 14B | Mistral | $0.2 | $0.2 | $0.20 | 262K |
| #12 | GPT-5 Nano | OpenAI | $0.05 | $0.4 | $0.23 | 128K |
| #13 | GPT-4.1 Nano | OpenAI | $0.1 | $0.4 | $0.25 | 1.0M |
| #14 | Gemini 2.5 Flash-Lite | Google | $0.1 | $0.4 | $0.25 | 1.0M |
| #15 | Gemini 2.0 Flash | Google | $0.1 | $0.4 | $0.25 | 1.0M |
| #16 | Gemma 4 26B A4B | Google | $0.13 | $0.4 | $0.27 | 262K |
| #17 | Gemma 4 31B | Google | $0.14 | $0.4 | $0.27 | 262K |
| #18 | Grok 4.1 Fast | xAI | $0.2 | $0.5 | $0.35 | 2.0M |
| #19 | Grok 4.1 Fast Reasoning | xAI | $0.2 | $0.5 | $0.35 | 2.0M |
| #20 | DeepSeek V3.2 (Chat) | DeepSeek | $0.28 | $0.42 | $0.35 | 128K |
| #21 | DeepSeek V3.2 (Reasoner) | DeepSeek | $0.28 | $0.42 | $0.35 | 128K |
| #22 | GPT-4o Mini | OpenAI | $0.15 | $0.6 | $0.38 | 128K |
| #23 | Mistral Small 4 | Mistral | $0.15 | $0.6 | $0.38 | 256K |
| #24 | Grok 3 Mini | xAI | $0.3 | $0.5 | $0.40 | 131K |
| #25 | Nemotron 3 Super 120B | NVIDIA | $0.3 | $0.8 | $0.55 | 1.0M |
| #26 | Llama 4 Maverick | Meta | $0.27 | $0.85 | $0.56 | 1.0M |
| #27 | Codestral | Mistral | $0.3 | $0.9 | $0.60 | 256K |
| #28 | GPT-5.4 Nano | OpenAI | $0.2 | $1.25 | $0.72 | 400K |
| #29 | MiniMax M2.5 | MiniMax | $0.3 | $1.2 | $0.75 | 128K |
| #30 | Grok Code Fast | xAI | $0.2 | $1.5 | $0.85 | 256K |
| #31 | Gemini 3.1 Flash-Lite | Google | $0.25 | $1.5 | $0.88 | 1.0M |
| #32 | Qwen3.6-Plus | Alibaba | $0.276 | $1.651 | $0.96 | 1.0M |
| #33 | GPT-4.1 Mini | OpenAI | $0.4 | $1.6 | $1.00 | 1.0M |
| #34 | Mistral Large 3 | Mistral | $0.5 | $1.5 | $1.00 | 262K |
| #35 | Magistral Small | Mistral | $0.5 | $1.5 | $1.00 | 40K |
| #36 | GPT-5 Mini | OpenAI | $0.25 | $2 | $1.13 | 400K |
| #37 | Mistral Medium | Mistral | $0.4 | $2 | $1.20 | 131K |
| #38 | Devstral | Mistral | $0.4 | $2 | $1.20 | 256K |
| #39 | Qwen 3.5 27B | Alibaba | $0.3 | $2.4 | $1.35 | 128K |
| #40 | DeepSeek R1 | DeepSeek | $0.55 | $2.19 | $1.37 | 128K |
| #41 | Gemini 2.5 Flash | Google | $0.3 | $2.5 | $1.40 | 1.0M |
| #42 | Nova 2.0 Lite | Amazon | $0.3 | $2.5 | $1.40 | 128K |
| #43 | Kimi K2 Thinking | Moonshot | $0.6 | $2.5 | $1.55 | 262K |
| #44 | Gemini 3 Flash | Google | $0.5 | $3 | $1.75 | 1.0M |
| #45 | Gemini 3 Flash Reasoning | Google | $0.5 | $3 | $1.75 | 1.0M |
| #46 | Kimi K2.5 | Moonshot | $0.6 | $3 | $1.80 | 262K |
| #47 | MiMo-V2-Pro | Xiaomi | $1 | $3 | $2.00 | 1.0M |
| #48 | Qwen 3.5 397B | Alibaba | $0.6 | $3.6 | $2.10 | 128K |
| #49 | GLM-5 | Zhipu | $1 | $3.2 | $2.10 | 128K |
| #50 | Grok 3 Mini Fast | xAI | $0.6 | $4 | $2.30 | 131K |
| #51 | Claude Haiku 3.5 | Anthropic | $0.8 | $4 | $2.40 | 200K |
| #52 | Kimi K2.6 | Moonshot | $0.95 | $4 | $2.48 | 262K |
| #53 | Qwen3.5-Omni Plus | Alibaba | $0.4 | $4.8 | $2.60 | 262K |
| #54 | GLM-5 Turbo | Zhipu | $1.2 | $4 | $2.60 | 200K |
| #55 | GPT-5.4 Mini | OpenAI | $0.75 | $4.5 | $2.63 | 400K |
| #56 | Gemini 3.1 Flash Live | Google | $0.75 | $4.5 | $2.63 | 1.0M |
| #57 | o4 Mini | OpenAI | $1.1 | $4.4 | $2.75 | 200K |
| #58 | o3 Mini | OpenAI | $1.1 | $4.4 | $2.75 | 200K |
| #59 | Claude Haiku 4.5 | Anthropic | $1 | $5 | $3.00 | 200K |
| #60 | Claude 4.5 Haiku Reasoning | Anthropic | $1 | $5 | $3.00 | 200K |
| #61 | Magistral Medium | Mistral | $2 | $5 | $3.50 | 40K |
| #62 | Grok 4.20 | xAI | $2 | $6 | $4.00 | 2.0M |
| #63 | Kimi K2 Thinking Turbo | Moonshot | $1.15 | $8 | $4.58 | 262K |
| #64 | GPT-4.1 | OpenAI | $2 | $8 | $5.00 | 1.0M |
| #65 | o3 | OpenAI | $2 | $8 | $5.00 | 200K |
| #66 | o4 Mini Deep Research | OpenAI | $2 | $8 | $5.00 | 200K |
| #67 | GPT-5.1 | OpenAI | $1.25 | $10 | $5.63 | 400K |
| #68 | GPT-5 | OpenAI | $1.25 | $10 | $5.63 | 400K |
| #69 | GPT-5 Medium | OpenAI | $1.25 | $10 | $5.63 | 400K |
| #70 | Gemini 2.5 Pro | Google | $1.25 | $10 | $5.63 | 1.0M |
| #71 | Nova 2.0 Pro Reasoning | Amazon | $1.25 | $10 | $5.63 | 128K |
| #72 | Grok 2 | xAI | $2 | $10 | $6.00 | 131K |
| #73 | GPT-4o | OpenAI | $2.5 | $10 | $6.25 | 128K |
| #74 | Command A | Cohere | $2.5 | $10 | $6.25 | 128K |
| #75 | Gemini 3.1 Pro | Google | $2 | $12 | $7.00 | 1.0M |
| #76 | Gemini 3 Pro | Google | $2 | $12 | $7.00 | 1.0M |
| #77 | GPT-5.2 | OpenAI | $1.75 | $14 | $7.88 | 400K |
| #78 | GPT-5.3 Codex | OpenAI | $1.75 | $14 | $7.88 | 400K |
| #79 | Voxtral TTS | Mistral | $16 | $0 | $8.00 | 128K |
| #80 | GPT-5.4 | OpenAI | $2.5 | $15 | $8.75 | 1.1M |
| #81 | Claude Sonnet 4.6 Adaptive | Anthropic | $3 | $15 | $9.00 | 200K |
| #82 | Claude Sonnet 4.6 | Anthropic | $3 | $15 | $9.00 | 200K |
| #83 | Claude Sonnet 4.5 | Anthropic | $3 | $15 | $9.00 | 200K |
| #84 | Claude Sonnet 4 | Anthropic | $3 | $15 | $9.00 | 200K |
| #85 | Claude 3.7 Sonnet | Anthropic | $3 | $15 | $9.00 | 200K |
| #86 | Grok 4 | xAI | $3 | $15 | $9.00 | 2.0M |
| #87 | Grok 3 | xAI | $3 | $15 | $9.00 | 131K |
| #88 | Sonar Pro | Perplexity | $3 | $15 | $9.00 | 128K |
| #89 | Gemini 3.1 Flash TTS | Google | $1 | $20 | $10.50 | 32K |
| #90 | Claude Opus 4.7 | Anthropic | $5 | $25 | $15.00 | 1.0M |
| #91 | Claude Opus 4.6 Adaptive | Anthropic | $5 | $25 | $15.00 | 200K |
| #92 | Claude Opus 4.6 | Anthropic | $5 | $25 | $15.00 | 200K |
| #93 | Claude Opus 4.5 | Anthropic | $5 | $25 | $15.00 | 200K |
| #94 | Grok 3 Fast | xAI | $5 | $25 | $15.00 | 131K |
| #95 | MAI-Image-2 | Microsoft | $5 | $33 | $19.00 | 32K |
| #96 | o3 Deep Research | OpenAI | $10 | $40 | $25.00 | 200K |
| #97 | o1 | OpenAI | $15 | $60 | $37.50 | 200K |
| #98 | Claude Opus 4.1 | Anthropic | $15 | $75 | $45.00 | 200K |
| #99 | o3-pro | OpenAI | $20 | $80 | $50.00 | 200K |
| #100 | Claude Mythos Preview | Anthropic | $25 | $125 | $75.00 | 1.0M |
| #101 | GPT-5.4 Pro | OpenAI | $30 | $180 | $105.00 | 1.1M |
| #102 | o1 Pro | OpenAI | $150 | $600 | $375.00 | 200K |
How We Rank the Cheapest LLMs
Models are ranked by their average price per 1 million tokens, calculated as (input price + output price) / 2. This gives a balanced view of overall cost since most workloads use both input and output tokens.
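As a rough illustration of the ranking rule, the Python sketch below computes the average price for a handful of rows copied from the table and sorts them from cheapest to most expensive. The `Model` class and `avg_price` function are illustrative names chosen for this example, not part of any API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


def avg_price(m: Model) -> float:
    """Average price per 1M tokens: (input price + output price) / 2."""
    return (m.input_per_m + m.output_per_m) / 2


# A few rows copied from the table, deliberately out of order.
models = [
    Model("Llama 3.3 70B", 0.18, 0.18),
    Model("Qwen3.5-9B", 0.05, 0.15),
    Model("GPT-4o Mini", 0.15, 0.60),
    Model("Ministral 3 8B", 0.15, 0.15),
]

# Sort ascending by average price, as the ranking does.
ranked = sorted(models, key=avg_price)
for rank, m in enumerate(ranked, start=1):
    print(f"#{rank} {m.name}: ${avg_price(m):.2f}")
```

The simple average weights input and output tokens equally, so a workload that is mostly input or mostly output may see a different effective cost than the ranking suggests.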
Keep in mind that the cheapest model isn't always the best choice. Consider quality benchmarks, context window size, and output speed when making your decision. Use our leaderboard to compare quality alongside cost.
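One way to act on this advice is to filter on a hard requirement first and only then sort by cost. The sketch below, assuming rows copied from the table with context windows taken from its rounded figures, keeps models that meet a minimum context window and returns the cheapest of those. The name `cheapest_with_context` and the tuple layout are hypothetical, and quality benchmarks and output speed are left out because the table does not list them.

```python
from typing import Optional

# (name, input $/1M, output $/1M, approximate context window in tokens),
# copied from the table; context sizes use the table's rounded values.
ROWS = [
    ("Qwen3.5-9B", 0.05, 0.15, 262_000),
    ("GPT-OSS 120B", 0.039, 0.19, 131_000),
    ("Llama 4 Scout", 0.08, 0.30, 1_000_000),
    ("GPT-4.1 Nano", 0.10, 0.40, 1_000_000),
    ("Grok 4.1 Fast", 0.20, 0.50, 2_000_000),
]


def cheapest_with_context(rows, min_context: int) -> Optional[tuple]:
    """Return the row with the lowest average price whose context window
    is at least min_context tokens, or None if no row qualifies."""
    eligible = [r for r in rows if r[3] >= min_context]
    if not eligible:
        return None
    return min(eligible, key=lambda r: (r[1] + r[2]) / 2)


print(cheapest_with_context(ROWS, 200_000))
print(cheapest_with_context(ROWS, 1_000_000))
print(cheapest_with_context(ROWS, 4_000_000))
```

Filtering before sorting keeps a model that is cheap but too small for the task from winning on price alone.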