Skip to main content
TokenCost logoTokenCost

Cheapest LLM Models in 2026

All 131+ LLM API models ranked from cheapest to most expensive by average token price. Find the most budget-friendly AI model for your project.

RankModelProviderInput/1MOutput/1MAvg/1MContext
#1Gemma 4 12BGoogle$0$0$0.00262K
#2Qwen3.5-9BAlibaba$0.05$0.15$0.10262K
#3GPT-OSS 120BOpenAI$0.039$0.19$0.11131K
#4Ministral 3 8BMistral$0.15$0.15$0.15262K
#5Pixtral 12BMistral$0.15$0.15$0.15128K
#6Qwen3.5-Omni FlashAlibaba$0.065$0.26$0.16262K
#7Hunyuan HY3 PreviewTencent$0.066$0.26$0.16262K
#8Gemma 4 26B A4BGoogle$0.06$0.3$0.18262K
#9Llama 3.3 70BMeta$0.18$0.18$0.18131K
#10GPT-OSS 20B (Bedrock)OpenAI$0.07$0.3$0.1816K
#11GPT-OSS 20BOpenAI$0.075$0.3$0.19131K
#12Gemini 2.0 Flash-LiteGoogle$0.075$0.3$0.191.0M
#13Llama 4 ScoutMeta$0.08$0.3$0.191.0M
#14Devstral SmallMistral$0.1$0.3$0.20256K
#15Mistral Small 3.2Mistral$0.1$0.3$0.20128K
#16Ministral 3 14BMistral$0.2$0.2$0.20262K
#17DeepSeek V4-FlashDeepSeek$0.14$0.28$0.211.0M
#18GPT-5 NanoOpenAI$0.05$0.4$0.23128K
#19GLM-4.7-flashZhipu$0.07$0.4$0.24200K
#20Gemma 4 31BGoogle$0.12$0.36$0.24262K
#21GPT-4.1 NanoOpenAI$0.1$0.4$0.251.0M
#22Gemini 2.5 Flash-LiteGoogle$0.1$0.4$0.251.0M
#23Gemini 2.0 FlashGoogle$0.1$0.4$0.251.0M
#24Grok 4.1 FastxAI$0.2$0.5$0.352.0M
#25Grok 4.1 Fast ReasoningxAI$0.2$0.5$0.352.0M
#26DeepSeek V3.2 (Chat)DeepSeek$0.28$0.42$0.35128K
#27DeepSeek V3.2 (Reasoner)DeepSeek$0.28$0.42$0.35128K
#28GPT-OSS 120B (Bedrock)OpenAI$0.15$0.6$0.3816K
#29GPT-4o MiniOpenAI$0.15$0.6$0.38128K
#30Mistral Small 4Mistral$0.15$0.6$0.38256K
#31Grok 3 MinixAI$0.3$0.5$0.40131K
#32Qwen3-Next-80B-A3B-ThinkingAlibaba$0.0975$0.78$0.44262K
#33Qwen3 Coder NextAlibaba$0.11$0.8$0.46262K
#34Mercury 2Inception Labs$0.25$0.75$0.50128K
#35Nemotron 3 Super 120BNVIDIA$0.3$0.8$0.551.0M
#36Llama 4 MaverickMeta$0.27$0.85$0.561.0M
#37CodestralMistral$0.3$0.9$0.60256K
#38DeepSeek V4-ProDeepSeek$0.435$0.87$0.651.0M
#39GPT-5.4 NanoOpenAI$0.2$1.25$0.72400K
#40MiniMax M2.7MiniMax$0.3$1.2$0.75205K
#41MiniMax M2.5MiniMax$0.3$1.2$0.75128K
#42Grok Code FastxAI$0.2$1.5$0.85256K
#43Gemini 3.1 Flash-LiteGoogle$0.25$1.5$0.881.0M
#44Qwen3.6-PlusAlibaba$0.276$1.651$0.961.0M
#45GPT-4.1 MiniOpenAI$0.4$1.6$1.001.0M
#46Mistral Large 3Mistral$0.5$1.5$1.00262K
#47Magistral SmallMistral$0.5$1.5$1.0040K
#48Qwen3.7 PlusAlibaba$0.4$1.6$1.001.0M
#49GPT-5 MiniOpenAI$0.25$2$1.13400K
#50Mistral MediumMistral$0.4$2$1.20131K
#51DevstralMistral$0.4$2$1.20256K
#52Qwen 3.5 27BAlibaba$0.3$2.4$1.35128K
#53DeepSeek R1DeepSeek$0.55$2.19$1.37128K
#54Gemini 2.5 FlashGoogle$0.3$2.5$1.401.0M
#55Nova 2.0 LiteAmazon$0.3$2.5$1.401.0M
#56GLM-4.7Zhipu$0.6$2.2$1.40200K
#57Grok Build 0.1xAI$1$2$1.50256K
#58Nemotron 3 Ultra 550BNVIDIA$0.5$2.5$1.501.0M
#59MiniMax M3MiniMax$0.6$2.4$1.501.0M
#60Kimi K2 ThinkingMoonshot$0.6$2.5$1.55262K
#61QwQ-PlusAlibaba$0.8$2.4$1.60131K
#62Gemini 3 FlashGoogle$0.5$3$1.751.0M
#63Gemini 3 Flash ReasoningGoogle$0.5$3$1.751.0M
#64Kimi K2.5Moonshot$0.6$3$1.80262K
#65Grok 4.3xAI$1.25$2.5$1.881.0M
#66MiMo-V2-ProXiaomi$1$3$2.001.0M
#67Qwen 3.5 397BAlibaba$0.6$3.6$2.10128K
#68GLM-5Zhipu$1$3.2$2.10128K
#69Grok 3 Mini FastxAI$0.6$4$2.30131K
#70Claude Haiku 3.5Anthropic$0.8$4$2.40200K
#71Kimi K2.6Moonshot$0.95$4$2.48262K
#72Qwen3.5-Omni PlusAlibaba$0.4$4.8$2.60262K
#73GLM-5 TurboZhipu$1.2$4$2.60200K
#74GPT-5.4 MiniOpenAI$0.75$4.5$2.63400K
#75Gemini 3.1 Flash LiveGoogle$0.75$4.5$2.631.0M
#76o4 MiniOpenAI$1.1$4.4$2.75200K
#77o3 MiniOpenAI$1.1$4.4$2.75200K
#78GLM-5.1Zhipu$1.4$4.4$2.90200K
#79Claude Haiku 4.5Anthropic$1$5$3.00200K
#80Claude 4.5 Haiku ReasoningAnthropic$1$5$3.00200K
#81Magistral MediumMistral$2$5$3.5040K
#82Grok 4.20xAI$2$6$4.002.0M
#83Pixtral LargeMistral$2$6$4.00128K
#84Mistral Medium 3.5Mistral$1.5$7.5$4.50256K
#85Qwen3.6-Max-PreviewAlibaba$1.3$7.8$4.55262K
#86Kimi K2 Thinking TurboMoonshot$1.15$8$4.58262K
#87GPT-4.1OpenAI$2$8$5.001.0M
#88o3OpenAI$2$8$5.00200K
#89o4 Mini Deep ResearchOpenAI$2$8$5.00200K
#90Qwen3.7 MaxAlibaba$2.5$7.5$5.001.0M
#91Gemini 3.5 FlashGoogle$1.5$9$5.251.0M
#92GPT-5.1OpenAI$1.25$10$5.63400K
#93GPT-5OpenAI$1.25$10$5.63400K
#94GPT-5 MediumOpenAI$1.25$10$5.63400K
#95Gemini 2.5 ProGoogle$1.25$10$5.631.0M
#96Nova 2.0 Pro ReasoningAmazon$1.25$10$5.63128K
#97Grok 2xAI$2$10$6.00131K
#98GPT-4oOpenAI$2.5$10$6.25128K
#99Command A+Cohere$2.5$10$6.25128K
#100Command ACohere$2.5$10$6.25128K
#101Gemini 3.1 ProGoogle$2$12$7.001.0M
#102Gemini 3 ProGoogle$2$12$7.001.0M
#103GPT-5.2OpenAI$1.75$14$7.88400K
#104GPT-5.3 CodexOpenAI$1.75$14$7.88400K
#105Voxtral TTSMistral$16$0$8.00128K
#106GPT-5.4OpenAI$2.5$15$8.751.1M
#107Claude Sonnet 4.6 AdaptiveAnthropic$3$15$9.00200K
#108Claude Sonnet 4.6Anthropic$3$15$9.00200K
#109Claude Sonnet 4.5Anthropic$3$15$9.00200K
#110Claude Sonnet 4Anthropic$3$15$9.00200K
#111Claude 3.7 SonnetAnthropic$3$15$9.00200K
#112Grok 4xAI$3$15$9.002.0M
#113Grok 3xAI$3$15$9.00131K
#114Sonar ProPerplexity$3$15$9.00128K
#115Gemini 3.1 Flash TTSGoogle$1$20$10.5032K
#116Claude Opus 4.8Anthropic$5$25$15.001.0M
#117Claude Opus 4.7Anthropic$5$25$15.001.0M
#118Claude Opus 4.6 AdaptiveAnthropic$5$25$15.00200K
#119Claude Opus 4.6Anthropic$5$25$15.00200K
#120Claude Opus 4.5Anthropic$5$25$15.00200K
#121Grok 3 FastxAI$5$25$15.00131K
#122GPT-5.5OpenAI$5$30$17.501.1M
#123MAI-Image-2Microsoft$5$33$19.0032K
#124o3 Deep ResearchOpenAI$10$40$25.00200K
#125o1OpenAI$15$60$37.50200K
#126Claude Opus 4.1Anthropic$15$75$45.00200K
#127o3-proOpenAI$20$80$50.00200K
#128Claude Mythos PreviewAnthropic$25$125$75.001.0M
#129GPT-5.5 ProOpenAI$30$180$105.001.1M
#130GPT-5.4 ProOpenAI$30$180$105.001.1M
#131o1 ProOpenAI$150$600$375.00200K

How We Rank the Cheapest LLMs

Models are ranked by their average price per 1 million tokens, calculated as (input price + output price) / 2. This gives a balanced view of overall cost since most workloads use both input and output tokens.

Keep in mind that the cheapest model isn't always the best choice. Consider quality benchmarks, context window size, and output speed when making your decision. Use our leaderboard to compare quality alongside cost.