How do I estimate my monthly LLM API cost?

Enter your average prompt size (in tokens), expected response size, and how many API calls you make per day. The calculator multiplies this by 30 days and applies each model's per-token pricing to give you a monthly estimate.

What usage presets are available?

The calculator includes presets for common patterns: Chatbot (short prompts, moderate responses), RAG (large context, short answers), Code Generation (medium prompts, long outputs), and Summarizer (long inputs, short outputs).

Can I compare costs in different currencies?

Yes. The calculator supports USD, EUR, GBP, INR, and JPY with live exchange rate approximations.

Are these cost estimates accurate?

Estimates use the latest published API pricing from each provider. Actual costs may vary based on prompt caching, batch discounts, and rate tier pricing that some providers offer.

Which models are included in the calculator?

All 60+ models from OpenAI, Anthropic, Google, xAI, Meta, Mistral, DeepSeek, and more — including GPT-5.4, Claude Opus 4.6, Gemini 3.1 Pro, Grok 4, and more.

Monthly Cost Calculator

Estimate your monthly LLM API spend. Set your average prompt size, response size, and daily call volume.

Quick Presets

Avg Input Tokens

per API call

Avg Output Tokens

per API call

Calls per Day

30,000/month

Currency

Cheapest / month

$5.19

GPT-OSS 120B

Most expensive / month

$18,000

o1 Pro

Monthly calls

30,000

2500 tokens/call

Model	Provider	Per Call	Daily	Monthly	Context
GPT-OSS 120B	OpenAI	$< 0.01	$0.173	$5.19	131K
Qwen3.5-9B	Alibaba	$< 0.01	$0.175	$5.25	262K
Qwen3.5-Omni Flash	Alibaba	$< 0.01	$0.260	$7.80	262K
Hunyuan HY3 Preview	Tencent	$< 0.01	$0.262	$7.86	262K
GPT-OSS 20B (Bedrock)	OpenAI	$< 0.01	$0.290	$8.70	16K
GPT-OSS 20B	OpenAI	$< 0.01	$0.300	$9.00	131K
Gemini 2.0 Flash-Lite	Google	$< 0.01	$0.300	$9.00	1.0M
GPT-5 Nano	OpenAI	$< 0.01	$0.300	$9.00	128K
Llama 4 Scout	Meta	$< 0.01	$0.310	$9.30	1.0M
GLM-4.7-flash	Zhipu	$< 0.01	$0.340	$10.20	200K
Devstral Small	Mistral	$< 0.01	$0.350	$10.50	256K
Mistral Small 3.2	Mistral	$< 0.01	$0.350	$10.50	128K
Ministral 3 8B	Mistral	$< 0.01	$0.375	$11.25	262K
Pixtral 12B	Mistral	$< 0.01	$0.375	$11.25	128K
GPT-4.1 Nano	OpenAI	$< 0.01	$0.400	$12.00	1.0M
Gemini 2.5 Flash-Lite	Google	$< 0.01	$0.400	$12.00	1.0M
Gemini 2.0 Flash	Google	$< 0.01	$0.400	$12.00	1.0M
DeepSeek V4-Flash	DeepSeek	$< 0.01	$0.420	$12.60	1.0M
Llama 3.3 70B	Meta	$< 0.01	$0.450	$13.50	131K
Gemma 4 26B A4B	Google	$< 0.01	$0.460	$13.80	262K
Gemma 4 31B	Google	$< 0.01	$0.480	$14.40	262K
Ministral 3 14B	Mistral	$< 0.01	$0.500	$15.00	262K
Qwen3-Next-80B-A3B-Thinking	Alibaba	$< 0.01	$0.585	$17.55	262K
GPT-OSS 120B (Bedrock)	OpenAI	$< 0.01	$0.600	$18.00	16K
GPT-4o Mini	OpenAI	$< 0.01	$0.600	$18.00	128K
Mistral Small 4	Mistral	$< 0.01	$0.600	$18.00	256K
Qwen3 Coder Next	Alibaba	$< 0.01	$0.620	$18.60	262K
Grok 4.1 Fast	xAI	$< 0.01	$0.650	$19.50	2.0M
Grok 4.1 Fast Reasoning	xAI	$< 0.01	$0.650	$19.50	2.0M
DeepSeek V3.2 (Chat)	DeepSeek	$< 0.01	$0.770	$23.10	128K
DeepSeek V3.2 (Reasoner)	DeepSeek	$< 0.01	$0.770	$23.10	128K
Grok 3 Mini	xAI	$< 0.01	$0.850	$25.50	131K
Mercury 2	Inception Labs	$< 0.01	$0.875	$26.25	128K
Llama 4 Maverick	Meta	$< 0.01	$0.965	$28.95	1.0M
Nemotron 3 Super 120B	NVIDIA	$< 0.01	$1.00	$30.00	1.0M
GPT-5.4 Nano	OpenAI	$< 0.01	$1.03	$30.75	400K
Codestral	Mistral	$< 0.01	$1.05	$31.50	256K
Grok Code Fast	xAI	$< 0.01	$1.15	$34.50	256K
MiniMax M2.7	MiniMax	$< 0.01	$1.20	$36.00	205K
MiniMax M2.5	MiniMax	$< 0.01	$1.20	$36.00	128K
Gemini 3.1 Flash-Lite	Google	$< 0.01	$1.25	$37.50	1.0M
DeepSeek V4-Pro	DeepSeek	$< 0.01	$1.30	$39.15	1.0M
Qwen3.6-Plus	Alibaba	$< 0.01	$1.38	$41.33	1.0M
GPT-5 Mini	OpenAI	$< 0.01	$1.50	$45.00	400K
GPT-4.1 Mini	OpenAI	$< 0.01	$1.60	$48.00	1.0M
Mistral Large 3	Mistral	$< 0.01	$1.75	$52.50	262K
Magistral Small	Mistral	$< 0.01	$1.75	$52.50	40K
Mistral Medium	Mistral	$< 0.01	$1.80	$54.00	131K
Devstral	Mistral	$< 0.01	$1.80	$54.00	256K
Qwen 3.5 27B	Alibaba	$< 0.01	$1.80	$54.00	128K
Gemini 2.5 Flash	Google	$< 0.01	$1.85	$55.50	1.0M
Nova 2.0 Lite	Amazon	$< 0.01	$1.85	$55.50	1.0M
DeepSeek R1	DeepSeek	$< 0.01	$2.19	$65.85	128K
GLM-4.7	Zhipu	$< 0.01	$2.30	$69.00	200K
MiniMax M3	MiniMax	$< 0.01	$2.40	$72.00	1.0M
Kimi K2 Thinking	Moonshot	$< 0.01	$2.45	$73.50	262K
Gemini 3 Flash	Google	$< 0.01	$2.50	$75.00	1.0M
Gemini 3 Flash Reasoning	Google	$< 0.01	$2.50	$75.00	1.0M
Kimi K2.5	Moonshot	$< 0.01	$2.70	$81.00	262K
QwQ-Plus	Alibaba	$< 0.01	$2.80	$84.00	131K
Grok Build 0.1	xAI	$< 0.01	$3.00	$90.00	256K
Qwen 3.5 397B	Alibaba	$< 0.01	$3.00	$90.00	128K
Grok 3 Mini Fast	xAI	$< 0.01	$3.20	$96.00	131K
Qwen3.5-Omni Plus	Alibaba	$< 0.01	$3.20	$96.00	262K
MiMo-V2-Pro	Xiaomi	$< 0.01	$3.50	$105	1.0M
Claude Haiku 3.5	Anthropic	$< 0.01	$3.60	$108	200K
GLM-5	Zhipu	$< 0.01	$3.60	$108	128K
Grok 4.3	xAI	$< 0.01	$3.75	$113	1.0M
GPT-5.4 Mini	OpenAI	$< 0.01	$3.75	$113	400K
Gemini 3.1 Flash Live	Google	$< 0.01	$3.75	$113	1.0M
Kimi K2.6	Moonshot	$< 0.01	$3.90	$117	262K
GLM-5 Turbo	Zhipu	$< 0.01	$4.40	$132	200K
o4 Mini	OpenAI	$< 0.01	$4.40	$132	200K
o3 Mini	OpenAI	$< 0.01	$4.40	$132	200K
Claude Haiku 4.5	Anthropic	$< 0.01	$4.50	$135	200K
Claude 4.5 Haiku Reasoning	Anthropic	$< 0.01	$4.50	$135	200K
GLM-5.1	Zhipu	$< 0.01	$5.00	$150	200K
Kimi K2 Thinking Turbo	Moonshot	$< 0.01	$6.30	$189	262K
Magistral Medium	Mistral	$< 0.01	$6.50	$195	40K
Qwen3.6-Max-Preview	Alibaba	$< 0.01	$6.50	$195	262K
Mistral Medium 3.5	Mistral	$< 0.01	$6.75	$203	256K
Grok 4.20	xAI	$< 0.01	$7.00	$210	2.0M
Pixtral Large	Mistral	$< 0.01	$7.00	$210	128K
GPT-5.1	OpenAI	$< 0.01	$7.50	$225	400K
GPT-5	OpenAI	$< 0.01	$7.50	$225	400K
GPT-5 Medium	OpenAI	$< 0.01	$7.50	$225	400K
Gemini 2.5 Pro	Google	$< 0.01	$7.50	$225	1.0M
Nova 2.0 Pro Reasoning	Amazon	$< 0.01	$7.50	$225	128K
Gemini 3.5 Flash	Google	$< 0.01	$7.50	$225	1.0M
GPT-4.1	OpenAI	$< 0.01	$8.00	$240	1.0M
o3	OpenAI	$< 0.01	$8.00	$240	200K
o4 Mini Deep Research	OpenAI	$< 0.01	$8.00	$240	200K
Qwen3.7 Max	Alibaba	$< 0.01	$8.75	$263	1.0M
Grok 2	xAI	$< 0.01	$9.00	$270	131K
GPT-4o	OpenAI	$0.010	$10.00	$300	128K
Gemini 3.1 Pro	Google	$0.010	$10.00	$300	1.0M
Gemini 3 Pro	Google	$0.010	$10.00	$300	1.0M
Command A+	Cohere	$0.010	$10.00	$300	128K
Command A	Cohere	$0.010	$10.00	$300	128K
GPT-5.2	OpenAI	$0.011	$10.50	$315	400K
GPT-5.3 Codex	OpenAI	$0.011	$10.50	$315	400K
Gemini 3.1 Flash TTS	Google	$0.012	$12.00	$360	32K
GPT-5.4	OpenAI	$0.013	$12.50	$375	1.1M
Claude Sonnet 4.6 Adaptive	Anthropic	$0.013	$13.50	$405	200K
Claude Sonnet 4.6	Anthropic	$0.013	$13.50	$405	200K
Claude Sonnet 4.5	Anthropic	$0.013	$13.50	$405	200K
Claude Sonnet 4	Anthropic	$0.013	$13.50	$405	200K
Claude 3.7 Sonnet	Anthropic	$0.013	$13.50	$405	200K
Grok 4	xAI	$0.013	$13.50	$405	2.0M
Grok 3	xAI	$0.013	$13.50	$405	131K
Sonar Pro	Perplexity	$0.013	$13.50	$405	128K
Claude Opus 4.8	Anthropic	$0.022	$22.50	$675	1.0M
Claude Opus 4.7	Anthropic	$0.022	$22.50	$675	1.0M
Claude Opus 4.6 Adaptive	Anthropic	$0.022	$22.50	$675	200K
Claude Opus 4.6	Anthropic	$0.022	$22.50	$675	200K
Claude Opus 4.5	Anthropic	$0.022	$22.50	$675	200K
Grok 3 Fast	xAI	$0.022	$22.50	$675	131K
GPT-5.5	OpenAI	$0.025	$25.00	$750	1.1M
MAI-Image-2	Microsoft	$0.027	$26.50	$795	32K
Voxtral TTS	Mistral	$0.032	$32.00	$960	128K
o3 Deep Research	OpenAI	$0.040	$40.00	$1,200	200K
o1	OpenAI	$0.060	$60.00	$1,800	200K
Claude Opus 4.1	Anthropic	$0.068	$67.50	$2,025	200K
o3-pro	OpenAI	$0.080	$80.00	$2,400	200K
Claude Mythos Preview	Anthropic	$0.113	$113	$3,375	1.0M
GPT-5.5 Pro	OpenAI	$0.150	$150	$4,500	1.1M
GPT-5.4 Pro	OpenAI	$0.150	$150	$4,500	1.1M
o1 Pro	OpenAI	$0.600	$600	$18,000	200K

Costs are estimates based on token counts and published API rates. Actual costs may vary with caching, batching, and rate tier discounts. Exchange rates are approximate.

How to Estimate Monthly LLM API Costs

1
Set your usage parameters
Enter your average prompt size, response length, and daily call volume. Pick a preset like Chatbot or RAG to auto-fill typical values.
2
Review monthly estimates
The calculator shows estimated monthly costs for every model, sorted from cheapest to most expensive.
3
Filter and compare
Narrow results by provider, switch currencies, and find the model that fits your budget and performance needs.

Why Use This Cost Calculator

Built-in presets for Chatbot, RAG, Code Generation, and Summarizer workloads
60+ models compared side by side with real pricing data from each provider
Multi-currency support — see costs in USD, EUR, GBP, INR, or JPY
Adjustable prompt size, response size, and daily call volume with instant recalculation
Provider filtering to focus on the models you're actually considering

Common Use Cases

Startup budgeting

Estimate monthly API costs before you build. Compare providers to find the best price-to-performance ratio for your use case.

Scaling projections

Model how costs grow as your daily call volume increases. Identify where you'd hit budget limits.

Provider comparison

Compare the total monthly cost of running the same workload on GPT-5 vs Claude vs Gemini.

Use case optimization

See how switching from long prompts to shorter ones, or vice versa, affects your bill across models.

Related Tools

Token Counter

Count tokens and see per-request costs for any text.

API Pricing

Full pricing table for all 60+ models.

Compare Models

Side-by-side model pricing and specs.

LLM Leaderboard

Rankings by quality, speed, and value.

Frequently Asked Questions

Common questions about estimating LLM API costs