Nemotron 3 Ultra 550B
Last verified: June 7, 2026$0.500/1M input · $2.50/1M output · 1.0M context · NVIDIA
Count Tokens
—
Tokens
—
Input Cost
—
Output Cost
Estimate Monthly Cost
Monthly Cost Estimator
Quick:
< $0.0001/mo
Pick a preset above or enter custom usage
Alternatives to Nemotron 3 Ultra 550B
Pricing Details
NVIDIALargest open NVIDIA model. Hybrid Mamba-Transformer with latent MoE, 550B total / 55B active, NVFP4 pre-training. No single rate (open weights): $0.50/$2.50 on OpenRouter and DeepInfra, as low as $0.37/$1.08 at launch, $0.60/$3.60 on Together. Free rate-limited endpoint on OpenRouter. AA Intelligence Index 48, top US open-weights model but behind Kimi K2.6 (54). MMLU-Pro 86.8, GPQA 87.0, LiveCodeBench v6 89.0, SWE-Bench Verified 71.9, RULER@1M 94.7. OpenMDW 1.1 license.
Input / 1M tokens
$0.5
Output / 1M tokens
$2.5
Context Window
1.0M
Max Output
33K
Price History
Launched at current price on 2026-06-04. No price changes recorded.
Frequently Asked Questions
Common questions about Nemotron 3 Ultra 550B pricing and usage