LLM API Cost Calculator
Estimate your monthly LLM API spend and compare costs across 300+ models from 50+ providers. No signup required.
Last updated: 2026-06-13
LLM API Cost Calculator — VePrompts
Daily requests: 1,000 · Input tokens: 2,000 · Output tokens: 500
GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...
Estimated monthly cost
OpenAI: GPT-4o · 30,000 requests / month
Rank #286 of 328 models
Cheapest models for your workload
Monthly cost comparison
| Rank | Model | Provider | Input / 1M | Output / 1M | Context | Monthly cost |
|---|---|---|---|---|---|---|
| 1 | Auto Router | OpenRouter | Variable | Variable | 2000k | Varies by routed model |
| 2 | Body Builder (beta) | OpenRouter | Variable | Variable | 128k | Varies by routed model |
| 3 | OpenRouter: Fusion | OpenRouter | Variable | Variable | 128k | Varies by routed model |
| 4 | Pareto Code Router | OpenRouter | Variable | Variable | 2000k | Varies by routed model |
| 5 | Venice: Uncensored (free) | Cognitive Computations | Free/1M | Free/1M | 33k | Free |
| 6 | Google: Gemma 4 26B A4B (free) | Google | Free/1M | Free/1M | 262k | Free |
| 7 | Google: Gemma 4 31B (free) | Google | Free/1M | Free/1M | 262k | Free |
| 8 | Google: Lyria 3 Clip Preview | Google | Free/1M | Free/1M | 1049k | Free |
| 9 | Google: Lyria 3 Pro Preview | Google | Free/1M | Free/1M | 1049k | Free |
| 10 | LiquidAI: LFM2.5-1.2B-Instruct (free) | Liquid | Free/1M | Free/1M | 33k | Free |
| 11 | LiquidAI: LFM2.5-1.2B-Thinking (free) | Liquid | Free/1M | Free/1M | 33k | Free |
| 12 | Meta: Llama 3.2 3B Instruct (free) | Meta | Free/1M | Free/1M | 131k | Free |
| 13 | Meta: Llama 3.3 70B Instruct (free) | Meta | Free/1M | Free/1M | 131k | Free |
| 14 | Nex AGI: Nex-N2-Pro (free) | Nex AGI | Free/1M | Free/1M | 262k | Free |
| 15 | Nous: Hermes 3 405B Instruct (free) | Nous Research | Free/1M | Free/1M | 131k | Free |
| 16 | NVIDIA: Nemotron 3 Nano 30B A3B (free) | NVIDIA | Free/1M | Free/1M | 256k | Free |
| 17 | NVIDIA: Nemotron 3 Nano Omni (free) | NVIDIA | Free/1M | Free/1M | 256k | Free |
| 18 | NVIDIA: Nemotron 3 Super (free) | NVIDIA | Free/1M | Free/1M | 1000k | Free |
| 19 | NVIDIA: Nemotron 3 Ultra (free) | NVIDIA | Free/1M | Free/1M | 1000k | Free |
| 20 | NVIDIA: Nemotron 3.5 Content Safety (free) | NVIDIA | Free/1M | Free/1M | 128k | Free |
| 21 | NVIDIA: Nemotron Nano 12B 2 VL (free) | NVIDIA | Free/1M | Free/1M | 128k | Free |
| 22 | NVIDIA: Nemotron Nano 9B V2 (free) | NVIDIA | Free/1M | Free/1M | 128k | Free |
| 23 | OpenAI: gpt-oss-120b (free) | OpenAI | Free/1M | Free/1M | 131k | Free |
| 24 | OpenAI: gpt-oss-20b (free) | OpenAI | Free/1M | Free/1M | 131k | Free |
| 25 | Free Models Router | OpenRouter | Free/1M | Free/1M | 200k | Free |
How to estimate LLM API costs
Cost drivers
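API bills are driven by three usage numbers: requests per day, input tokens per request, and output tokens per request. Each token count is multiplied by the model's price, which providers quote per 1 million tokens. The calculator assumes a 30-day month, so 1,000 daily requests become 30,000 requests per month. Input tokens are everything you send to the model; output tokens are everything it generates. Output pricing is usually several times higher than input pricing.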
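To make the arithmetic concrete, here is a minimal sketch of the monthly estimate using the default workload shown on this page (1,000 requests per day, 2,000 input tokens, 500 output tokens). The function name `monthly_cost` and the example prices of $2.50 and $10.00 per 1M tokens are illustrative assumptions, not figures taken from this page; substitute your provider's current rates.

```python
def monthly_cost(daily_requests, input_tokens, output_tokens,
                 input_price_per_m, output_price_per_m, days_per_month=30):
    """Estimate monthly spend in USD from per-request token counts.

    Prices are USD per 1 million tokens. The 30-day month matches the
    "30,000 requests / month" shown for 1,000 daily requests.
    """
    requests = daily_requests * days_per_month
    input_cost = requests * input_tokens * input_price_per_m / 1_000_000
    output_cost = requests * output_tokens * output_price_per_m / 1_000_000
    return input_cost + output_cost


# Illustrative prices only (assumed, not read from the calculator).
cost = monthly_cost(1_000, 2_000, 500,
                    input_price_per_m=2.50, output_price_per_m=10.00)
print(f"Estimated monthly cost: ${cost:,.2f}")
```

With these assumed prices, the 500 output tokens cost as much as the 2,000 input tokens, which is why trimming response length often saves more than trimming prompts.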
Batch discounts
Many providers offer cheaper rates for batch or cached workloads. Use the batch toggle to see a 50% discount scenario. Real discounts vary by provider, so treat the result as a planning estimate rather than a guaranteed invoice.
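The 50% scenario can be sketched as a flat multiplier on the undiscounted estimate. This is only an approximation under a stated assumption: the helper name `apply_batch_discount` is hypothetical, and real providers may discount input and output tokens differently or only discount cached portions of a prompt.

```python
def apply_batch_discount(monthly_cost_usd, discount=0.50):
    """Return the monthly cost after a flat fractional discount.

    discount=0.50 mirrors the 50% planning scenario; it is an assumption,
    not a guaranteed provider rate.
    """
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    return monthly_cost_usd * (1.0 - discount)


baseline = 300.00  # example undiscounted estimate in USD (assumed value)
print(f"Batch scenario: ${apply_batch_discount(baseline):,.2f}")
```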
Why compare providers?
Models of comparable capability can differ in cost by 10–50× depending on the provider and API tier. This calculator ranks every model by monthly cost for your exact token profile, so you can spot the cheapest option and avoid overpaying for inference.
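A ranking like the one in the comparison table can be sketched by computing the monthly cost for each model and sorting. The model names and prices below are made up for illustration, and `ModelPrice` and `rank_by_cost` are hypothetical names. The sketch also assumes that a negative price is a placeholder for variable, router-style pricing rather than a real rate, so those entries are left out of the ranking.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    name: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens


def monthly_cost(model, requests, input_tokens, output_tokens):
    """Monthly cost in USD for a fixed per-request token profile."""
    per_request = (input_tokens * model.input_per_m
                   + output_tokens * model.output_per_m) / 1_000_000
    return requests * per_request


def rank_by_cost(models, requests, input_tokens, output_tokens):
    """Return (name, cost) pairs sorted from cheapest to most expensive."""
    # Assumption: negative prices mark variable pricing and cannot be ranked.
    priced = [m for m in models if m.input_per_m >= 0 and m.output_per_m >= 0]
    costed = [(m.name, monthly_cost(m, requests, input_tokens, output_tokens))
              for m in priced]
    return sorted(costed, key=lambda pair: pair[1])


# Hypothetical catalog; none of these prices come from the page.
catalog = [
    ModelPrice("example-large", 2.50, 10.00),
    ModelPrice("example-small", 0.10, 0.40),
    ModelPrice("example-free", 0.0, 0.0),
    ModelPrice("example-router", -1_000_000.0, -1_000_000.0),
]

ranking = rank_by_cost(catalog, requests=30_000, input_tokens=2_000, output_tokens=500)
for position, (name, cost) in enumerate(ranking, start=1):
    print(f"{position}. {name}: ${cost:,.2f}")
```

Filtering placeholder prices before sorting keeps variable-priced routers from appearing as the cheapest options.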
Export and share
Export the full comparison as CSV for spreadsheets, or print to PDF for reports and presentations. The calculation runs entirely in your browser, and your inputs are never sent to a server.
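If you want to rebuild a similar CSV from your own estimates, the sketch below writes rows with the same column headers as the comparison table. It is not the calculator's own export code; `comparison_to_csv` is a hypothetical helper, and the sample row is invented for illustration.

```python
import csv
import io

HEADERS = ["Rank", "Model", "Provider", "Input / 1M", "Output / 1M",
           "Context", "Monthly cost"]


def comparison_to_csv(rows):
    """Serialize comparison rows (dicts keyed by HEADERS[1:]) to CSV text.

    Rank is assigned from the order of `rows`, so sort before calling.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(HEADERS)
    for rank, row in enumerate(rows, start=1):
        writer.writerow([rank] + [row[column] for column in HEADERS[1:]])
    return buffer.getvalue()


# Invented sample row; replace with your own computed estimates.
sample_rows = [
    {"Model": "example-small", "Provider": "ExampleCo",
     "Input / 1M": "0.10", "Output / 1M": "0.40",
     "Context": "128k", "Monthly cost": "12.00"},
]
csv_text = comparison_to_csv(sample_rows)
print(csv_text)
```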