VePrompts

LLM API Cost Calculator

Estimate your monthly LLM API spend and compare costs across 300+ models from 50+ providers. No signup required.

Last updated: 2026-06-13


GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

Estimated monthly cost

$300.00

OpenAI: GPT-4o · 30,000 requests / month

Rank #286 of 328 models

1,000 requests / day × 30 days = 30,000 requests / month
30,000 requests at 2K input + 500 output tokens per request = $300.00 / month

Cheapest models for your workload

| Model | Monthly cost |
| --- | --- |
| Auto Router | Variable |
| Body Builder (beta) | Variable |
| OpenRouter: Fusion | Variable |
| Pareto Code Router | Variable |
| Venice: Uncensored (free) | Free |
| Google: Gemma 4 26B A4B (free) | Free |
| Google: Gemma 4 31B (free) | Free |
| Google: Lyria 3 Clip Preview | Free |
| Google: Lyria 3 Pro Preview | Free |
| LiquidAI: LFM2.5-1.2B-Instruct (free) | Free |
| LiquidAI: LFM2.5-1.2B-Thinking (free) | Free |
| Meta: Llama 3.2 3B Instruct (free) | Free |
| OpenAI: GPT-4o (selected) | $300.00 |

Bars are scaled to the top 50 models. Selected model is highlighted in blue.

Monthly cost comparison

| Rank | Model | Provider | Input / 1M tokens | Output / 1M tokens | Context (tokens) | Monthly cost |
| --- | --- | --- | --- | --- | --- | --- |
| 1 | Auto Router | OpenRouter | Variable | Variable | 2000k | Variable |
| 2 | Body Builder (beta) | OpenRouter | Variable | Variable | 128k | Variable |
| 3 | OpenRouter: Fusion | OpenRouter | Variable | Variable | 128k | Variable |
| 4 | Pareto Code Router | OpenRouter | Variable | Variable | 2000k | Variable |
| 5 | Venice: Uncensored (free) | Cognitive Computations | Free | Free | 33k | Free |
| 6 | Google: Gemma 4 26B A4B (free) | Google | Free | Free | 262k | Free |
| 7 | Google: Gemma 4 31B (free) | Google | Free | Free | 262k | Free |
| 8 | Google: Lyria 3 Clip Preview | Google | Free | Free | 1049k | Free |
| 9 | Google: Lyria 3 Pro Preview | Google | Free | Free | 1049k | Free |
| 10 | LiquidAI: LFM2.5-1.2B-Instruct (free) | Liquid | Free | Free | 33k | Free |
| 11 | LiquidAI: LFM2.5-1.2B-Thinking (free) | Liquid | Free | Free | 33k | Free |
| 12 | Meta: Llama 3.2 3B Instruct (free) | Meta | Free | Free | 131k | Free |
| 13 | Meta: Llama 3.3 70B Instruct (free) | Meta | Free | Free | 131k | Free |
| 14 | Nex AGI: Nex-N2-Pro (free) | Nex AGI | Free | Free | 262k | Free |
| 15 | Nous: Hermes 3 405B Instruct (free) | Nous Research | Free | Free | 131k | Free |
| 16 | NVIDIA: Nemotron 3 Nano 30B A3B (free) | NVIDIA | Free | Free | 256k | Free |
| 17 | NVIDIA: Nemotron 3 Nano Omni (free) | NVIDIA | Free | Free | 256k | Free |
| 18 | NVIDIA: Nemotron 3 Super (free) | NVIDIA | Free | Free | 1000k | Free |
| 19 | NVIDIA: Nemotron 3 Ultra (free) | NVIDIA | Free | Free | 1000k | Free |
| 20 | NVIDIA: Nemotron 3.5 Content Safety (free) | NVIDIA | Free | Free | 128k | Free |
| 21 | NVIDIA: Nemotron Nano 12B 2 VL (free) | NVIDIA | Free | Free | 128k | Free |
| 22 | NVIDIA: Nemotron Nano 9B V2 (free) | NVIDIA | Free | Free | 128k | Free |
| 23 | OpenAI: gpt-oss-120b (free) | OpenAI | Free | Free | 131k | Free |
| 24 | OpenAI: gpt-oss-20b (free) | OpenAI | Free | Free | 131k | Free |
| 25 | Free Models Router | OpenRouter | Free | Free | 200k | Free |

How to estimate LLM API costs

Cost drivers

API bills are driven by three numbers: requests per day, input tokens per request, and output tokens per request. Input tokens are everything you send to the model; output tokens are everything it generates. Output pricing is usually several times higher than input pricing.
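As a rough illustration of how those three numbers combine, here is a minimal Python sketch. The function name `monthly_cost` and its parameters are hypothetical, and the rates of $2.50 and $10.00 per million tokens are assumptions chosen so the result matches the $300.00 GPT-4o estimate shown earlier; check the provider's current price list before relying on them.

```python
def monthly_cost(requests_per_day, input_tokens, output_tokens,
                 input_price_per_m, output_price_per_m, days=30):
    """Return an estimated monthly spend in dollars.

    Prices are expressed per one million tokens (assumed convention).
    """
    requests = requests_per_day * days
    input_cost = requests * input_tokens / 1_000_000 * input_price_per_m
    output_cost = requests * output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# 1,000 requests/day, 2K input and 500 output tokens per request.
# The per-million rates below are assumed, not taken from the page.
estimate = monthly_cost(1_000, 2_000, 500, 2.50, 10.00)
print(f"${estimate:,.2f} / month")
```

In this profile the input and output halves each contribute about the same amount, even though there are four times as many input tokens, because the assumed output rate is four times the input rate.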

Batch discounts

Many providers offer cheaper rates for batch or cached workloads. Use the batch toggle to see a 50% discount scenario. Real discounts vary by provider, so treat the result as a planning estimate rather than a guaranteed invoice.
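The discount scenario can be sketched in a few lines of Python. This is a planning aid under stated assumptions, not how any provider bills: the function name `apply_batch_discount` is hypothetical, the 50% default mirrors the scenario described above, and the `batch_share` parameter (the fraction of traffic that can actually run as batch) is an added assumption.

```python
def apply_batch_discount(monthly_cost, discount=0.5, batch_share=1.0):
    """Reduce a monthly estimate by a batch discount.

    discount:    fractional price cut on batch traffic (0.5 = 50%, assumed).
    batch_share: fraction of the workload eligible for batch (assumed).
    """
    if not 0.0 <= discount <= 1.0:
        raise ValueError("discount must be between 0 and 1")
    if not 0.0 <= batch_share <= 1.0:
        raise ValueError("batch_share must be between 0 and 1")
    return monthly_cost * (1 - batch_share * discount)


all_batch = apply_batch_discount(300.0)                     # whole workload batched
partly_batch = apply_batch_discount(300.0, batch_share=0.4)  # 40% of traffic batched
print(f"${all_batch:,.2f} vs ${partly_batch:,.2f}")
```

Treating the eligible share separately makes it easier to see how much of the headline discount survives when part of the workload needs real-time responses.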

Why compare providers?

Models of comparable capability can differ in price by 10–50× depending on the provider and API tier. This calculator surfaces the cheapest option for your exact token profile, helping you avoid overpaying for inference.
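A comparison like this amounts to computing the same token profile against every price pair and sorting. The Python sketch below shows one way to do it; the `Model` type, the `rank_by_monthly_cost` function, and every catalog entry are hypothetical placeholders rather than real listings. Entries without a fixed per-token price are stored as `None` (an assumed convention) and left out of the ranking.

```python
from typing import NamedTuple, Optional


class Model(NamedTuple):
    name: str
    input_price_per_m: Optional[float]   # dollars per 1M input tokens
    output_price_per_m: Optional[float]  # dollars per 1M output tokens


def rank_by_monthly_cost(models, requests_per_month, input_tokens, output_tokens):
    """Return (name, monthly cost) pairs, cheapest first."""
    ranked = []
    for m in models:
        # Skip entries with no fixed price instead of ranking them.
        if m.input_price_per_m is None or m.output_price_per_m is None:
            continue
        per_request = (input_tokens * m.input_price_per_m
                       + output_tokens * m.output_price_per_m) / 1_000_000
        ranked.append((m.name, requests_per_month * per_request))
    return sorted(ranked, key=lambda pair: pair[1])


# Placeholder catalog; names and prices are illustrative only.
catalog = [
    Model("example-large", 2.50, 10.00),
    Model("example-small", 0.10, 0.40),
    Model("example-free", 0.0, 0.0),
    Model("example-router", None, None),
]

ranking = rank_by_monthly_cost(catalog, 30_000, 2_000, 500)
for rank, (name, cost) in enumerate(ranking, start=1):
    print(f"{rank}. {name}: ${cost:,.2f}")
```

Because the ranking depends on the ratio of input to output tokens, the order can change when the profile changes, which is why the comparison is worth rerunning per workload.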

Export and share

Export the full comparison as CSV for spreadsheets, or print to PDF for reports and presentations. The calculation stays in your browser; your inputs are never sent to a server.
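For readers who want to reproduce a similar export offline, here is a minimal Python sketch using the standard `csv` module. It is not the calculator's own export code; the function name `comparison_to_csv`, the column headers, and the sample rows are all assumptions.

```python
import csv
import io


def comparison_to_csv(rows):
    """Serialize (model name, monthly cost) pairs to CSV text.

    Rows are assumed to be sorted cheapest first already.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["Rank", "Model", "Monthly cost (USD)"])
    for rank, (name, cost) in enumerate(rows, start=1):
        writer.writerow([rank, name, f"{cost:.2f}"])
    return buffer.getvalue()


# Placeholder rows; replace with your own comparison results.
csv_text = comparison_to_csv([("example-free", 0.0), ("example-large", 300.0)])
print(csv_text)
```

Writing to an in-memory buffer keeps the data local until you choose where to save it, in the same spirit as a calculation that never leaves the browser.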