How is the monthly cost calculated?

The calculator multiplies your daily requests by 30 to get monthly requests. Each request cost equals (input tokens × input price + output tokens × output price) ÷ 1,000,000. With batch pricing enabled, the effective price is halved where providers offer batch discounts.

Which providers are included?

The calculator includes models from OpenAI, Anthropic, Google, Meta, DeepSeek, Mistral, xAI, Cohere, AI21, Amazon, Microsoft, and dozens of other providers available in the VePrompts pricing dataset.

Is batch pricing always 50% off?

The toggle applies a representative 50% batch discount to every model for comparison purposes. Actual provider batch discounts vary; verify the exact rate in your provider dashboard before budgeting.

Can I export the results?

Yes. Click Export CSV to download a spreadsheet of the comparison table, or Save as PDF to print a clean summary through your browser.

Free LLM API Cost Calculator — Monthly Spend Estimator

Monthly cost comparison

Show all 328 models

Rank	Model	Provider	Input / 1M	Output / 1M	Context	Monthly cost
1	Auto Router	Openrouter	$-1000000.0000/1M	$-1000000.0000/1M	2000k	$-75000000.0000
2	Body Builder (beta)	Openrouter	$-1000000.0000/1M	$-1000000.0000/1M	128k	$-75000000.0000
3	OpenRouter: Fusion	Openrouter	$-1000000.0000/1M	$-1000000.0000/1M	128k	$-75000000.0000
4	Pareto Code Router	Openrouter	$-1000000.0000/1M	$-1000000.0000/1M	2000k	$-75000000.0000
5	Venice: Uncensored (free)	Cognitivecomputations	Free/1M	Free/1M	33k	Free
6	Google: Gemma 4 26B A4B (free)	Google	Free/1M	Free/1M	262k	Free
7	Google: Gemma 4 31B (free)	Google	Free/1M	Free/1M	262k	Free
8	Google: Lyria 3 Clip Preview	Google	Free/1M	Free/1M	1049k	Free
9	Google: Lyria 3 Pro Preview	Google	Free/1M	Free/1M	1049k	Free
10	LiquidAI: LFM2.5-1.2B-Instruct (free)	Liquid	Free/1M	Free/1M	33k	Free
11	LiquidAI: LFM2.5-1.2B-Thinking (free)	Liquid	Free/1M	Free/1M	33k	Free
12	Meta: Llama 3.2 3B Instruct (free)	Meta	Free/1M	Free/1M	131k	Free
13	Meta: Llama 3.3 70B Instruct (free)	Meta	Free/1M	Free/1M	131k	Free
14	Nex AGI: Nex-N2-Pro (free)	Nex Agi	Free/1M	Free/1M	262k	Free
15	Nous: Hermes 3 405B Instruct (free)	Nousresearch	Free/1M	Free/1M	131k	Free
16	NVIDIA: Nemotron 3 Nano 30B A3B (free)	NVIDIA	Free/1M	Free/1M	256k	Free
17	NVIDIA: Nemotron 3 Nano Omni (free)	NVIDIA	Free/1M	Free/1M	256k	Free
18	NVIDIA: Nemotron 3 Super (free)	NVIDIA	Free/1M	Free/1M	1000k	Free
19	NVIDIA: Nemotron 3 Ultra (free)	NVIDIA	Free/1M	Free/1M	1000k	Free
20	NVIDIA: Nemotron 3.5 Content Safety (free)	NVIDIA	Free/1M	Free/1M	128k	Free
21	NVIDIA: Nemotron Nano 12B 2 VL (free)	NVIDIA	Free/1M	Free/1M	128k	Free
22	NVIDIA: Nemotron Nano 9B V2 (free)	NVIDIA	Free/1M	Free/1M	128k	Free
23	OpenAI: gpt-oss-120b (free)	OpenAI	Free/1M	Free/1M	131k	Free
24	OpenAI: gpt-oss-20b (free)	OpenAI	Free/1M	Free/1M	131k	Free
25	Free Models Router	Openrouter	Free/1M	Free/1M	200k	Free

Rank

Model

Provider

Input / 1M

Output / 1M

Context

Monthly cost

Auto Router

Openrouter

$-1000000.0000/1M

2000k

$-75000000.0000

Body Builder (beta)

Openrouter

$-1000000.0000/1M

128k

$-75000000.0000

OpenRouter: Fusion

Openrouter

$-1000000.0000/1M

128k

$-75000000.0000

Pareto Code Router

Openrouter

$-1000000.0000/1M

2000k

$-75000000.0000

Venice: Uncensored (free)

Cognitivecomputations

Free/1M

33k

Free

Google: Gemma 4 26B A4B (free)

Google

Free/1M

262k

Free

Google: Gemma 4 31B (free)

Google

Free/1M

262k

Free

Google: Lyria 3 Clip Preview

Google

Free/1M

1049k

Free

Google: Lyria 3 Pro Preview

Google

Free/1M

1049k

Free

LiquidAI: LFM2.5-1.2B-Instruct (free)

Liquid

Free/1M

33k

Free

LiquidAI: LFM2.5-1.2B-Thinking (free)

Liquid

Free/1M

33k

Free

Meta: Llama 3.2 3B Instruct (free)

How to estimate LLM API costs

Cost drivers

API bills are driven by three numbers: requests per day, input tokens per request, and output tokens per request. Input tokens are everything you send to the model; output tokens are everything it generates. Output pricing is usually several times higher than input pricing.

Batch discounts

Many providers offer cheaper rates for batch or cached workloads. Use the batch toggle to see a 50% discount scenario. Real discounts vary by provider, so treat the result as a planning estimate rather than a guaranteed invoice.

Why compare providers?

A model with the same capability can cost 10–50× more depending on the provider and API tier. This calculator surfaces the cheapest option for your exact token profile, helping you avoid overpaying for inference.

Export and share

Export the full comparison as CSV for spreadsheets, or print to PDF for reports and presentations. The calculation stays in your browser — your inputs are never sent to a server.

LLM API Cost Calculator

LLM API Cost Calculator — VePrompts

Estimated monthly cost

Cheapest models for your workload

Monthly cost comparison

How to estimate LLM API costs

Cost drivers

Batch discounts

Why compare providers?

Export and share