What is the cheapest LLM API?

The cheapest paid LLM API changes as providers adjust pricing. We update our rankings daily. Free models are excluded from this list.

Are free LLM APIs included?

Free models are shown on the main pricing page but excluded from the cheapest ranking to focus on sustainable, production-ready options.

Cheapest LLM APIs

The 20 lowest-cost paid LLM APIs, ranked by input price in USD per 1M tokens. Updated daily.

Top 20 Cheapest Models

Model	Provider	Input / 1M tokens	Output / 1M tokens	Context (tokens)	Capabilities
inclusionAI: Ling-2.6-flash	inclusionAI	$0.01	$0.03	262K	Streaming
IBM: Granite 4.0 Micro	IBM Granite	$0.02	$0.11	131K	Streaming
Meta: Llama 3.1 8B Instruct	Meta	$0.02	$0.03	131K	Streaming
Mistral: Mistral Nemo	Mistral AI	$0.02	$0.03	131K	Streaming
Meta: Llama 3.2 1B Instruct	Meta	$0.03	$0.20	131K	Streaming
OpenAI: gpt-oss-20b	OpenAI	$0.03	$0.14	131K	Streaming
LiquidAI: LFM2-24B-A2B	Liquid	$0.03	$0.12	128K	Streaming
Amazon: Nova Micro 1.0	Amazon	$0.04	$0.14	128K	Streaming
Cohere: Command R7B (12-2024)	Cohere	$0.04	$0.15	128K	Streaming
OpenAI: gpt-oss-120b	OpenAI	$0.04	$0.18	131K	Streaming
Qwen: Qwen2.5 7B Instruct	Qwen	$0.04	$0.10	131K	Streaming
Sao10K: Llama 3 8B Lunaris	Sao10K	$0.04	$0.05	8K	Streaming
Arcee AI: Trinity Mini	Arcee AI	$0.04	$0.15	131K	Streaming
Qwen: Qwen3 30B A3B Instruct 2507	Qwen	$0.05	$0.19	131K	Streaming
Google: Gemma 3 12B	Google	$0.05	$0.15	131K	Multimodal, Vision, Streaming
Google: Gemma 3 4B	Google	$0.05	$0.10	131K	Multimodal, Vision, Streaming
IBM: Granite 4.1 8B	IBM Granite	$0.05	$0.10	131K	Streaming
Mistral: Mistral Small 3	Mistral AI	$0.05	$0.08	33K	Streaming
NVIDIA: Nemotron 3 Nano 30B A3B	NVIDIA	$0.05	$0.20	262K	Streaming
OpenAI: GPT-5 Nano	OpenAI	$0.05	$0.40	400K	Multimodal, Vision, Streaming
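
Because this list is ranked by input price alone, a model with a low input price can still cost more once output tokens are counted. The sketch below is a minimal illustration, not part of the site's own tooling: it estimates the cost of a workload from the per-1M-token prices listed above and re-ranks three of the models by that total. The names `ModelPrice`, `request_cost`, and `rank_by_workload` are hypothetical, and the token counts are example values chosen for illustration.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass(frozen=True)
class ModelPrice:
    """Prices in USD per 1M tokens, copied from the ranking above."""
    name: str
    input_per_million: float
    output_per_million: float


def request_cost(model: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of a workload with the given token counts."""
    total = (input_tokens * model.input_per_million
             + output_tokens * model.output_per_million)
    return total / TOKENS_PER_MILLION


def rank_by_workload(models, input_tokens: int, output_tokens: int):
    """Sort models from cheapest to most expensive for this workload."""
    return sorted(models, key=lambda m: request_cost(m, input_tokens, output_tokens))


# Three rows taken from the table above (hypothetical selection).
MODELS = [
    ModelPrice("Meta: Llama 3.2 1B Instruct", 0.03, 0.20),
    ModelPrice("Qwen: Qwen2.5 7B Instruct", 0.04, 0.10),
    ModelPrice("Mistral: Mistral Small 3", 0.05, 0.08),
]

if __name__ == "__main__":
    # Example workload: equal input and output volume (an assumption).
    for model in rank_by_workload(MODELS, 1_000_000, 1_000_000):
        cost = request_cost(model, 1_000_000, 1_000_000)
        print(f"{model.name}: ${cost:.2f}")
```

With equal input and output volume, the order of these three models is the reverse of their input-price order, so it is worth estimating cost against your own input-to-output ratio before choosing a model.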

Cheapest Text Models

Model	Provider	Input / 1M tokens	Output / 1M tokens	Context (tokens)	Capabilities
inclusionAI: Ling-2.6-flash	inclusionAI	$0.01	$0.03	262K	Streaming
IBM: Granite 4.0 Micro	IBM Granite	$0.02	$0.11	131K	Streaming
Meta: Llama 3.1 8B Instruct	Meta	$0.02	$0.03	131K	Streaming
Mistral: Mistral Nemo	Mistral AI	$0.02	$0.03	131K	Streaming
Meta: Llama 3.2 1B Instruct	Meta	$0.03	$0.20	131K	Streaming
OpenAI: gpt-oss-20b	OpenAI	$0.03	$0.14	131K	Streaming
LiquidAI: LFM2-24B-A2B	Liquid	$0.03	$0.12	128K	Streaming
Amazon: Nova Micro 1.0	Amazon	$0.04	$0.14	128K	Streaming
Cohere: Command R7B (12-2024)	Cohere	$0.04	$0.15	128K	Streaming
OpenAI: gpt-oss-120b	OpenAI	$0.04	$0.18	131K	Streaming

Cheapest Multimodal Models

Model	Provider	Input / 1M tokens	Output / 1M tokens	Context (tokens)	Capabilities
Google: Gemma 3 12B	Google	$0.05	$0.15	131K	Multimodal, Vision, Streaming
Google: Gemma 3 4B	Google	$0.05	$0.10	131K	Multimodal, Vision, Streaming
OpenAI: GPT-5 Nano	OpenAI	$0.05	$0.40	400K	Multimodal, Vision, Streaming
Amazon: Nova Lite 1.0	Amazon	$0.06	$0.24	300K	Multimodal, Vision, Streaming
Google: Gemma 4 26B A4B	Google	$0.06	$0.33	262K	Multimodal, Vision, Streaming
Qwen: Qwen3.5-Flash	Qwen	$0.07	$0.26	1.0M	Multimodal, Vision, Streaming
ByteDance Seed: Seed 1.6 Flash	ByteDance Seed	$0.07	$0.30	262K	Multimodal, Vision, Streaming
Mistral: Mistral Small 3.2 24B	Mistral AI	$0.07	$0.20	128K	Multimodal, Vision, Streaming
Google: Gemma 3 27B	Google	$0.08	$0.16	131K	Multimodal, Vision, Streaming
Qwen: Qwen3 VL 8B Instruct	Qwen	$0.08	$0.50	256K	Multimodal, Vision, Streaming