Cheapest LLM APIs
The 20 lowest-cost LLM APIs, ranked by input price in USD per 1M tokens. Updated daily.
Top 20 Cheapest Models
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context (tokens) | Vision | Functions |
|---|---|---|---|---|---|---|
| inclusionAI: Ling-2.6-flash | inclusionAI | $0.01 | $0.03 | 262K | | |
| IBM: Granite 4.0 Micro | IBM Granite | $0.02 | $0.11 | 131K | | |
| Meta: Llama 3.1 8B Instruct | Meta | $0.02 | $0.03 | 131K | | |
| Mistral: Mistral Nemo | Mistral AI | $0.02 | $0.03 | 131K | | |
| Meta: Llama 3.2 1B Instruct | Meta | $0.03 | $0.20 | 131K | | |
| OpenAI: gpt-oss-20b | OpenAI | $0.03 | $0.14 | 131K | | |
| LiquidAI: LFM2-24B-A2B | Liquid | $0.03 | $0.12 | 128K | | |
| Amazon: Nova Micro 1.0 | Amazon | $0.04 | $0.14 | 128K | | |
| Cohere: Command R7B (12-2024) | Cohere | $0.04 | $0.15 | 128K | | |
| OpenAI: gpt-oss-120b | OpenAI | $0.04 | $0.18 | 131K | | |
| Qwen: Qwen2.5 7B Instruct | Qwen | $0.04 | $0.10 | 131K | | |
| Sao10K: Llama 3 8B Lunaris | Sao10K | $0.04 | $0.05 | 8K | | |
| Arcee AI: Trinity Mini | Arcee AI | $0.04 | $0.15 | 131K | | |
| Qwen: Qwen3 30B A3B Instruct 2507 | Qwen | $0.05 | $0.19 | 131K | | |
| Google: Gemma 3 12B | Google | $0.05 | $0.15 | 131K | | |
| Google: Gemma 3 4B | Google | $0.05 | $0.10 | 131K | | |
| IBM: Granite 4.1 8B | IBM Granite | $0.05 | $0.10 | 131K | | |
| Mistral: Mistral Small 3 | Mistral AI | $0.05 | $0.08 | 33K | | |
| NVIDIA: Nemotron 3 Nano 30B A3B | NVIDIA | $0.05 | $0.20 | 262K | | |
| OpenAI: GPT-5 Nano | OpenAI | $0.05 | $0.40 | 400K | | |
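
Because the ranking uses input price only, the order can change once output tokens are counted. The following is a minimal sketch of that arithmetic, not part of the listing itself: the function name `request_cost` and the 1,000-in / 1,000-out workload are assumptions chosen for illustration, and the two price pairs are copied from the table above.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the USD cost of one request, given prices in USD per 1M tokens."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# (input, output) prices in USD per 1M tokens, copied from the table above.
PRICES = {
    "Meta: Llama 3.2 1B Instruct": (0.03, 0.20),
    "Qwen: Qwen2.5 7B Instruct": (0.04, 0.10),
}

# Hypothetical workload: 1,000 input tokens and 1,000 output tokens per request.
for model, (input_price, output_price) in PRICES.items():
    cost = request_cost(1_000, 1_000, input_price, output_price)
    print(f"{model}: ${cost * 1_000_000:.2f} per 1M requests")
```

Under this assumed workload, Qwen2.5 7B Instruct costs less per request than Llama 3.2 1B Instruct even though it ranks lower by input price. A workload dominated by input tokens would favor the input-price ranking instead.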
Cheapest Text Models
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context (tokens) | Vision | Functions |
|---|---|---|---|---|---|---|
| inclusionAI: Ling-2.6-flash | inclusionAI | $0.01 | $0.03 | 262K | | |
| IBM: Granite 4.0 Micro | IBM Granite | $0.02 | $0.11 | 131K | | |
| Meta: Llama 3.1 8B Instruct | Meta | $0.02 | $0.03 | 131K | | |
| Mistral: Mistral Nemo | Mistral AI | $0.02 | $0.03 | 131K | | |
| Meta: Llama 3.2 1B Instruct | Meta | $0.03 | $0.20 | 131K | | |
| OpenAI: gpt-oss-20b | OpenAI | $0.03 | $0.14 | 131K | | |
| LiquidAI: LFM2-24B-A2B | Liquid | $0.03 | $0.12 | 128K | | |
| Amazon: Nova Micro 1.0 | Amazon | $0.04 | $0.14 | 128K | | |
| Cohere: Command R7B (12-2024) | Cohere | $0.04 | $0.15 | 128K | | |
| OpenAI: gpt-oss-120b | OpenAI | $0.04 | $0.18 | 131K | | |
Cheapest Multimodal Models
All ten models in the table below accept image input (vision) and support streaming.
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context (tokens) | Vision | Functions |
|---|---|---|---|---|---|---|
| Google: Gemma 3 12B | Google | $0.05 | $0.15 | 131K | | |
| Google: Gemma 3 4B | Google | $0.05 | $0.10 | 131K | | |
| OpenAI: GPT-5 Nano | OpenAI | $0.05 | $0.40 | 400K | | |
| Amazon: Nova Lite 1.0 | Amazon | $0.06 | $0.24 | 300K | | |
| Google: Gemma 4 26B A4B | Google | $0.06 | $0.33 | 262K | | |
| Qwen: Qwen3.5-Flash | Qwen | $0.07 | $0.26 | 1.0M | | |
| ByteDance Seed: Seed 1.6 Flash | ByteDance Seed | $0.07 | $0.30 | 262K | | |
| Mistral: Mistral Small 3.2 24B | Mistral AI | $0.07 | $0.20 | 128K | | |
| Google: Gemma 3 27B | Google | $0.08 | $0.16 | 131K | | |
| Qwen: Qwen3 VL 8B Instruct | Qwen | $0.08 | $0.50 | 256K | | |
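
The cheapest model is only useful if its context window fits the job. As a rough sketch, the snippet below filters a few rows from the table above by a minimum context size and then sorts by input price. The `Model` type, the `cheapest_with_context` helper, the rounding of "131K" to 131,000 tokens, and the tie-break on output price are all assumptions made for illustration; the listing itself only states that it sorts by input price.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    context: int         # context window in tokens (rounded, as listed)


# A few rows copied from the multimodal table above.
MODELS = [
    Model("Google: Gemma 3 12B", 0.05, 0.15, 131_000),
    Model("Qwen: Qwen3.5-Flash", 0.07, 0.26, 1_000_000),
    Model("Amazon: Nova Lite 1.0", 0.06, 0.24, 300_000),
    Model("OpenAI: GPT-5 Nano", 0.05, 0.40, 400_000),
]


def cheapest_with_context(models, min_context):
    """Return models with at least min_context tokens of context, cheapest input first."""
    eligible = [m for m in models if m.context >= min_context]
    # Assumed tie-break: equal input prices are ordered by output price.
    return sorted(eligible, key=lambda m: (m.input_price, m.output_price))


ranked = cheapest_with_context(MODELS, 200_000)
print([m.name for m in ranked])
```

With a 200,000-token requirement, Gemma 3 12B drops out and GPT-5 Nano becomes the cheapest remaining option by input price.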
Cheapest Audio Models
The cheapest audio model, Mistral: Voxtral Small 24B 2507, supports streaming.
| Model | Provider | Input (USD / 1M tokens) | Output (USD / 1M tokens) | Context (tokens) | Vision | Functions |
|---|---|---|---|---|---|---|
| Mistral: Voxtral Small 24B 2507 | Mistral AI | $0.10 | $0.30 | 32K | | |
| OpenAI: GPT Audio Mini | OpenAI | $0.60 | $2.40 | 128K | | |
| OpenAI: GPT Audio | OpenAI | $2.50 | $10.00 | 128K | | |