DeepSeek: R1 Distill Qwen 32B API Pricing
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...
Input / 1M tokens
$0.29
Output / 1M tokens
$0.29
Cost per request (1K in / 500 out)
$0.0004
~$0.43 per 1M tokens mixed
Specifications
Context Window 128K tokens
Max Output Tokens 32,768
Vision Support No
Function Calling No
Streaming Yes
Modality text
Release Date 2025-01-29
Price history will appear here after the first price change is detected.
More from DeepSeek
Input / 1M tokens
$0.20
Output / 1M tokens
$0.80
Context: 131K Streaming
Input / 1M tokens
$0.20
Output / 1M tokens
$0.77
Context: 164K Streaming
Input / 1M tokens
$0.21
Output / 1M tokens
$0.79
Context: 164K Streaming
Input / 1M tokens
$0.27
Output / 1M tokens
$0.95
Context: 164K Streaming
Input / 1M tokens
$0.23
Output / 1M tokens
$0.34
Context: 131K Streaming
Similar Models from Other Providers
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming