Skip to main content
API Pricing

DeepSeek: R1 Distill Llama 70B API Pricing

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

Input / 1M tokens
$0.80
Output / 1M tokens
$0.80
Cost per request (1K in / 500 out)
$0.0012
~$1.20 per 1M tokens mixed

Specifications

Context Window 128K tokens
Max Output Tokens 8,192
Vision Support No
Function Calling No
Streaming Yes
Modality text
Release Date 2025-01-23
Price history will appear here after the first price change is detected.

More from DeepSeek

Input / 1M tokens
$0.20
Output / 1M tokens
$0.80
Context: 131K Streaming
Input / 1M tokens
$0.20
Output / 1M tokens
$0.77
Context: 164K Streaming
Input / 1M tokens
$0.21
Output / 1M tokens
$0.79
Context: 164K Streaming
Input / 1M tokens
$0.27
Output / 1M tokens
$0.95
Context: 164K Streaming
Input / 1M tokens
$0.23
Output / 1M tokens
$0.34
Context: 131K Streaming

Similar Models from Other Providers

Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming