Inception: Mercury 2 API Pricing
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...
Input / 1M tokens
$0.25
Output / 1M tokens
$0.75
Cost per request (1K in / 500 out)
$0.0006
~$0.63 per 1M tokens mixed
Specifications
Context Window 128K tokens
Max Output Tokens 50,000
Vision Support No
Function Calling No
Streaming Yes
Modality text
Release Date 2026-03-04
Price history will appear here after the first price change is detected.
Similar Models from Other Providers
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming