
Inception: Mercury 2 API Pricing

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

Input / 1M tokens: $0.25
Output / 1M tokens: $0.75
Cost per request (1K in / 500 out): $0.0006
~$0.63 per 1M tokens mixed
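
The per-request figure above follows directly from the two listed rates. A minimal sketch of that arithmetic is shown below; the function names `request_cost` and `blended_price_per_m` are hypothetical, and because the page does not state how its "mixed" rate is weighted, the input share is left as a parameter rather than assumed.

```python
from decimal import Decimal

# Listed prices in USD per 1M tokens (taken from the pricing above).
INPUT_PRICE_PER_M = Decimal("0.25")
OUTPUT_PRICE_PER_M = Decimal("0.75")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the USD cost of a single request at the listed rates."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / TOKENS_PER_M


def blended_price_per_m(input_share: Decimal) -> Decimal:
    """Return the USD price per 1M tokens for a given input/output mix.

    input_share is the fraction of all tokens that are input tokens (0 to 1).
    The weighting used by the page's "mixed" figure is not stated, so this
    is a parameter here, not a known value.
    """
    if not Decimal(0) <= input_share <= Decimal(1):
        raise ValueError("input_share must be between 0 and 1")
    return input_share * INPUT_PRICE_PER_M + (1 - input_share) * OUTPUT_PRICE_PER_M


if __name__ == "__main__":
    # 1K input tokens and 500 output tokens, as in the listed example.
    print(request_cost(1_000, 500))
    # One possible mix: 25% input tokens, 75% output tokens.
    print(blended_price_per_m(Decimal("0.25")))
```

For the listed example, `request_cost(1_000, 500)` evaluates to exactly $0.000625, which the page rounds to $0.0006. A mix of 25% input and 75% output tokens gives $0.625 per 1M tokens, which would round to the listed ~$0.63, though the page does not confirm that this is the mix it uses.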

Specifications

Context Window: 128K tokens
Max Output Tokens: 50,000
Vision Support: No
Function Calling: No
Streaming: Yes
Modality: text
Release Date: 2026-03-04
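
The context window and output cap together bound how many tokens a single request can ask for. The sketch below illustrates one way to compute a safe output budget. It rests on two assumptions that the page does not confirm: that "128K" means 128,000 tokens (it could also mean 131,072), and that input and output tokens share the same context window. The name `max_allowed_output` is hypothetical.

```python
# Limits taken from the specifications above.
# Assumption: "128K" is read as 128,000 tokens; the provider may count 131,072.
CONTEXT_WINDOW = 128_000
MAX_OUTPUT_TOKENS = 50_000


def max_allowed_output(input_tokens: int) -> int:
    """Return the largest output budget that fits for a given prompt size.

    Assumption: the prompt and the generated output share one context
    window, so the output budget is capped both by the remaining window
    and by the listed maximum output length.
    """
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - input_tokens
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


if __name__ == "__main__":
    for prompt_size in (1_000, 100_000, 200_000):
        print(prompt_size, max_allowed_output(prompt_size))
```

Under these assumptions, a short prompt is limited by the 50,000-token output cap, a 100,000-token prompt leaves room for 28,000 output tokens, and a prompt larger than the window leaves none.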

Similar Models from Other Providers

Input / 1M tokens: Variable; Output / 1M tokens: Variable; Context: 128K; Streaming: Yes
Input / 1M tokens: Variable; Output / 1M tokens: Variable; Context: 128K; Streaming: Yes
Input / 1M tokens: Variable; Output / 1M tokens: Variable; Context: 2.0M; Streaming: Yes
Input / 1M tokens: Free; Output / 1M tokens: Free; Context: 33K; Streaming: Yes
Input / 1M tokens: Free; Output / 1M tokens: Free; Context: 33K; Streaming: Yes