NVIDIA: Nemotron Nano 12B 2 VL (free) API Pricing
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...
Input / 1M tokens
Free
Output / 1M tokens
Free
Cost per request (1K in / 500 out)
Free
~$0.00 per 1M tokens mixed
Specifications
Context Window 128K tokens
Max Output Tokens 128,000
Vision Support Yes
Function Calling No
Streaming Yes
Modality multimodal
Release Date 2025-10-28
Price history will appear here after the first price change is detected.
More from NVIDIA
Input / 1M tokens
$0.40
Output / 1M tokens
$0.40
Context: 131K Streaming
Input / 1M tokens
$0.05
Output / 1M tokens
$0.20
Context: 262K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 256K Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 256K Vision Streaming
Input / 1M tokens
$0.09
Output / 1M tokens
$0.45
Context: 1.0M Streaming
Similar Models from Other Providers
Multimodal
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming