Skip to main content
API Pricing

NVIDIA: Nemotron Nano 12B 2 VL (free) API Pricing

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

Input / 1M tokens
Free
Output / 1M tokens
Free
Cost per request (1K in / 500 out)
Free
~$0.00 per 1M tokens mixed

Specifications

Context Window 128K tokens
Max Output Tokens 128,000
Vision Support Yes
Function Calling No
Streaming Yes
Modality multimodal
Release Date 2025-10-28
Price history will appear here after the first price change is detected.

More from NVIDIA

Input / 1M tokens
$0.40
Output / 1M tokens
$0.40
Context: 131K Streaming
Input / 1M tokens
$0.05
Output / 1M tokens
$0.20
Context: 262K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 256K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 256K Vision Streaming
Input / 1M tokens
$0.09
Output / 1M tokens
$0.45
Context: 1.0M Streaming

Similar Models from Other Providers

Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming