NVIDIA: Nemotron 3 Nano Omni (free) API Pricing
NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...
Input / 1M tokens
Free
Output / 1M tokens
Free
Cost per request (1K in / 500 out)
Free
~$0.00 per 1M tokens mixed
Specifications
Context Window 256K tokens
Max Output Tokens 65,536
Vision Support Yes
Function Calling No
Streaming Yes
Modality multimodal
Release Date 2026-04-28
Price history will appear here after the first price change is detected.
More from NVIDIA
Input / 1M tokens
$0.40
Output / 1M tokens
$0.40
Context: 131K Streaming
Input / 1M tokens
$0.05
Output / 1M tokens
$0.20
Context: 262K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 256K Streaming
Input / 1M tokens
$0.09
Output / 1M tokens
$0.45
Context: 1.0M Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Streaming
Similar Models from Other Providers
Multimodal
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 262K Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming
Multimodal
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Vision Streaming