Skip to main content
API Pricing

NVIDIA: Nemotron 3 Nano 30B A3B (free) API Pricing

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

Input / 1M tokens
Free
Output / 1M tokens
Free
Cost per request (1K in / 500 out)
Free
~$0.00 per 1M tokens mixed

Specifications

Context Window 256K tokens
Max Output Tokens 0
Vision Support No
Function Calling No
Streaming Yes
Modality text
Release Date 2025-12-14
Price history will appear here after the first price change is detected.

More from NVIDIA

Input / 1M tokens
$0.40
Output / 1M tokens
$0.40
Context: 131K Streaming
Input / 1M tokens
$0.05
Output / 1M tokens
$0.20
Context: 262K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 256K Vision Streaming
Input / 1M tokens
$0.09
Output / 1M tokens
$0.45
Context: 1.0M Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 1.0M Streaming

Similar Models from Other Providers

Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 128K Streaming
Input / 1M tokens
Variable
Output / 1M tokens
Variable
Context: 2.0M Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming
Input / 1M tokens
Free
Output / 1M tokens
Free
Context: 33K Streaming