API Pricing

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 API Pricing

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

Input / 1M tokens

$0.40

Output / 1M tokens

$0.40

Cost per request (1K in / 500 out)

$0.0006

~$0.60 per 1M tokens mixed

Specifications

Context Window 131K tokens

Max Output Tokens 16,384

Vision Support No

Function Calling No

Streaming Yes

Modality text

Release Date 2025-10-10

Price history will appear here after the first price change is detected.

More from NVIDIA

NVIDIA: Nemotron 3 Nano 30B A3B

Input / 1M tokens

$0.05

Output / 1M tokens

$0.20

Context: 262K Streaming

NVIDIA: Nemotron 3 Nano 30B A3B (free)

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 256K Streaming

NVIDIA: Nemotron 3 Nano Omni (free)

Multimodal

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 256K Vision Streaming

NVIDIA: Nemotron 3 Super

Input / 1M tokens

$0.09

Output / 1M tokens

$0.45

Context: 1.0M Streaming

NVIDIA: Nemotron 3 Super (free)

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 1.0M Streaming

Similar Models from Other Providers

Body Builder (beta)

Input / 1M tokens

Variable

Output / 1M tokens

Variable

Context: 128K Streaming

OpenRouter: Fusion

Input / 1M tokens

Variable

Output / 1M tokens

Variable

Context: 128K Streaming

Pareto Code Router

Input / 1M tokens

Variable

Output / 1M tokens

Variable

Context: 2.0M Streaming

Venice: Uncensored (free)

Cognitivecomputations

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 33K Streaming

LiquidAI: LFM2.5-1.2B-Instruct (free)

Input / 1M tokens

Free

Output / 1M tokens

Free

Context: 33K Streaming

Compare NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Body Builder (beta)

Side-by-side comparison

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

OpenRouter: Fusion

Side-by-side comparison

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

Pareto Code Router

Side-by-side comparison

Cost Calculator

Estimate your API spend

Side-by-side comparison

Lowest cost models ranked

All NVIDIA Models

Browse provider pricing