nvidia/Llama-3.3-Nemotron-Super-49B-v1.5

Name: DeepInfra: nvidia/Llama-3.3-Nemotron-Super-49B-v1.5
Brand: DeepInfra

by DeepInfra — Price updated 2026-04-04

Input $0.10

Output $0.40

Context Window 131,072

Max Output 131,072

Input text

Output text

nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 is a budget-friendly language model from DeepInfra.

It supports a context window of 131K tokens with up to 131K output tokens.

Priced at $0.10/1M input tokens, it ranks among the more affordable options in the current market.

Provider	Input ($/1M)	Output ($/1M)
NVIDIA	$0.10	$0.40
DeepInfra	$0.10	$0.40