nvidia/Llama-3.3-Nemotron-Super-49B-v1.5

by DeepInfra — Price updated 2026-04-04

Pricing ($/1M tokens)
Input $0.10
Output $0.40
Specifications
Context Window 131,072
Max Output 131,072
Input text
Output text
About This Model

nvidia/Llama-3.3-Nemotron-Super-49B-v1.5 is a budget-friendly language model from DeepInfra.

It supports a context window of 131K tokens with up to 131K output tokens.

Priced at $0.10/1M input tokens, it ranks among the more affordable options in the current market.

Available Providers (2)
Provider Input ($/1M) Output ($/1M)
NVIDIA $0.10 $0.40
DeepInfra $0.10 $0.40