Back to comparator
nvidia

NVIDIA: Nemotron 3 Nano 30B A3B

nvidia/nemotron-3-nano-30b-a3b

For $1, you can send approximately:
~571messages

How do we get this number?

One message = ~7,000 input tokens + ~7,000 output tokens
Input cost per message7,000 x $0.05/M = $0.000350
Output cost per message7,000 x $0.20/M = $0.001400
Total cost per message$0.001750
Messages for $1571.43
Context window
262k
tokens
Max response
0
tokens
Input price
$0.05
per million tokens
Output price
$0.20
per million tokens

Modalities

Input:textOutput:text

Description

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...