nvidia
NVIDIA: Nemotron Nano 9B V2
nvidia/nemotron-nano-9b-v2
For $1, you can send approximately:
~714messages
How do we get this number?
One message = ~7,000 input tokens + ~7,000 output tokens
Input cost per message7,000 x $0.04/M = $0.000280
Output cost per message7,000 x $0.16/M = $0.001120
Total cost per message$0.001400
Messages for $1714.29
Context window
131k
tokens
Max response
0
tokens
Input price
$0.04
per million tokens
Output price
$0.16
per million tokens
Modalities
Input:textOutput:text
Description
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...