nvidia
NVIDIA: Nemotron Nano 12B 2 VL
nvidia/nemotron-nano-12b-v2-vl
For $1, you can send approximately:
~179messages
How do we get this number?
One message = ~7,000 input tokens + ~7,000 output tokens
Input cost per message7,000 x $0.20/M = $0.001400
Output cost per message7,000 x $0.60/M = $0.004200
Total cost per message$0.005600
Messages for $1178.57
Context window
131k
tokens
Max response
0
tokens
Input price
$0.20
per million tokens
Output price
$0.60
per million tokens
Modalities
Input:imageInput:textInput:videoOutput:text
Description
NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...