nvidia

NVIDIA: Nemotron Nano 12B 2 VL

nvidia/nemotron-nano-12b-v2-vl

For $1, you can send approximately:

~179messages

How do we get this number?

One message = ~7,000 input tokens + ~7,000 output tokens

Input cost per message7,000 x $0.20/M = $0.001400

Output cost per message7,000 x $0.60/M = $0.004200

Total cost per message$0.005600

Messages for $1178.57

Context window

131k

tokens

Max response

tokens

Input price

$0.20

per million tokens

Output price

$0.60

per million tokens

Modalities

Input:imageInput:textInput:videoOutput:text

Description

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...