Meta: Llama 3.2 11B Vision Instruct
meta-llama/llama-3.2-11b-vision-instruct
For $1, you can send approximately:
~1.5kmessages
How do we get this number?
One message = ~7,000 input tokens + ~7,000 output tokens
Input cost per message7,000 x $0.05/M = $0.000343
Output cost per message7,000 x $0.05/M = $0.000343
Total cost per message$0.000686
Messages for $11457.73
Context window
131k
tokens
Max response
16k
tokens
Input price
$0.05
per million tokens
Output price
$0.05
per million tokens
Modalities
Input:textInput:imageOutput:text
Description
Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...