Llama 3.2 11B Instruct logo

Llama 3.2 11B Instruct

Meta Llama

Llama 3.2 11B Vision Instruct is a sophisticated multimodal language model expertly fine-tuned for visual understanding. It excels at tasks like identifying objects, reasoning about image content, creating descriptive captions, and answering diverse questions related to images. The model takes both text and images as input, then produces articulate text-based responses.

Model Specifications

Technical details and capabilities of Llama 3.2 11B Instruct

Core Specifications

10.6B Parameters

Model size and complexity

128.0K / 128.0K

Input / Output tokens

December 30, 2023

Knowledge cutoff date

September 24, 2024

Release date

Capabilities & License

Multimodal Support
Supported
Web Hydrated
No
License
Llama 3.2 Community License

Resources

API Reference
https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct
Code Repository
https://github.com/facebookresearch/llama

Performance Insights

Check out how Llama 3.2 11B Instruct handles various AI tasks through comprehensive benchmark results.

100
75
50
25
0
91.1
AI2 Diagram
91.1
(91%)
88.4
DocVQA
88.4
(88%)
83.4
ChartQA
83.4
(83%)
75.2
VQAv2 (test)
75.2
(75%)
73
MMLU
73
(73%)
68.9
MGSM
68.9
(69%)
51.9
MATH
51.9
(52%)
51.5
MathVista
51.5
(52%)
50.7
MMMU
50.7
(51%)
33
MMMU-Pro
33
(33%)
32.8
GPQA
32.8
(33%)
AI2 Diagram
DocVQA
ChartQA
VQAv2 (test)
MMLU
MGSM
MATH
MathVista
MMMU
MMMU-Pro
GPQA

Detailed Benchmarks

Dive deeper into Llama 3.2 11B Instruct's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Non categorized

MMMU-Pro

Current model
Other models
Avg (43.4%)

MathVista

Current model
Other models
Avg (48.7%)

AI2 Diagram

Current model
Other models
Avg (91.7%)

DocVQA

Current model
Other models
Avg (90.2%)

MGSM

Current model
Other models
Avg (69.5%)

Providers Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Llama 3.2 11B Instruct. Compare costs across platforms to find the best pricing for your use case.

OpenAI
Anthropic
Google
Mistral AI
Cohere

Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.

Your feedback helps us improve our service

Stay Ahead with AI Updates

Get insights on Gemini Pro 2.5, Sonnet 3.7 and more top AI models