Qwen2-VL-72B-Instruct logo

Qwen2-VL-72B-Instruct

Qwen

This advanced AI model is designed for sophisticated visual analysis and logical, step-by-step thinking. It accepts both images and videos, intelligently adapting to different resolutions. Enhanced positional encoding (M-ROPE) allows for cutting-edge functionalities, including tackling complex problems, deciphering multilingual text within images, and engaging in interactive, agent-based scenarios within videos.

Model Specifications

Technical details and capabilities of Qwen2-VL-72B-Instruct

Core Specifications

73.4B Parameters

Model size and complexity

32.8K / 32.8K

Input / Output tokens

June 29, 2023

Knowledge cutoff date

August 28, 2024

Release date

Capabilities & License

Multimodal Support
Supported
Web Hydrated
No
License
tongyi-qianwen

Resources

Research Paper
https://arxiv.org/abs/2409.12191
API Reference
https://huggingface.co/Qwen/Qwen2-VL-72B-Instruct
Code Repository
https://github.com/QwenLM/Qwen2-VL

Performance Insights

Check out how Qwen2-VL-72B-Instruct handles various AI tasks through comprehensive benchmark results.

100
75
50
25
0
96.5
DocVQAtest
96.5
(97%)
91.9
VCR_en_easy
91.9
(92%)
88.3
ChartQAtest
88.3
(88%)
87.7
OCRBench
87.7
(88%)
86.5
MMBench_test
86.5
(87%)
85.5
TextVQAval
85.5
(86%)
84.5
InfoVQAtest
84.5
(85%)
77.9
EgoSchema_test
77.9
(78%)
77.8
RealWorldQA
77.8
(78%)
74
MMVetGPT4Turbo
74
(74%)
73.6
MVBench
73.6
(74%)
70.5
MathVista_test_mini
70.5
(71%)
64.5
MMMUval
64.5
(65%)
46.2
MMMU-Pro
46.2
(46%)
30.9
MTVQA
30.9
(31%)
DocVQAtest
VCR_en_easy
ChartQAtest
OCRBench
MMBench_test
TextVQAval
InfoVQAtest
EgoSchema_test
RealWorldQA
MMVetGPT4Turbo
MVBench
MathVista_test_mini
MMMUval
MMMU-Pro
MTVQA

Detailed Benchmarks

Dive deeper into Qwen2-VL-72B-Instruct's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Non categorized

MMMU-Pro

Current model
Other models
Avg (43.4%)

RealWorldQA

Current model
Other models
Avg (73.3%)

Providers Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Qwen2-VL-72B-Instruct. Compare costs across platforms to find the best pricing for your use case.

OpenAI
Anthropic
Google
Mistral AI
Cohere

Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.

Your feedback helps us improve our service

Stay Ahead with AI Updates

Get insights on Gemini Pro 2.5, Sonnet 3.7 and more top AI models