Qwen2-VL-72B-Instruct

Qwen

This advanced AI model is designed for sophisticated visual analysis and logical, step-by-step thinking. It accepts both images and videos, intelligently adapting to different resolutions. Enhanced positional encoding (M-ROPE) allows for cutting-edge functionalities, including tackling complex problems, deciphering multilingual text within images, and engaging in interactive, agent-based scenarios within videos.

Model Specifications

Technical details and capabilities of Qwen2-VL-72B-Instruct

Core Specifications

73.4B Parameters

Model size and complexity

32.8K / 32.8K

Input / Output tokens

June 29, 2023

Knowledge cutoff date

August 28, 2024

Release date

Capabilities & License

Multimodal Support

Supported

Web Hydrated

License

tongyi-qianwen

Resources

Research Paper

https://arxiv.org/abs/2409.12191

API Reference

https://huggingface.co/Qwen/Qwen2-VL-72B-Instruct

Code Repository

https://github.com/QwenLM/Qwen2-VL

Performance Insights

Check out how Qwen2-VL-72B-Instruct handles various AI tasks through comprehensive benchmark results.

100

96.5

DocVQAtest

96.5

(97%)

91.9

VCR_en_easy

91.9

(92%)

88.3

ChartQAtest

88.3

(88%)

87.7

OCRBench

87.7

(88%)

86.5

MMBench_test

86.5

(87%)

85.5

TextVQAval

85.5

(86%)

84.5

InfoVQAtest

84.5

(85%)

77.9

EgoSchema_test

77.9

(78%)

77.8

RealWorldQA

77.8

(78%)

MMVetGPT4Turbo

(74%)

73.6

MVBench

73.6

(74%)

70.5

MathVista_test_mini

70.5

(71%)

64.5

MMMUval

64.5

(65%)

46.2

MMMU-Pro

46.2

(46%)

30.9

MTVQA

30.9

(31%)

DocVQAtest

VCR_en_easy

ChartQAtest

OCRBench

MMBench_test

TextVQAval

InfoVQAtest

EgoSchema_test

RealWorldQA

MMVetGPT4Turbo

MVBench

MathVista_test_mini

MMMUval

MMMU-Pro

MTVQA

Detailed Benchmarks

Dive deeper into Qwen2-VL-72B-Instruct's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Non categorized

MMMU-Pro

Mistral Small 3.1 24B

49.3%

Qwen2-VL-72B-Instruct

46.2%

Llama 3.2 90B Instruct

45.2%

Llama 3.2 11B Instruct

33.0%

Current model

Other models

Avg (43.4%)

RealWorldQA

Qwen2-VL-72B-Instruct

77.8%

Grok-1.5V

68.7%

Current model

Other models

Avg (73.3%)

Providers Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Qwen2-VL-72B-Instruct. Compare costs across platforms to find the best pricing for your use case.

OpenAI

Anthropic

Google

Mistral AI

Cohere

Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.

Your feedback helps us improve our service

Qwen2-VL-72B-Instruct

Model Specifications

Core Specifications

Capabilities & License

Resources

Performance Insights

Detailed Benchmarks

Non categorized

MMMU-Pro

RealWorldQA

Providers Pricing Coming Soon

Share your feedback

Stay Ahead with AI Updates