Grok-1.5V

xAI

This AI model excels at understanding both text and images, handling diverse visual formats like documents, diagrams, charts, screenshots, and photos. It's particularly strong in grasping real-world spatial relationships.

Model Specifications

Technical details and capabilities of Grok-1.5V

Core Specifications

128.0K / 128.0K

Input / Output tokens

April 11, 2024

Release date

Capabilities & License

Multimodal Support

Supported

Web Hydrated

License

Proprietary

Resources

API Reference

https://x.ai/api

Performance Insights

Check out how Grok-1.5V handles various AI tasks through comprehensive benchmark results.

88.3

AI2D

88.3

(98%)

85.6

DocVQA

85.6

(95%)

78.1

TextVQA

78.1

(87%)

76.1

ChartQA

76.1

(85%)

68.7

RealWorldQA

68.7

(76%)

53.6

MMMU

53.6

(60%)

52.8

Mathvista

52.8

(59%)

AI2D

DocVQA

TextVQA

ChartQA

RealWorldQA

MMMU

Mathvista

Detailed Benchmarks

Dive deeper into Grok-1.5V's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Non categorized

MMMU

Gemini Pro 2.5 Experimental

81.7%

59.4%

56.2%

53.7%

53.6%

53.6%

52.5%

Llama 3.2 11B Instruct

50.7%

Gemini 1.0 Pro

47.9%

GPT-3.5 Turbo

0.0%

Current model

Other models

Avg (50.9%)

Mathvista

Pixtral-12B

58.0%

Grok-1.5V

52.8%

Current model

Other models

Avg (55.4%)

AI2D

Claude 3.5 Sonnet

94.7%

GPT-4o

94.2%

Pixtral Large

93.8%

Mistral Small 3.1 24B

93.7%

Grok-1.5V

88.3%

Phi-3.5-vision-instruct

78.1%

Current model

Other models

Avg (90.5%)

TextVQA

Grok-1.5V

78.1%

Phi-3.5-vision-instruct

72.0%

Current model

Other models

Avg (75.0%)

ChartQA

Pixtral Large

88.1%

Mistral Small 3.1 24B

86.2%

GPT-4o

85.7%

Llama 3.2 90B Instruct

85.5%

Llama 3.2 11B Instruct

83.4%

Phi-3.5-vision-instruct

81.8%

Pixtral-12B

81.8%

Grok-1.5V

76.1%

Current model

Other models

Avg (83.6%)

DocVQA

93.3%

93.2%

92.8%

90.7%

Llama 3.2 90B Instruct

90.1%

Llama 3.2 11B Instruct

88.4%

Grok-1.5

85.6%

Grok-1.5V

85.6%

Current model

Other models

Avg (90.0%)

RealWorldQA

Qwen2-VL-72B-Instruct

77.8%

Grok-1.5V

68.7%

Current model

Other models

Avg (73.3%)

Providers Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Grok-1.5V. Compare costs across platforms to find the best pricing for your use case.

OpenAI

Anthropic

Google

Mistral AI

Cohere

Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.

Your feedback helps us improve our service

Grok-1.5V

Model Specifications

Core Specifications

Capabilities & License

Resources

Performance Insights

Detailed Benchmarks

Non categorized

MMMU

Mathvista

AI2D

TextVQA

ChartQA

DocVQA

RealWorldQA

Providers Pricing Coming Soon

Share your feedback

Stay Ahead with AI Updates