Grok-1.5V logo

Grok-1.5V

xAI

This AI model excels at understanding both text and images, handling diverse visual formats like documents, diagrams, charts, screenshots, and photos. It's particularly strong in grasping real-world spatial relationships.

Model Specifications

Technical details and capabilities of Grok-1.5V

Core Specifications

128.0K / 128.0K

Input / Output tokens

April 11, 2024

Release date

Capabilities & License

Multimodal Support
Supported
Web Hydrated
No
License
Proprietary

Resources

API Reference
https://x.ai/api

Performance Insights

Check out how Grok-1.5V handles various AI tasks through comprehensive benchmark results.

90
68
45
23
0
88.3
AI2D
88.3
(98%)
85.6
DocVQA
85.6
(95%)
78.1
TextVQA
78.1
(87%)
76.1
ChartQA
76.1
(85%)
68.7
RealWorldQA
68.7
(76%)
53.6
MMMU
53.6
(60%)
52.8
Mathvista
52.8
(59%)
AI2D
DocVQA
TextVQA
ChartQA
RealWorldQA
MMMU
Mathvista

Detailed Benchmarks

Dive deeper into Grok-1.5V's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Non categorized

Mathvista

Current model
Other models
Avg (55.4%)

AI2D

Current model
Other models
Avg (90.5%)

TextVQA

Current model
Other models
Avg (75.0%)

ChartQA

Current model
Other models
Avg (83.6%)

DocVQA

Current model
Other models
Avg (90.0%)

RealWorldQA

Current model
Other models
Avg (73.3%)

Providers Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Grok-1.5V. Compare costs across platforms to find the best pricing for your use case.

OpenAI
Anthropic
Google
Mistral AI
Cohere

Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.

Your feedback helps us improve our service

Stay Ahead with AI Updates

Get insights on Gemini Pro 2.5, Sonnet 3.7 and more top AI models