Pixtral-12B
Mistral AI
Pixtral-12B is a multimodal model with about 12.4 billion parameters in total: a 12-billion-parameter decoder paired with a 400-million-parameter vision encoder. It is built to interpret images and documents alongside text, and it is designed to perform well on tasks that combine the two without giving up text-only performance. It accepts images of varying sizes and can process multiple images in a single prompt, which gives it more context to work with.
Model Specifications
Technical details and capabilities of Pixtral-12B
Core Specifications
12.4B Parameters
Model size and complexity
128.0K / 8.2K
Input / Output tokens
September 16, 2024
Release date
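The figures above lend themselves to two quick back-of-the-envelope checks: how much memory the weights alone would need, and whether a request fits the listed token limits. The Python sketch below is illustrative only. The exact token limits (128,000 and 8,200) are assumptions read off the rounded "128.0K / 8.2K" display, the per-image token counts are hypothetical inputs, and the function names are made up for this example rather than taken from any Pixtral or provider API.

```python
# Rough sizing helpers based on the spec sheet figures.
# Assumption: the rounded "128.0K / 8.2K" display is read as 128,000 / 8,200 tokens;
# the real limits may differ slightly.
PARAMETERS = 12.4e9
MAX_INPUT_TOKENS = 128_000
MAX_OUTPUT_TOKENS = 8_200


def weight_memory_gb(parameters: float, bytes_per_parameter: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (no activations or cache)."""
    return parameters * bytes_per_parameter / 1e9


def fits_budget(text_tokens: int, image_tokens: list[int], output_tokens: int) -> bool:
    """Check a request against the assumed input and output limits.

    image_tokens holds one hypothetical token count per attached image.
    """
    total_input = text_tokens + sum(image_tokens)
    return total_input <= MAX_INPUT_TOKENS and output_tokens <= MAX_OUTPUT_TOKENS


if __name__ == "__main__":
    # 16-bit weights use 2 bytes per parameter.
    print(f"16-bit weights: {weight_memory_gb(PARAMETERS, 2):.1f} GB")
    # A prompt with 2,000 text tokens and three images of 4,000 tokens each.
    print(fits_budget(2_000, [4_000, 4_000, 4_000], 1_000))
```

The memory figure covers weights only; actual serving requirements depend on the runtime, the precision used, and the length of the context being processed.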
Performance Insights
See how Pixtral-12B performs across a range of AI tasks, based on benchmark results.
Detailed Benchmarks
Pixtral-12B's performance is broken down by task category, with the benchmarks used in each category listed below.
Coding
HumanEval
Knowledge
MMLU
Uncategorized
MMMU
ChartQA
DocVQA
VQAv2
MT-Bench
IFEval
Provider Pricing Coming Soon
We're working on gathering comprehensive pricing data from all major providers for Pixtral-12B. Compare costs across platforms to find the best pricing for your use case.