o3-mini

OpenAI

o3-mini is a streamlined version of o3. It is designed for stronger reasoning with a lighter compute footprint, while still performing well on standard tasks.

Model Specifications

Technical details and capabilities of o3-mini

Core Specifications

Input / Output tokens: 200.0K / 200.0K
Knowledge cutoff date: May 31, 2024
Release date: January 29, 2025

Capabilities & License

Multimodal Support: Not Supported
Web Hydrated: No
License: Proprietary

Resources

Research Paper: https://cdn.openai.com/o3-mini-system-card.pdf
API Reference: https://platform.openai.com/docs/models
Code Repository: https://github.com/openai

Performance Insights

Check out how o3-mini handles various AI tasks through comprehensive benchmark results.

Benchmark scores for o3-mini (percent, scale 0 to 100):

MATH: 97.9%
MGSM: 92%
AIME 2024: 87.3%
MMLU: 86.9%
AIME 2025: 86.5%
Livebench: 84.6%
GPQA: 79.7%
Codeforces: 79%
LiveCodeBench: 74.1%
Aider Polyglot: 60.4%
SWE-bench Verified: 49.3%
MRCR: 36.3%
Humanity's Last Exam: 14.0%
SimpleQA: 13.8%
FrontierMath: 9.2%

Detailed Benchmarks

A closer look at o3-mini's performance in specific task categories, with the average across the compared models for each benchmark.

Math

AIME 2024: average across models 80.8%
AIME 2025: average across models 79.3%

Coding

Codeforces: average across models 57.0% (charted values: 90.0%, 79.0%, 68.0%, 47.0%, 41.3%, 11.0%)
SWE-bench Verified: average across models 50.1%
Aider Polyglot: average across models 53.7%
LiveCodeBench: average across models 63.2%

Knowledge

GPQA: average across models 76.2%
MMLU: average across models 85.2%
MATH: average across models 88.9%

Non categorized

FrontierMath: average across models 7.3% (charted value: 5.5%)
MGSM: average across models 91.0%
SimpleQA: average across models 28.7%
Humanity's Last Exam: average across models 11.3%
MRCR: average across models 63.8%
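
To see where o3-mini sits relative to the averages above, the two sets of numbers on this page can be subtracted benchmark by benchmark. The following is a minimal sketch only: the figures are copied from this page, while the names `O3_MINI`, `AVERAGE`, `deltas`, and `below_average` are made up for illustration and are not part of any API. Livebench is left out because no average is listed for it.

```python
# o3-mini scores and cross-model averages, copied from this page (percent).
# All names here are illustrative assumptions, not part of any library.
O3_MINI = {
    "AIME 2024": 87.3, "AIME 2025": 86.5,
    "Codeforces": 79.0, "SWE-bench Verified": 49.3,
    "Aider Polyglot": 60.4, "LiveCodeBench": 74.1,
    "GPQA": 79.7, "MMLU": 86.9, "MATH": 97.9,
    "FrontierMath": 9.2, "MGSM": 92.0, "SimpleQA": 13.8,
    "Humanity's Last Exam": 14.0, "MRCR": 36.3,
}
AVERAGE = {
    "AIME 2024": 80.8, "AIME 2025": 79.3,
    "Codeforces": 57.0, "SWE-bench Verified": 50.1,
    "Aider Polyglot": 53.7, "LiveCodeBench": 63.2,
    "GPQA": 76.2, "MMLU": 85.2, "MATH": 88.9,
    "FrontierMath": 7.3, "MGSM": 91.0, "SimpleQA": 28.7,
    "Humanity's Last Exam": 11.3, "MRCR": 63.8,
}


def deltas(model, average):
    """Return model score minus average, in percentage points, per benchmark."""
    return {
        name: round(model[name] - average[name], 1)
        for name in model
        if name in average
    }


def below_average(model, average):
    """Return the sorted names of benchmarks where the model trails the average."""
    return sorted(name for name, d in deltas(model, average).items() if d < 0)


if __name__ == "__main__":
    for name, d in sorted(deltas(O3_MINI, AVERAGE).items(), key=lambda kv: kv[1]):
        print(f"{name:22s} {d:+.1f}")
    print("Below average:", below_average(O3_MINI, AVERAGE))
```

On these figures the largest gap in o3-mini's favor is Codeforces, and it trails the average on MRCR, SimpleQA, and SWE-bench Verified.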


Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.
