
DeepSeek-R1
DeepSeek
DeepSeek-R1 is a reasoning model built on DeepSeek-V3, a Mixture-of-Experts model with 671B total parameters, of which 37B are activated per token. This first-generation model uses large-scale reinforcement learning (RL) to improve its chain-of-thought reasoning. As a result, DeepSeek-R1 performs strongly on complex tasks involving mathematics, coding, and multi-step reasoning.
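To put the two parameter counts in perspective, the short Python sketch below works out what share of the model is activated for each token. It uses only the 671B and 37B figures quoted above; the function and constant names are illustrative and do not come from any DeepSeek tooling.

```python
# Figures quoted in the description above, in billions of parameters.
TOTAL_PARAMS_B = 671
ACTIVE_PARAMS_B = 37


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters activated per token."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
print(f"Active per token: {fraction:.1%}")  # prints "Active per token: 5.5%"
```

Roughly one parameter in eighteen takes part in processing any given token, which is why the per-token compute is closer to that of a 37B model than a 671B one.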
Model Specifications
Technical details and capabilities of DeepSeek-R1
Core Specifications
671B Parameters
Model size and complexity
14.8T Training Tokens
Amount of data used in training
131.1K / 131.1K
Input / Output tokens
January 19, 2025
Release date
Capabilities & License
Performance Insights
Check out how DeepSeek-R1 handles various AI tasks through comprehensive benchmark results.
Model Comparison
See how DeepSeek-R1 stacks up against other leading models across key performance metrics.
Detailed Benchmarks
Dive deeper into DeepSeek-R1's performance across specific task categories.
Math
AIME 2024
MATH-500
AIME 2025
Coding
LiveCodeBench
SWE-bench Verified
Aider Polyglot
Reasoning
DROP
Knowledge
MMLU
GPQA
Uncategorized
MMLU-Redux
MMLU-Pro
IFEval
SimpleQA
C-Eval
Humanity's Last Exam