Gemma 2 9B logo

Gemma 2 9B

Google

Gemma 2 9B IT is a refined, instruction-tuned iteration of Google's Gemma 2 9B model. This enhanced model was developed using a massive dataset of 8 trillion tokens encompassing web content, code, and mathematical data. It incorporates advanced techniques such as sliding window attention, logit soft-capping, and knowledge distillation. Moreover, Gemma 2 9B IT is specifically optimized for dialogue-based applications, achieved through supervised fine-tuning, distillation, reinforcement learning from human feedback (RLHF), and strategic model merging via WARP.

Model Specifications

Technical details and capabilities of Gemma 2 9B

Core Specifications

9.2B Parameters

Model size and complexity

8000.0B Training Tokens

Amount of data used in training

8.2K / 8.2K

Input / Output tokens

June 26, 2024

Release date

Capabilities & License

Multimodal Support
Not Supported
Web Hydrated
No
License
gemma

Resources

Research Paper
https://storage.googleapis.com/deepmind-media/gemma/gemma-2-report.pdf
API Reference
https://huggingface.co/google/gemma-2-9b-it
Playground
https://huggingface.co/chat/models/google/gemma-2-9b-it

Performance Insights

Check out how Gemma 2 9B handles various AI tasks through comprehensive benchmark results.

90
68
45
23
0
88
ARC-e
88
(98%)
84.2
BoolQ
84.2
(94%)
81.9
HellaSwag
81.9
(91%)
81.7
PIQA
81.7
(91%)
80.6
Winogrande
80.6
(90%)
76.6
TriviaQA
76.6
(85%)
71.3
MMLU
71.3
(79%)
68.6
GSM8K
68.6
(76%)
68.4
ARC-c
68.4
(76%)
68.2
BIG-Bench
68.2
(76%)
53.4
SocialIQA
53.4
(59%)
52.8
AGIEval
52.8
(59%)
52.4
MBPP
52.4
(58%)
40.2
HumanEval
40.2
(45%)
36.6
MATH
36.6
(41%)
29.2
Natural Questions
29.2
(32%)
ARC-e
BoolQ
HellaSwag
PIQA
Winogrande
TriviaQA
MMLU
GSM8K
ARC-c
BIG-Bench
SocialIQA
AGIEval
MBPP
HumanEval
MATH
Natural Questions

Detailed Benchmarks

Dive deeper into Gemma 2 9B's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Coding

HumanEval

Current model
Other models
Avg (58.4%)

MBPP

Current model
Other models
Avg (70.5%)

Knowledge

MATH

Current model
Other models
Avg (40.7%)

Non categorized

PIQA

Current model
Other models
Avg (83.6%)

SocialIQA

Current model
Other models
Avg (53.6%)

BoolQ

Current model
Other models
Avg (82.9%)

Winogrande

Current model
Other models
Avg (82.2%)

ARC-e

Current model
Other models
Avg (88.3%)

ARC-c

Current model
Other models
Avg (69.9%)

TriviaQA

Current model
Other models
Avg (78.0%)

Natural Questions

Current model
Other models
Avg (31.9%)

AGIEval

Current model
Other models
Avg (56.3%)

BIG-Bench

Current model
Other models
Avg (71.5%)

Providers Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Gemma 2 9B. Compare costs across platforms to find the best pricing for your use case.

OpenAI
Anthropic
Google
Mistral AI
Cohere

Share your feedback

Hi, I'm Charlie Palars, the founder of Deepranking.ai. I'm always looking for ways to improve the site and make it more useful for you. You can write me through this form or directly through X at @palarsio.

Your feedback helps us improve our service

Stay Ahead with AI Updates

Get insights on Gemini Pro 2.5, Sonnet 3.7 and more top AI models