Gemma 2 9B

Google

Gemma 2 9B IT is a refined, instruction-tuned iteration of Google's Gemma 2 9B model. This enhanced model was developed using a massive dataset of 8 trillion tokens encompassing web content, code, and mathematical data. It incorporates advanced techniques such as sliding window attention, logit soft-capping, and knowledge distillation. Moreover, Gemma 2 9B IT is specifically optimized for dialogue-based applications, achieved through supervised fine-tuning, distillation, reinforcement learning from human feedback (RLHF), and strategic model merging via WARP.

Model Specifications

Technical details and capabilities of Gemma 2 9B

Core Specifications

9.2B Parameters

Model size and complexity

8000.0B Training Tokens

Amount of data used in training

8.2K / 8.2K

Input / Output tokens

June 26, 2024

Release date

Capabilities & License

Multimodal Support

Not Supported

Web Hydrated

License

gemma

Resources

Research Paper

https://storage.googleapis.com/deepmind-media/gemma/gemma-2-report.pdf

API Reference

https://huggingface.co/google/gemma-2-9b-it

Playground

https://huggingface.co/chat/models/google/gemma-2-9b-it

Performance Insights

Check out how Gemma 2 9B handles various AI tasks through comprehensive benchmark results.

ARC-e

(98%)

84.2

BoolQ

84.2

(94%)

81.9

HellaSwag

81.9

(91%)

81.7

PIQA

81.7

(91%)

80.6

Winogrande

80.6

(90%)

76.6

TriviaQA

76.6

(85%)

71.3

MMLU

71.3

(79%)

68.6

GSM8K

68.6

(76%)

68.4

ARC-c

68.4

(76%)

68.2

BIG-Bench

68.2

(76%)

53.4

SocialIQA

53.4

(59%)

52.8

AGIEval

52.8

(59%)

52.4

MBPP

52.4

(58%)

40.2

HumanEval

40.2

(45%)

36.6

MATH

36.6

(41%)

29.2

Natural Questions

29.2

(32%)

ARC-e

BoolQ

HellaSwag

PIQA

Winogrande

TriviaQA

MMLU

GSM8K

ARC-c

BIG-Bench

SocialIQA

AGIEval

MBPP

HumanEval

MATH

Natural Questions

Detailed Benchmarks

Dive deeper into Gemma 2 9B's performance across specific task categories. Expand each section to see detailed metrics and comparisons.

Math

GSM8K

Phi-3.5-MoE-instruct

88.7%

Gemini 1.5 Flash

86.2%

Phi-3.5-mini-instruct

86.2%

Qwen2.5-Coder 7B Instruct

83.9%

Qwen2 7B Instruct

82.3%

Llama 3.2 3B Instruct

77.7%

Gemma 2 27B

74.0%

Gemma 2 9B

Model Specifications

Core Specifications

Capabilities & License

Resources

Performance Insights

Detailed Benchmarks

Math

GSM8K

Coding

HumanEval

MBPP

Reasoning

HellaSwag

Knowledge

MMLU

MATH

Non categorized

PIQA

SocialIQA

BoolQ

Winogrande

ARC-e

ARC-c

TriviaQA

Natural Questions

AGIEval

BIG-Bench

Providers Pricing Coming Soon

Share your feedback

Stay Ahead with AI Updates