
Claude 3.5 Haiku

Anthropic

Claude 3.5 Haiku is Anthropic's fastest model, delivering advanced coding, tool use, and reasoning at an accessible price. It is well suited to user-facing products, specialized sub-agent tasks, and generating personalized experiences from large volumes of data. The model excels at code completion, interactive chatbots, data extraction and labeling, and real-time content moderation.
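
For reference, here is a minimal sketch of calling the model through Anthropic's Messages API with the official Python SDK. The model ID claude-3-5-haiku-20241022 is the dated alias published for this release; the code-completion prompt is purely illustrative.

```python
# pip install anthropic
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

# A simple code-completion style request, one of the use cases above.
message = client.messages.create(
    model="claude-3-5-haiku-20241022",
    max_tokens=1024,
    messages=[
        {
            "role": "user",
            "content": "Complete this Python function:\n\ndef is_palindrome(s: str) -> bool:",
        }
    ],
)

# The response content is a list of blocks; the first is the text reply.
print(message.content[0].text)
```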

Model Specifications

Technical details and capabilities of Claude 3.5 Haiku

Core Specifications

Input / Output tokens: 200K / 8.2K

Release date: October 21, 2024

Capabilities & License

Multimodal support: Not supported
Web access: No
License: Proprietary
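
To stay within these limits programmatically, here is a hedged sketch that checks prompt size before sending, assuming a recent version of the anthropic Python SDK where the messages.count_tokens endpoint is available:

```python
import anthropic

client = anthropic.Anthropic()

CONTEXT_WINDOW = 200_000  # input context window from the specs above
MAX_OUTPUT = 8_192        # maximum output tokens

messages = [{"role": "user", "content": "Summarize this report: ..."}]

# Count prompt tokens server-side before sending the real request,
# so we can leave room for the reply within the context window.
count = client.messages.count_tokens(
    model="claude-3-5-haiku-20241022",
    messages=messages,
)

if count.input_tokens + MAX_OUTPUT > CONTEXT_WINDOW:
    raise ValueError(f"Prompt too long: {count.input_tokens} input tokens")

response = client.messages.create(
    model="claude-3-5-haiku-20241022",
    max_tokens=MAX_OUTPUT,
    messages=messages,
)
```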

Resources

API Reference: https://docs.anthropic.com/en/docs/intro-to-claude#claude-3-5-family
Playground: https://claude.ai

Performance Insights

Check out how Claude 3.5 Haiku handles various AI tasks through comprehensive benchmark results.

| Benchmark          | Score |
|--------------------|-------|
| HumanEval          | 88.1  |
| MGSM               | 85.6  |
| DROP               | 83.1  |
| MATH               | 69.4  |
| MMLU-Pro           | 65.0  |
| TAU-bench Retail   | 51.0  |
| GPQA               | 41.6  |
| SWE-bench Verified | 40.6  |
| TAU-bench Airline  | 22.8  |

Model Comparison

See how Claude 3.5 Haiku stacks up against other leading models across key performance metrics.

| Benchmark | Claude 3.5 Haiku | GPT-4o mini | GPT-4 Turbo | Claude 3 Opus | GPT-4o | Claude 3.5 Sonnet |
|-----------|------------------|-------------|-------------|---------------|--------|-------------------|
| GPQA      | 41.6             | 40.2        | 48.0        | 50.4          | 53.6   | 59.4              |
| HumanEval | 88.1             | 87.2        | 87.1        | 84.9          | 90.2   | 92.0              |
| MATH      | 69.4             | 70.2        | 72.6        | 60.1          | 76.6   | 71.1              |
| MGSM      | 85.6             | 87.0        | 88.5        | 90.7          | 90.5   | 91.6              |
| DROP      | 83.1             | 79.7        | 86.0        | 83.1          | 83.4   | 87.1              |

Detailed Benchmarks

A closer look at Claude 3.5 Haiku's performance by task category. Each benchmark is listed with the model's score and the average score across the compared models.

Coding

- SWE-bench Verified: 40.6 (average across compared models: 47.6%)
- HumanEval: 88.1 (average: 83.3%)

Reasoning

- DROP: 83.1 (average: 81.9%)

Knowledge

- GPQA: 41.6 (average: 44.7%)
- MATH: 69.4 (average: 69.1%)

Other

- MGSM: 85.6 (average: 82.2%)
- TAU-bench Retail: 51.0 (average: 68.7%)
- TAU-bench Airline: 22.8 (average: 45.3%)
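
The TAU-bench scores above measure agentic tool use. As a rough illustration of what that capability looks like in practice, here is a minimal sketch of a tool-use call against this model via Anthropic's Messages API; the get_weather tool is a hypothetical example defined only for this snippet, not part of the API.

```python
import anthropic

client = anthropic.Anthropic()

# Hypothetical tool definition for illustration; the model decides
# whether to invoke it based on the description and input schema.
tools = [
    {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "input_schema": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    }
]

response = client.messages.create(
    model="claude-3-5-haiku-20241022",
    max_tokens=1024,
    tools=tools,
    messages=[{"role": "user", "content": "What's the weather in Paris?"}],
)

# If the model chose to call the tool, the response contains a tool_use block
# whose input the caller executes before returning a tool_result message.
for block in response.content:
    if block.type == "tool_use":
        print(block.name, block.input)  # e.g. get_weather {'city': 'Paris'}
```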

Provider Pricing Coming Soon

We're working on gathering comprehensive pricing data from all major providers for Claude 3.5 Haiku. Compare costs across platforms to find the best pricing for your use case.
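
Until provider pricing lands here, this sketch shows the kind of comparison the section will support. The per-million-token rates below are placeholders for illustration only, not published prices for any provider.

```python
# Placeholder rates in USD per million tokens; substitute each
# provider's published pricing once it is available.
HYPOTHETICAL_RATES = {
    "provider_a": {"input": 1.00, "output": 5.00},
    "provider_b": {"input": 0.80, "output": 4.00},
}

def estimate_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one request under a provider's rates."""
    rates = HYPOTHETICAL_RATES[provider]
    return (input_tokens * rates["input"] + output_tokens * rates["output"]) / 1_000_000

# Compare a 10K-token prompt with a 1K-token reply across providers.
for name in HYPOTHETICAL_RATES:
    print(name, round(estimate_cost(name, 10_000, 1_000), 4))
```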


