GPT-4o

OpenAI

GPT-4o ("o" for "omni") is a versatile AI model capable of processing various inputs, including text, audio, images, and video, and producing outputs in text, audio, and image formats. It delivers the same high performance as GPT-4 Turbo in text and code-related tasks, while offering enhanced capabilities in understanding non-English languages, visual content, and audio.

Model Specifications

Technical details and capabilities of GPT-4o

Core Specifications

Input / Output tokens: 128K / 16.4K

Release date: August 5, 2024
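Assuming the figures above correspond to a 128,000-token context window shared by prompt and completion, with completions capped at 16,384 tokens, a quick pre-flight check of a prompt's token budget might look like the sketch below (it requires a recent tiktoken release that knows the gpt-4o encoding).

```python
import tiktoken  # OpenAI's open-source tokenizer; recent versions map gpt-4o to o200k_base

CONTEXT_WINDOW = 128_000    # total tokens shared by prompt and completion (assumed reading of "128K")
MAX_OUTPUT_TOKENS = 16_384  # completion cap (assumed reading of the "16.4K" output figure)

def fits_in_context(prompt: str, reserved_output: int = MAX_OUTPUT_TOKENS) -> bool:
    """Return True if the prompt still leaves room for the reserved completion length."""
    enc = tiktoken.encoding_for_model("gpt-4o")
    prompt_tokens = len(enc.encode(prompt))
    return prompt_tokens + reserved_output <= CONTEXT_WINDOW

print(fits_in_context("Summarize the attached quarterly report in three bullet points."))
```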

Capabilities & License

Multimodal Support: Supported
Web Hydrated: No
License: Proprietary

Resources

API Reference: https://platform.openai.com/docs/api-reference
Playground: https://chat.openai.com/

Performance Insights

The benchmark results below summarize how GPT-4o handles a range of AI tasks.

Benchmark scores (0-100 scale):

AI2D: 94.2
DocVQA: 92.8
MGSM: 90.5
HumanEval: 90.2
RepoQA 32k: 90
MMLU: 88.7 (also listed as 88)
MBPPPlus: 88
ChartQA: 85.7
IFEval: 84
DROP: 83.4
MATH: 76.6
MMLU-Pro: 74.7 (also listed as 72.6)
BFCL: 74
EgoSchema: 72.2
SQL: 70
MMMU: 69.1
MathVista: 63.8
ActivityNet: 61.9
SimpleQA: 61.8
Taubench Retail: 60
GPQA: 53.6
Taubench Airline: 41
Aider Polyglot: 27.1
AIME 2024: 13.4
Codeforces: 11

Detailed Benchmarks

Dive deeper into GPT-4o's performance across specific task categories. Each category below lists GPT-4o's score alongside the average across the compared models.

Math

AIME 2024: 13.4 (average: 62.3%)

Coding

Codeforces: 11 (average: 52.4%)
Aider Polyglot: 27.1 (average: 51.2%)

Reasoning

DROP: 83.4 (average: 82.5%)

Knowledge

MMLU: 88.7 (average: 87.0%)
GPQA: 53.6 (average: 55.3%)
MATH: 76.6 (average: 74.3%)

Non categorized

MGSM: 90.5 (average: 86.0%)
MathVista: 63.8 (average: 58.6%)
MMLU-Pro: 74.7 (average: 72.0%)
MMMU: 69.1 (average: 63.6%)
AI2D: 94.2 (average: 90.5%)
DocVQA: 92.8 (average: 91.7%)
EgoSchema: 72.2 (average: 66.5%)
SimpleQA: 61.8 (average: 45.1%)
IFEval: 84 (average: 83.2%)
Taubench Retail: 60 (average: 58.3%)
Taubench Airline: 41 (average: 38.0%)
BFCL: 74 (average: 71.8%)
MBPPPlus: 88 (average: 89.3%)
SQL: 70 (average: 66.3%)
RepoQA 32k: 90 (average: 89.0%)
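As a quick way to read these numbers, the short sketch below sorts a handful of the benchmarks by how far GPT-4o sits above or below the average of the compared models. The scores are transcribed from this page; the selection of benchmarks is arbitrary.

```python
# (GPT-4o score, average of compared models), both on a 0-100 scale, from the tables above.
BENCHMARKS = {
    "AIME 2024":  (13.4, 62.3),
    "Codeforces": (11.0, 52.4),
    "DROP":       (83.4, 82.5),
    "MMLU":       (88.7, 87.0),
    "GPQA":       (53.6, 55.3),
    "MATH":       (76.6, 74.3),
    "SimpleQA":   (61.8, 45.1),
    "DocVQA":     (92.8, 91.7),
}

# Sort by GPT-4o's margin over the cross-model average, largest lead first.
for name, (gpt4o, avg) in sorted(BENCHMARKS.items(), key=lambda kv: kv[1][0] - kv[1][1], reverse=True):
    delta = gpt4o - avg
    print(f"{name:12s}  GPT-4o {gpt4o:5.1f}  avg {avg:5.1f}  delta {delta:+5.1f}")
```

The pattern this surfaces is already visible in the tables: GPT-4o sits well above the average on factual and document benchmarks such as SimpleQA and DocVQA, and well below it on competition-style math and coding tasks such as AIME 2024 and Codeforces.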

Provider Pricing: Coming Soon

We're working on gathering comprehensive pricing data for GPT-4o from all major providers. Once it's available, you'll be able to compare costs across platforms and find the best pricing for your use case.

