leaderboards

Track model performance across benchmarks

🥈
G

Gemini 3.1 Pro

Google

78.2
🥇
O

GPT-5.2 Pro

OpenAI

78.9
🥉
A

Claude Opus 4.6

Anthropic

77.8
1
O

GPT-5.2 Pro

OpenAI

78.9
2
G

Gemini 3.1 Pro

Google

78.2
3
A

Claude Opus 4.6

Anthropic

77.8
4
O

GPT-5.2

OpenAI

75.7
5
G

Gemini 3 Pro

Google

74.1
6
O

GPT-5

OpenAI

73.0
7
G

Gemini 3 Flash

Google

71.3
8
D

DeepSeek-R2

DeepSeek

71.2
9
O

o1

OpenAI

69.8
10
O

GPT-4.5

OpenAI

68.0
11
Q

Qwen3 VL 235B

Alibaba

66.2
12
O

o1-preview

OpenAI

65.9
13
Mi

Mistral Large 2

Mistral AI

60.3
14
M

LLaMA 3.3 70B

Meta

55.9
15
P

Phi-4 14B

Microsoft

51.5

15

Models ranked

69.2

Average score

GPT-5.2 Pro

Current leader