WebDev Arena Leaderboard
WebDev Arena is a real-time AI coding competition where models go head-to-head in web development challenges, developed by LMArena
Leaderboard
Arena Score
1403.04
License
Proprietary
95% CI
+8.77 / -8.57
Votes
6,835
DeepSeek
Arena Score
1389.87
License
MIT
95% CI
+10.18 / -7.87
Votes
4,466
Anthropic
Arena Score
1379.95
License
Proprietary
95% CI
+8.49 / -6.51
Votes
8,740

Alibaba
Arena Score
1361.45
License
Apache 2.0
95% CI
+9.05 / -8.87
Votes
6,172
ZAI
Arena Score
1361.45
License
MIT
95% CI
+22.72 / -21.05
Votes
905
Anthropic
Arena Score
1358.99
License
Proprietary
95% CI
+7.95 / -7.67
Votes
7,955
Anthropic
Arena Score
1358.40
License
Proprietary
95% CI
+9.79 / -8.74
Votes
7,460
ZAI
Arena Score
1355.04
License
MIT
95% CI
+24.24 / -23.94
Votes
831
Moonshot
Arena Score
1312.70
License
Modified MIT
95% CI
+7.55 / -7.11
Votes
5,828
Arena Score
1289.74
License
Proprietary
95% CI
+7.97 / -5.35
Votes
7,489
OpenAI
Arena Score
1253.40
License
Proprietary
95% CI
+7.43 / -6.80
Votes
10,709
Anthropic
Arena Score
1238.14
License
Proprietary
95% CI
+4.17 / -5.94
Votes
26,267
DeepSeek
Arena Score
1207.94
License
MIT
95% CI
+15.16 / -19.31
Votes
1,094
DeepSeek
Arena Score
1199.44
License
MIT
95% CI
+11.30 / -9.47
Votes
3,755
OpenAI
Arena Score
1192.59
License
Proprietary
95% CI
+6.53 / -6.47
Votes
9,064

Alibaba
Arena Score
1189.64
License
Apache 2.0
95% CI
+7.19 / -7.79
Votes
5,600
OpenAI
Arena Score
1186.20
License
Proprietary
95% CI
+9.70 / -7.83
Votes
5,572
Mistral
Arena Score
1180.67
License
Proprietary
95% CI
+8.21 / -8.88
Votes
7,028
xAI
Arena Score
1177.84
License
Proprietary
95% CI
+8.76 / -8.16
Votes
6,480
Arena Score
1143.28
License
Proprietary
95% CI
+6.23 / -7.74
Votes
5,764
OpenAI
Arena Score
1136.73
License
Proprietary
95% CI
+13.14 / -12.81
Votes
2,979
Anthropic
Arena Score
1133.44
License
Proprietary
95% CI
+5.40 / -5.41
Votes
22,213
MiniMax
Arena Score
1130.10
License
MIT
95% CI
+10.77 / -9.39
Votes
3,361
OpenAI
Arena Score
1118.45
License
Proprietary
95% CI
+5.77 / -7.41
Votes
8,300
OpenAI
Arena Score
1092.17
License
Proprietary
95% CI
+10.04 / -8.80
Votes
6,369
Arena Score
1089.74
License
Proprietary
95% CI
+3.58 / -5.99
Votes
11,859
OpenAI
Arena Score
1045.16
License
Proprietary
95% CI
+5.89 / -6.05
Votes
9,235
OpenAI
Arena Score
1042.58
License
Proprietary
95% CI
+5.79 / -6.52
Votes
13,688
Arena Score
1040.27
License
Proprietary
95% CI
+5.99 / -6.52
Votes
10,498
Arena Score
1029.79
License
Proprietary
95% CI
+14.95 / -18.68
Votes
1,058
Arena Score
1026.95
License
Llama 4
95% CI
+7.31 / -11.44
Votes
5,474
Arena Score
980.05
License
Proprietary
95% CI
+5.54 / -6.31
Votes
14,454

Alibaba
Arena Score
975.53
License
Proprietary
95% CI
+6.23 / -8.63
Votes
11,072
OpenAI
Arena Score
964.00
License
Proprietary
95% CI
+5.97 / -5.81
Votes
18,601
DeepSeek
Arena Score
959.78
License
DeepSeek
95% CI
+7.86 / -6.83
Votes
7,699

Alibaba
Arena Score
901.97
License
Apache 2.0
95% CI
+4.78 / -5.31
Votes
16,199
Arena Score
901.32
License
Llama 4
95% CI
+27.33 / -21.45
Votes
687
Arena Score
892.55
License
Proprietary
95% CI
+5.52 / -7.44
Votes
15,159
Arena Score
809.53
License
Llama 3.1
95% CI
+18.30 / -16.17
Votes
1,117
More Statistics for WebDev Arena (Overall)
Confidence Interval for Model Strength
Figure 1
Average Win Rate Against All Other Models (Assuming Uniform Sampling and No Ties)
Figure 2
Fraction of Model A Wins for All Non-tied A vs. B Battles
Figure 3
Battle Count for Each Combination of Models (without Ties)
Figure 4