WebDev Arena Leaderboard
WebDev Arena is a real-time AI coding competition where models go head-to-head in web development challenges, developed by LMArena
Leaderboard
Arena Score
1443.22
License
Proprietary
95% CI
+16.23 / -16.87
Votes
1,872
Anthropic
Arena Score
1411.98
License
Proprietary
95% CI
+15.25 / -13.93
Votes
2,466
Arena Score
1408.20
License
Proprietary
95% CI
+14.65 / -13.74
Votes
3,858
Anthropic
Arena Score
1389.18
License
Proprietary
95% CI
+15.21 / -14.98
Votes
2,078
Anthropic
Arena Score
1357.13
License
Proprietary
95% CI
+8.68 / -7.77
Votes
7,481
Arena Score
1311.55
License
Proprietary
95% CI
+10.24 / -13.76
Votes
2,626
OpenAI
Arena Score
1256.03
License
Proprietary
95% CI
+8.79 / -7.56
Votes
5,489
Anthropic
Arena Score
1237.72
License
Proprietary
95% CI
+5.62 / -4.56
Votes
26,338
DeepSeek
Arena Score
1206.73
License
MIT
95% CI
+18.65 / -16.07
Votes
1,097
DeepSeek
Arena Score
1198.26
License
MIT
95% CI
+9.00 / -11.30
Votes
3,769
OpenAI
Arena Score
1188.34
License
Proprietary
95% CI
+8.08 / -9.78
Votes
4,514
OpenAI
Arena Score
1186.65
License
Proprietary
95% CI
+10.53 / -9.05
Votes
3,611

Alibaba
Arena Score
1183.37
License
Apache 2.0
95% CI
+12.80 / -13.57
Votes
2,634
Mistral
Arena Score
1167.45
License
Proprietary
95% CI
+13.51 / -13.47
Votes
2,139
Arena Score
1143.71
License
Proprietary
95% CI
+9.10 / -9.58
Votes
3,262
xAI
Arena Score
1142.79
License
Proprietary
95% CI
+6.89 / -7.33
Votes
6,284
OpenAI
Arena Score
1136.26
License
Proprietary
95% CI
+10.19 / -12.75
Votes
2,984
Anthropic
Arena Score
1132.85
License
Proprietary
95% CI
+4.34 / -4.92
Votes
21,827
OpenAI
Arena Score
1099.59
License
Proprietary
95% CI
+9.89 / -13.05
Votes
3,093
OpenAI
Arena Score
1091.68
License
Proprietary
95% CI
+7.35 / -8.36
Votes
6,391
Arena Score
1088.85
License
Proprietary
95% CI
+6.19 / -6.66
Votes
11,936
OpenAI
Arena Score
1044.86
License
Proprietary
95% CI
+6.26 / -7.73
Votes
9,271
OpenAI
Arena Score
1041.81
License
Proprietary
95% CI
+6.57 / -5.78
Votes
13,828
Arena Score
1039.93
License
Proprietary
95% CI
+5.68 / -6.02
Votes
10,533
Arena Score
1029.57
License
Proprietary
95% CI
+16.88 / -17.28
Votes
1,064
Arena Score
1026.37
License
Llama 4
95% CI
+8.46 / -8.46
Votes
5,363
Arena Score
980.20
License
Proprietary
95% CI
+4.78 / -6.32
Votes
14,485

Alibaba
Arena Score
974.98
License
Proprietary
95% CI
+4.90 / -6.62
Votes
11,110
OpenAI
Arena Score
964.00
License
Proprietary
95% CI
+4.67 / -5.17
Votes
18,637
DeepSeek
Arena Score
959.75
License
DeepSeek
95% CI
+6.58 / -7.33
Votes
7,717

Alibaba
Arena Score
902.25
License
Apache 2.0
95% CI
+4.26 / -6.21
Votes
16,252
Arena Score
900.16
License
Llama 4
95% CI
+26.71 / -25.64
Votes
692
Arena Score
892.47
License
Proprietary
95% CI
+5.86 / -6.54
Votes
15,201
Arena Score
809.61
License
Llama 3.1
95% CI
+19.02 / -17.33
Votes
1,117
More Statistics for WebDev Arena (Overall)
Confidence Interval for Model Strength
Figure 1
Average Win Rate Against All Other Models (Assuming Uniform Sampling and No Ties)
Figure 2
Fraction of Model A Wins for All Non-tied A vs. B Battles
Figure 3
Battle Count for Each Combination of Models (without Ties)
Figure 4