WebDev Arena Leaderboard

WebDev Arena is a real-time AI coding competition, developed by LMArena, in which models go head-to-head in web development challenges.

Leaderboard

Rank  Model        Organization  Arena Score  License        95% CI            Votes
----  -----        ------------  -----------  -------        ------            -----
-     -            -             1403.04      Proprietary    +8.77 / -8.57     6,835
-     -            -             1389.87      MIT            +10.18 / -7.87    4,466
-     -            -             1379.95      Proprietary    +8.49 / -6.51     8,740
#4    -            -             1361.45      Apache 2.0     +9.05 / -8.87     6,172
#2    -            -             1361.45      MIT            +22.72 / -21.05   905
-     -            -             1358.99      Proprietary    +7.95 / -7.67     7,955
-     -            -             1358.40      Proprietary    +9.79 / -8.74     7,460
-     -            -             1355.04      MIT            +24.24 / -23.94   831
#9    -            MoonshotAI    1312.70      Modified MIT   +7.55 / -7.11     5,828
-     -            -             1289.74      Proprietary    +7.97 / -5.35     7,489
-     -            -             1253.40      Proprietary    +7.43 / -6.80     10,709
-     -            -             1238.14      Proprietary    +4.17 / -5.94     26,267
#13   -            -             1207.94      MIT            +15.16 / -19.31   1,094
#13   DeepSeek-R1  DeepSeek      1199.44      MIT            +11.30 / -9.47    3,755
-     -            -             1192.59      Proprietary    +6.53 / -6.47     9,064
#13   -            -             1189.64      Apache 2.0     +7.19 / -7.79     5,600
#13   -            -             1186.20      Proprietary    +9.70 / -7.83     5,572
-     -            -             1180.67      Proprietary    +8.21 / -8.88     7,028
#15   -            -             1177.84      Proprietary    +8.76 / -8.16     6,480
-     -            -             1143.28      Proprietary    +6.23 / -7.74     5,764
-     -            -             1136.73      Proprietary    +13.14 / -12.81   2,979
-     -            -             1133.44      Proprietary    +5.40 / -5.41     22,213
#20   MiniMax-M1   MiniMax       1130.10      MIT            +10.77 / -9.39    3,361
-     -            -             1118.45      Proprietary    +5.77 / -7.41     8,300
-     -            -             1092.17      Proprietary    +10.04 / -8.80    6,369
-     -            -             1089.74      Proprietary    +3.58 / -5.99     11,859
#27   -            -             1045.16      Proprietary    +5.89 / -6.05     9,235
-     -            -             1042.58      Proprietary    +5.79 / -6.52     13,688
-     -            -             1040.27      Proprietary    +5.99 / -6.52     10,498
-     -            -             1029.79      Proprietary    +14.95 / -18.68   1,058
-     -            -             1026.95      Llama 4        +7.31 / -11.44    5,474
-     -            -             980.05       Proprietary    +5.54 / -6.31     14,454
#32   -            -             975.53       Proprietary    +6.23 / -8.63     11,072
-     -            -             964.00       Proprietary    +5.97 / -5.81     18,601
#33   DeepSeek-V3  DeepSeek      959.78       DeepSeek       +7.86 / -6.83     7,699
-     -            -             901.97       Apache 2.0     +4.78 / -5.31     16,199
-     -            -             901.32       Llama 4        +27.33 / -21.45   687
-     -            -             892.55       Proprietary    +5.52 / -7.44     15,159
-     -            -             809.53       Llama 3.1      +18.30 / -16.17   1,117

More Statistics for WebDev Arena (Overall)

Confidence Interval for Model Strength

Figure 1
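
The leaderboard reports each 95% CI as an asymmetric pair such as "+8.77 / -8.57" around the Arena Score. As a rough way to read these intervals, the sketch below converts the top three rows of the leaderboard into lower and upper bounds and checks whether they overlap. The helper names `ci_bounds` and `intervals_overlap` are illustrative, and interval overlap is only an informal heuristic here, not the site's ranking rule or a formal significance test.

```python
def ci_bounds(score, plus, minus):
    """Return (lower, upper) for a score reported as 'score +plus / -minus'."""
    return score - minus, score + plus


def intervals_overlap(a, b):
    """Return True if two (lower, upper) intervals share at least one point."""
    return a[0] <= b[1] and b[0] <= a[1]


# Top three rows of the leaderboard: (Arena Score, +CI, -CI).
first = ci_bounds(1403.04, 8.77, 8.57)
second = ci_bounds(1389.87, 10.18, 7.87)
third = ci_bounds(1379.95, 8.49, 6.51)

print(intervals_overlap(first, second))  # True: the intervals share a range
print(intervals_overlap(first, third))   # False: the intervals are disjoint
```

Read this way, the first and second entries are not clearly separated by the reported intervals, while the first and third are.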

Average Win Rate Against All Other Models (Assuming Uniform Sampling and No Ties)

Figure 2
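
To illustrate how an average win rate could be derived from Arena Scores, the sketch below assumes an Elo-style logistic model with base 10 and a scale of 400 points. That scale, and the function names `win_probability` and `average_win_rate`, are assumptions for illustration; the page itself does not state the formula behind the figure. "Uniform sampling" is modeled as an unweighted mean over all opponents, and ties are ignored.

```python
def win_probability(score_a, score_b, scale=400.0, base=10.0):
    """Probability that A beats B under an assumed Elo-style logistic model."""
    return 1.0 / (1.0 + base ** ((score_b - score_a) / scale))


def average_win_rate(scores):
    """Mean win probability of each model against every other model."""
    rates = []
    for i, own in enumerate(scores):
        vs_others = [
            win_probability(own, other)
            for j, other in enumerate(scores)
            if j != i
        ]
        rates.append(sum(vs_others) / len(vs_others))
    return rates


# Three Arena Scores taken from the leaderboard (top row, DeepSeek-R1, DeepSeek-V3).
scores = [1403.04, 1199.44, 959.78]
for score, rate in zip(scores, average_win_rate(scores)):
    print(f"{score:.2f}: {rate:.3f}")
```

Under this assumed model, a 400-point gap corresponds to a 10-to-1 odds advantage for the higher-rated model.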

Fraction of Model A Wins for All Non-tied A vs. B Battles

Figure 3

Battle Count for Each Combination of Models (without Ties)

Figure 4
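
The last two figures summarize pairwise results: how many non-tied battles each pair of models fought, and what fraction of them model A won. A minimal sketch of both tallies follows, assuming each battle is recorded as a hypothetical `(model_a, model_b, winner)` tuple in which `winner` is one of the two model names or the string "tie". The record format, the model names, and the function `pairwise_stats` are invented for illustration and are not taken from the WebDev Arena data.

```python
from collections import Counter


def pairwise_stats(battles):
    """Count non-tied battles per ordered pair and the fraction won by the first model."""
    counts = Counter()
    wins = Counter()
    for a, b, winner in battles:
        if winner not in (a, b):
            continue  # skip ties, as the figure titles specify
        # Record the battle under both orderings so either model can be "A".
        counts[(a, b)] += 1
        counts[(b, a)] += 1
        loser = b if winner == a else a
        wins[(winner, loser)] += 1
    fractions = {pair: wins[pair] / n for pair, n in counts.items()}
    return counts, fractions


# Hypothetical battle log with placeholder model names.
battles = [
    ("x", "y", "x"),
    ("x", "y", "y"),
    ("y", "x", "x"),
    ("x", "z", "tie"),
    ("x", "z", "x"),
]
counts, fractions = pairwise_stats(battles)
print(counts[("x", "y")])     # 3 non-tied battles between x and y
print(fractions[("x", "y")])  # fraction of those battles won by x
```

Keeping counts next to fractions matters when reading such a matrix, since a win fraction based on a handful of battles is far less reliable than one based on hundreds.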