Back to Project Page

WebVR Leaderboard

Full results from the WebVR benchmark. Scores are reported for Global Aesthetics (GA), Navigation and Footer (NF), Section-Specific Layouts (SSL), Interaction and Motion (IM), and Overall.

At a Glance

Models Evaluated

19

Best Overall

79.14

Kimi-K2.5

Hardest Dimension

IM

Interaction and Motion

Best Open-source

79.14

Kimi-K2.5

Top Models

#1

Kimi-K2.5

79.14 Overall

Open-source

  • GA 87.44
  • NF 89.21
  • SSL 79.26
  • IM 60.10

#2

Claude-Sonnet-4.6

78.49 Overall

Closed-source

  • GA 87.16
  • NF 89.37
  • SSL 78.87
  • IM 59.06

#3

GPT-5.2-Thinking

77.93 Overall

Closed-source

  • GA 89.76
  • NF 89.08
  • SSL 77.27
  • IM 59.97

Dimension Leaders

GA

GPT-5.2-Thinking

89.76

NF

Claude-Sonnet-4.6

89.37

SSL

Kimi-K2.5

79.26

IM

Kimi-K2.5

60.10

Overall Ranking

Models are sorted by overall score. The top three rows are highlighted for quick comparison.

Rank Model Type GA NF SSL IM Overall
1Kimi-K2.5Open-source87.4489.2179.2660.1079.14
2Claude-Sonnet-4.6Closed-source87.1689.3778.8759.0678.49
3GPT-5.2-ThinkingClosed-source89.7689.0877.2759.9777.93
4Claude-Opus-4.6Closed-source87.6687.9878.6054.3377.33
5Gemini-3.1-Pro-PreviewClosed-source88.3087.2977.0956.5076.69
6Seed-2.0-ProClosed-source82.8886.2773.3545.8871.88
7Gemini-3.0-FlashClosed-source84.0585.1967.7448.4369.49
8Gemini-3.0-ProClosed-source80.8481.7966.3146.8667.32
9Gemini-2.5-ProClosed-source78.5980.1759.5648.6663.09
10Seed-1.8Closed-source75.0677.9562.2136.3361.98
11Qwen3.5-397B-A17BOpen-source80.4676.6258.8141.9661.33
12Claude-Sonnet-3.7Closed-source76.3880.5459.2637.3861.21
13Gemini-2.5-FlashClosed-source71.9570.9051.5239.7655.62
14Qwen3-VL-235B-A22B-ThinkingOpen-source61.2068.0443.1129.3046.80
15GPT-4.1Closed-source61.9164.4242.7026.7145.85
16Qwen3-VL-235B-A22B-InstructOpen-source51.0652.6540.0922.1240.71
17Qwen3-VL-30B-A3B-ThinkingOpen-source53.3860.4733.4920.2237.69
18Qwen3-VL-30B-A3B-InstructOpen-source33.3334.8717.7112.6721.44
19GLM-4.6VOpen-source22.7815.427.1714.3511.42

By Model Group

This view mirrors the grouping in the paper and makes it easier to compare model families within open-source and closed-source settings.

Group Model GA NF SSL IM Overall
Open-source Models
Open-sourceGLM-4.6V22.7815.427.1714.3511.42
Open-sourceQwen3-VL-30B-A3B-Instruct33.3334.8717.7112.6721.44
Open-sourceQwen3-VL-30B-A3B-Thinking53.3860.4733.4920.2237.69
Open-sourceQwen3-VL-235B-A22B-Instruct51.0652.6540.0922.1240.71
Open-sourceQwen3-VL-235B-A22B-Thinking61.2068.0443.1129.3046.80
Open-sourceQwen3.5-397B-A17B80.4676.6258.8141.9661.33
Open-sourceKimi-K2.587.4489.2179.2660.1079.14
Closed-source Models
Closed-sourceGPT-4.161.9164.4242.7026.7145.85
Closed-sourceGPT-5.2-Thinking89.7689.0877.2759.9777.93
Closed-sourceGemini-2.5-Flash71.9570.9051.5239.7655.62
Closed-sourceGemini-2.5-Pro78.5980.1759.5648.6663.09
Closed-sourceGemini-3.0-Flash84.0585.1967.7448.4369.49
Closed-sourceGemini-3.0-Pro80.8481.7966.3146.8667.32
Closed-sourceGemini-3.1-Pro-Preview88.3087.2977.0956.5076.69
Closed-sourceClaude-Sonnet-3.776.3880.5459.2637.3861.21
Closed-sourceClaude-Sonnet-4.687.1689.3778.8759.0678.49
Closed-sourceClaude-Opus-4.687.6687.9878.6054.3377.33
Closed-sourceSeed-1.875.0677.9562.2136.3361.98
Closed-sourceSeed-2.0-Pro82.8886.2773.3545.8871.88