Anthropic's Claude 3.5 Sonnet, released June 20, has surged to the top of the LMSYS Chatbot Arena leaderboard—the leading crowd-sourced benchmark for large language model capabilities—outperforming OpenAI's GPT-4o in coding, math, vision, and agentic tasks with scores exceeding previous records. This development has shifted trader consensus toward Anthropic's competitive edge, reflecting real-money bets on its sustained leadership amid rapid AI iteration cycles. OpenAI retains strengths in multimodal integration, while Meta's anticipated Llama 3.1 405B model, teased for an imminent launch, and potential updates from Google DeepMind or xAI Grok could disrupt rankings before June 30. Traders eye fresh benchmark evals and surprise releases as pivotal catalysts in this fluid race.
Experimental AI-generated summary referencing Polymarket data · Updated
$1,132,626 Vol.

OpenAI: 30%
xAI: 16%
DeepSeek: 8%
Alibaba: 6%
Meta: 6%
Z.ai: 5%
Baidu: 4%
Nvidia: 3%
Mistral: 3%
Meituan: 3%

Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
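The tie rule above can be illustrated with a minimal sketch. It assumes the leaderboard has already been read into a mapping from model name to an (owner, Arena score) pair. The function name `resolve_yes`, the model names, the owner names, and the scores are all hypothetical and are not taken from lmarena.ai or from Polymarket. The sketch covers only the tie rule and does not model the fallback for an unavailable resolution source.

```python
from typing import Dict, List, Tuple


def resolve_yes(scores: Dict[str, Tuple[str, float]]) -> List[str]:
    """Return the owners whose model holds, or ties for, the top Arena score.

    `scores` maps a model name to (owner, arena_score). This layout is an
    assumption made for illustration, not the leaderboard's actual format.
    """
    if not scores:
        return []
    # The #1 Arena score is simply the maximum score on the board.
    top = max(score for _, score in scores.values())
    # Any owner with a model at that score resolves to "Yes", ties included.
    return sorted({owner for owner, score in scores.values() if score == top})


# Hypothetical leaderboard rows (made-up names and scores).
example = {
    "model-a": ("LabA", 1500.0),
    "model-b": ("LabB", 1500.0),
    "model-c": ("LabC", 1490.0),
}

print(resolve_yes(example))  # ['LabA', 'LabB']
```

In this made-up example two owners tie at the top score, so both would resolve to "Yes" under the rule, while the third would not.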
Market Opened: Dec 22, 2025, 5:28 PM ET
Resolver: 0x65070BE91...



Frequently Asked Questions