Anthropic's Claude 3.5 Sonnet has surged to the top of the LMSYS Chatbot Arena leaderboard since its June 20 release, demonstrating superior capabilities in coding, mathematics, and multimodal tasks over OpenAI's GPT-4o, reflecting trader consensus on its current edge as the leading large language model. This shift stems from benchmark gains validated by third-party evaluations, bolstering Anthropic's competitive positioning amid intensifying AI races. Meta's Llama 3.1, including a rumored 405B-parameter variant, is anticipated imminently and could disrupt the standings before June 30 resolution. Google DeepMind's Gemini updates and xAI's Grok lag behind, with no confirmed catalysts in the next week to alter the hierarchy significantly, though rapid evaluations keep markets volatile.
Experimental AI-generated summary referencing Polymarket data · Updated$1,137,017 Vol.

OpenAI
29%

xAI
16%

DeepSeek
8%

Alibaba
6%

Meta
6%

Z.ai
5%

Baidu
4%

Mistral
4%

Nvidia
3%

Meituan
3%
$1,137,017 Vol.

OpenAI
29%

xAI
16%

DeepSeek
8%

Alibaba
6%

Meta
6%

Z.ai
5%

Baidu
4%

Mistral
4%

Nvidia
3%

Meituan
3%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
Market Opened: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...Anthropic's Claude 3.5 Sonnet has surged to the top of the LMSYS Chatbot Arena leaderboard since its June 20 release, demonstrating superior capabilities in coding, mathematics, and multimodal tasks over OpenAI's GPT-4o, reflecting trader consensus on its current edge as the leading large language model. This shift stems from benchmark gains validated by third-party evaluations, bolstering Anthropic's competitive positioning amid intensifying AI races. Meta's Llama 3.1, including a rumored 405B-parameter variant, is anticipated imminently and could disrupt the standings before June 30 resolution. Google DeepMind's Gemini updates and xAI's Grok lag behind, with no confirmed catalysts in the next week to alter the hierarchy significantly, though rapid evaluations keep markets volatile.
Experimental AI-generated summary referencing Polymarket data · Updated



Beware of external links.
Beware of external links.
Frequently Asked Questions