Anthropic's Claude 3.5 Sonnet, released June 20, has captured the top spot on the LMSYS Chatbot Arena leaderboard—the key benchmark for large language model capabilities—overtaking OpenAI's GPT-4o via superior user-rated performance in coding, math, vision, and agentic tasks. This shift underscores accelerating competition among frontier AI labs, with Google DeepMind's Gemini 1.5 Pro and Meta's Llama 3 trailing but iterating rapidly on open-weight models. Traders are focused on leaderboard volatility driven by blind user votes, potential pre-June 30 counter-releases from OpenAI or xAI's Grok updates, and demonstrations against established benchmarks like MMLU or GPQA that could redefine #1 status before resolution.
Résumé expérimental généré par IA à partir des données Polymarket · Mis à jour$1,165,384 Vol.

OpenAI
29%

xAI
16%

DeepSeek
8%

Alibaba
8%

Meta
6%

Z.ai
5%

Baidu
4%

Mistral
4%

Nvidia
3%

Meituan
3%
$1,165,384 Vol.

OpenAI
29%

xAI
16%

DeepSeek
8%

Alibaba
8%

Meta
6%

Z.ai
5%

Baidu
4%

Mistral
4%

Nvidia
3%

Meituan
3%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
Marché ouvert : Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
Resolver
0x65070BE91...Anthropic's Claude 3.5 Sonnet, released June 20, has captured the top spot on the LMSYS Chatbot Arena leaderboard—the key benchmark for large language model capabilities—overtaking OpenAI's GPT-4o via superior user-rated performance in coding, math, vision, and agentic tasks. This shift underscores accelerating competition among frontier AI labs, with Google DeepMind's Gemini 1.5 Pro and Meta's Llama 3 trailing but iterating rapidly on open-weight models. Traders are focused on leaderboard volatility driven by blind user votes, potential pre-June 30 counter-releases from OpenAI or xAI's Grok updates, and demonstrations against established benchmarks like MMLU or GPQA that could redefine #1 status before resolution.
Résumé expérimental généré par IA à partir des données Polymarket · Mis à jour
Méfiez-vous des liens externes.
Méfiez-vous des liens externes.
Questions fréquentes