Recent leaderboard volatility on the LMSYS Chatbot Arena has intensified competition among AI labs, with OpenAI's GPT-5.4 reclaiming the top Elo score in March 2026 before Anthropic's Claude Opus 4.6 surged ahead in style control benchmarks this month. Traders weigh these rapid shifts—driven by frequent model releases from Google DeepMind's Gemini 3.1 Pro, xAI's Grok updates, and challengers like DeepSeek—against historical patterns of post-launch dominance. Upcoming catalysts, including rumored GPT-5.5, full Claude 5 rollout, and Llama 4 by mid-year, could propel any frontrunner to #1 by June 30, reflecting aggregated skin-in-the-game consensus on release timelines and benchmark gains amid uncertain AI capability leaps.
Résumé expérimental généré par IA à partir des données Polymarket · Mis à jour$1,402,570 Vol.

OpenAI
27%

xAI
12%

Alibaba
10%

DeepSeek
7%

Z.ai
5%

Meta
4%

Nvidia
4%

Baidu
4%

Mistral
4%

Meituan
3%
$1,402,570 Vol.

OpenAI
27%

xAI
12%

Alibaba
10%

DeepSeek
7%

Z.ai
5%

Meta
4%

Nvidia
4%

Baidu
4%

Mistral
4%

Meituan
3%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
Marché ouvert : Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
Resolver
0x65070BE91...Recent leaderboard volatility on the LMSYS Chatbot Arena has intensified competition among AI labs, with OpenAI's GPT-5.4 reclaiming the top Elo score in March 2026 before Anthropic's Claude Opus 4.6 surged ahead in style control benchmarks this month. Traders weigh these rapid shifts—driven by frequent model releases from Google DeepMind's Gemini 3.1 Pro, xAI's Grok updates, and challengers like DeepSeek—against historical patterns of post-launch dominance. Upcoming catalysts, including rumored GPT-5.5, full Claude 5 rollout, and Llama 4 by mid-year, could propel any frontrunner to #1 by June 30, reflecting aggregated skin-in-the-game consensus on release timelines and benchmark gains amid uncertain AI capability leaps.
Résumé expérimental généré par IA à partir des données Polymarket · Mis à jour
Méfiez-vous des liens externes.
Méfiez-vous des liens externes.
Questions fréquentes