Anthropic's Claude 3.5 Sonnet currently holds the #1 spot on the LMSYS Chatbot Arena leaderboard with an Elo score above 1280, fueling trader optimism about its staying power through June 30. Polymarket odds nonetheless imply just a 35% chance that it holds, with OpenAI favored at 40% to overtake it via iterative o1-series improvements or GPT-5 previews. Intense competition from Google's experimental Gemini 2.0 Flash, Meta's open-source Llama 3.1 405B, and xAI's Grok-3 (trained on the massive Colossus cluster) drives volatility, as scaled-up compute and post-training optimizations frequently flip the rankings. Key catalysts include potential December updates from OpenAI, Google's next Gemini iteration, and hints of Claude 4 from Anthropic at DevDay events, amid uncertain resolution criteria tied to sustained leaderboard dominance.
Experimental AI-generated summary based on Polymarket data · Updated
$1,162,016 Vol.

OpenAI: 28%
xAI: 13%
DeepSeek: 8%
Z.ai: 7%
Alibaba: 6%
Meta: 6%
Baidu: 4%
Nvidia: 3%
Meituan: 3%
Mistral: 2%

Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
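
The tie rule above can be made concrete with a small sketch. It is illustrative only: the organization names and scores are hypothetical placeholders, not leaderboard data, and `top_organizations` is a helper invented here rather than part of any Polymarket or LMArena API. It assumes the Arena Score column (style control unchecked) has already been read into (organization, score) pairs.

```python
from typing import Iterable


def top_organizations(rows: Iterable[tuple[str, float]]) -> set[str]:
    """Return every organization with a model at the highest Arena Score.

    Ties are kept, mirroring the rule that a tie for #1 resolves to "Yes".
    """
    rows = list(rows)
    if not rows:
        return set()
    best = max(score for _, score in rows)
    return {org for org, score in rows if score == best}


# Hypothetical placeholder data, NOT real leaderboard values.
sample = [("OrgA", 1500.0), ("OrgB", 1500.0), ("OrgC", 1490.0)]
winners = top_organizations(sample)
for org in ("OrgA", "OrgB", "OrgC"):
    print(org, "Yes" if org in winners else "No")
```

In this made-up case both tied organizations would resolve to "Yes" and the third to "No".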
Market opened: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...