Trader sentiment on which company claims the #1 AI model by June 30 hinges on LMSYS Chatbot Arena leaderboard dominance, where Anthropic's Claude 3.5 Sonnet holds the top ELO score (~1285) after surging past OpenAI's o1-preview in late November 2024. OpenAI retains ~55% implied probability from rapid iteration on reasoning-focused o1 models, bolstered by partnerships like Apple's integration. Competitive pressures intensify with xAI's Grok-3 launch expected imminently post-training completion, potentially leveraging 100K+ H100 GPUs for frontier performance, while Google's Gemini 2.0 Flash trails but eyes multimodal gains. Key watchpoints include December releases, CES 2025 demos, and Q1 earnings previews, as leaderboard volatility—driven by Elo fluctuations from user votes—often precedes sustained leads amid scaling laws favoring compute-heavy incumbents.
基於Polymarket數據的AI實驗性摘要 · 更新於$1,136,607 交易量

OpenAI
30%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Nvidia
3%

美團
3%

Mistral
2%
$1,136,607 交易量

OpenAI
30%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Nvidia
3%

美團
3%

Mistral
2%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
市場開放時間: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...Trader sentiment on which company claims the #1 AI model by June 30 hinges on LMSYS Chatbot Arena leaderboard dominance, where Anthropic's Claude 3.5 Sonnet holds the top ELO score (~1285) after surging past OpenAI's o1-preview in late November 2024. OpenAI retains ~55% implied probability from rapid iteration on reasoning-focused o1 models, bolstered by partnerships like Apple's integration. Competitive pressures intensify with xAI's Grok-3 launch expected imminently post-training completion, potentially leveraging 100K+ H100 GPUs for frontier performance, while Google's Gemini 2.0 Flash trails but eyes multimodal gains. Key watchpoints include December releases, CES 2025 demos, and Q1 earnings previews, as leaderboard volatility—driven by Elo fluctuations from user votes—often precedes sustained leads amid scaling laws favoring compute-heavy incumbents.
基於Polymarket數據的AI實驗性摘要 · 更新於
警惕外部連結哦。
警惕外部連結哦。
Frequently Asked Questions