Anthropic's Claude 3.5 Sonnet large language model, released June 20, has captured the #1 spot on the LMSYS Chatbot Arena leaderboard, edging out OpenAI's GPT-4o with superior Elo scores in coding, mathematics, and multimodal benchmarks, driving trader sentiment toward Anthropic's current edge in general-purpose AI capabilities. This marks a pivotal competitive shift, as Claude 3.5 demonstrates stronger agentic performance while OpenAI counters with o1-preview reasoning models that excel in complex problem-solving but trail in overall rankings. With June 30 approaching, market-implied odds hinge on potential last-minute releases or leaderboard volatility from Google DeepMind's Gemini updates or xAI's Grok iterations, though Meta's anticipated Llama 3.1 405B launch targets early July, likely missing the deadline. Traders should monitor daily benchmark fluctuations and official announcements for resolution catalysts.
基于Polymarket数据的AI实验性摘要 · 更新于$1,137,017 交易量

OpenAI
29%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Mistral
4%

英伟达
3%

美团
3%
$1,137,017 交易量

OpenAI
29%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Mistral
4%

英伟达
3%

美团
3%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
市场开放时间: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...Anthropic's Claude 3.5 Sonnet large language model, released June 20, has captured the #1 spot on the LMSYS Chatbot Arena leaderboard, edging out OpenAI's GPT-4o with superior Elo scores in coding, mathematics, and multimodal benchmarks, driving trader sentiment toward Anthropic's current edge in general-purpose AI capabilities. This marks a pivotal competitive shift, as Claude 3.5 demonstrates stronger agentic performance while OpenAI counters with o1-preview reasoning models that excel in complex problem-solving but trail in overall rankings. With June 30 approaching, market-implied odds hinge on potential last-minute releases or leaderboard volatility from Google DeepMind's Gemini updates or xAI's Grok iterations, though Meta's anticipated Llama 3.1 405B launch targets early July, likely missing the deadline. Traders should monitor daily benchmark fluctuations and official announcements for resolution catalysts.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题