Anthropic's Claude 3.5 Sonnet has claimed the top spot on the LMSYS Chatbot Arena leaderboard since its June 20 release, edging out OpenAI's GPT-4o with a leading Elo score of 1272 versus 1269, reflecting superior performance in user-blinded benchmarks for coding, math, and reasoning. This shift underscores intensifying competition among large language models, with Meta's open-source Llama 3 405B and Google's Gemini 1.5 Pro trailing closely. Trader sentiment hinges on these real-time capability demonstrations, backed by skin-in-the-game wagers. As June 30 nears, eyes are on potential last-minute releases or updates from OpenAI, Google DeepMind, or xAI's Grok that could reclaim leadership before market resolution.
基于Polymarket数据的AI实验性摘要 · 更新于$1,137,002 交易量

OpenAI
29%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Mistral
4%

英伟达
3%

美团
3%
$1,137,002 交易量

OpenAI
29%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Mistral
4%

英伟达
3%

美团
3%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
市场开放时间: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...Anthropic's Claude 3.5 Sonnet has claimed the top spot on the LMSYS Chatbot Arena leaderboard since its June 20 release, edging out OpenAI's GPT-4o with a leading Elo score of 1272 versus 1269, reflecting superior performance in user-blinded benchmarks for coding, math, and reasoning. This shift underscores intensifying competition among large language models, with Meta's open-source Llama 3 405B and Google's Gemini 1.5 Pro trailing closely. Trader sentiment hinges on these real-time capability demonstrations, backed by skin-in-the-game wagers. As June 30 nears, eyes are on potential last-minute releases or updates from OpenAI, Google DeepMind, or xAI's Grok that could reclaim leadership before market resolution.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题