OpenAI's o1-preview model recently claimed the #1 spot on the LMSYS Chatbot Arena leaderboard, driving trader consensus with its breakthrough reasoning capabilities demonstrated in benchmarks like complex math and coding tasks, overtaking Anthropic's Claude 3.5 Sonnet after its September release. This reflects intensified competition among leading AI labs, where xAI's Grok-3—trained on massive Memphis Supercluster compute—is slated for December 2024 launch, potentially challenging with superior scale, while Google's Gemini 2.0 Flash excels in multimodal tasks and Meta's Llama 3.1 405B pushes open-source frontiers. Upcoming catalysts include NeurIPS conference disclosures in December and Q1 2025 model drops from OpenAI (possibly Orion) or Anthropic's Claude 4, amid fluid AI capability races where leaderboard volatility rewards rapid iteration.
基于Polymarket数据的AI实验性摘要 · 更新于$1,132,693 交易量

OpenAI
29%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Mistral
4%

英伟达
3%

美团
3%
$1,132,693 交易量

OpenAI
29%

xAI
16%

DeepSeek
8%

阿里巴巴
6%

Meta
6%

Z.ai
5%

百度
4%

Mistral
4%

英伟达
3%

美团
3%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
市场开放时间: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...OpenAI's o1-preview model recently claimed the #1 spot on the LMSYS Chatbot Arena leaderboard, driving trader consensus with its breakthrough reasoning capabilities demonstrated in benchmarks like complex math and coding tasks, overtaking Anthropic's Claude 3.5 Sonnet after its September release. This reflects intensified competition among leading AI labs, where xAI's Grok-3—trained on massive Memphis Supercluster compute—is slated for December 2024 launch, potentially challenging with superior scale, while Google's Gemini 2.0 Flash excels in multimodal tasks and Meta's Llama 3.1 405B pushes open-source frontiers. Upcoming catalysts include NeurIPS conference disclosures in December and Q1 2025 model drops from OpenAI (possibly Orion) or Anthropic's Claude 4, amid fluid AI capability races where leaderboard volatility rewards rapid iteration.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题