Anthropic's Claude Opus 4.6 has led the LMArena leaderboard (the successor to LMSYS Chatbot Arena) since early March 2026, ranking first in text, code, document, and reasoning benchmarks after a run of releases that included OpenAI's GPT-5.4 on March 5, Google's Gemini 3.1 Pro, and xAI's Grok 4.20 beta. That lead explains the subdued odds for challengers in Polymarket's multi-outcome market. Even so, trader consensus prices OpenAI at a 29% implied probability and xAI at 16% to claim the #1 spot by June 30, driven by OpenAI's historical edge in balanced capabilities and xAI's rapid multimodal advances, such as Grok Imagine topping the video arenas. Key catalysts include Meta's Llama 4 launch around April 5 and potential frontier model releases from major labs, with crowdsourced battles keeping the leaderboard volatile.
Experimental AI summary based on Polymarket data · Updated
$1,310,406 volume

OpenAI: 29%
xAI: 18%
Alibaba: 8%
DeepSeek: 7%
Meta: 7%
Z.ai: 6%
Baidu: 4%
Nvidia: 4%
Meituan: 3%
Mistral: 41%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/ with the style control unchecked will be used to resolve this market.
If a listed model ties for #1 Arena score, it will suffice to resolve this market to "Yes."
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source becomes unavailable, the market will remain open until it is accessible again. If it becomes permanently unavailable, resolution will be based on another credible source.
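The tie rule above can be illustrated with a minimal sketch. It assumes the Arena Score rows (style control unchecked) have already been copied from the leaderboard by hand; the `Entry` type, the `resolve_yes` helper, and all company names and scores below are hypothetical placeholders, not real leaderboard data or any official Polymarket logic.

```python
from typing import Iterable, NamedTuple, Set


class Entry(NamedTuple):
    """One leaderboard row (hypothetical structure for illustration)."""
    model: str
    company: str
    arena_score: float


def resolve_yes(entries: Iterable[Entry], listed: Set[str]) -> Set[str]:
    """Return the listed companies whose model holds, or ties for, the top Arena Score."""
    rows = list(entries)
    if not rows:
        return set()
    top = max(row.arena_score for row in rows)
    # A tie for #1 is enough, so every row matching the top score counts.
    return {row.company for row in rows if row.arena_score == top and row.company in listed}


# Invented example data: CompanyA and CompanyB tie for first place.
sample = [
    Entry("model-a", "CompanyA", 1500.0),
    Entry("model-b", "CompanyB", 1500.0),
    Entry("model-c", "CompanyC", 1480.0),
]
winners = resolve_yes(sample, listed={"CompanyA", "CompanyC"})
print(winners)
```

In this invented example only CompanyA resolves to "Yes": CompanyB ties for #1 but is not among the listed outcomes, and CompanyC is listed but trails the top score.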
Market opened: Dec 22, 2025, 5:28 PM ET
Resolver
0x65070BE91...
Frequently Asked Questions