Trader consensus on Polymarket heavily favors Anthropic at 66.5% implied probability for the best AI model by end of June, propelled by its Claude Opus 4.6 topping the LMSYS Chatbot Arena leaderboard with superior ELO scores in coding (2,002) and chat arenas (1,476) as of March 28, alongside a March 27 data leak revealing "Claude Mythos"—an unreleased model touted internally as dramatically outperforming Opus in reasoning, coding, and cybersecurity benchmarks. Google's Gemini 3.1 Pro trails closely at 23.5%, excelling in GPQA (94.3%) but lacking a comparable near-term upgrade signal. OpenAI's GPT-5.4 lags at 6.5% amid stagnant recent capabilities, while xAI and others remain fringe. Watch for Mythos rollout, Gemini 4 previews, or OpenAI counters before resolution.
基于Polymarket数据的AI实验性摘要 · 更新于Anthropic 66.5%
谷歌 24%
OpenAI 7%
xAI 1.7%
$2,884,596 交易量
$2,884,596 交易量

Anthropic
66%

谷歌
24%

OpenAI
7%

xAI
2%

分组项标题:DeepSeek
1%

Z.ai
<1%

阿里巴巴
<1%

分组项标题:Mistral
<1%

Moonshot
<1%

美团
<1%
Anthropic 66.5%
谷歌 24%
OpenAI 7%
xAI 1.7%
$2,884,596 交易量
$2,884,596 交易量

Anthropic
66%

谷歌
24%

OpenAI
7%

xAI
2%

分组项标题:DeepSeek
1%

Z.ai
<1%

阿里巴巴
<1%

分组项标题:Mistral
<1%

Moonshot
<1%

美团
<1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the top arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g., if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
市场开放时间: Oct 10, 2025, 5:27 PM ET
Resolver
0x2F5e3684c...Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the top arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g., if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x2F5e3684c...Trader consensus on Polymarket heavily favors Anthropic at 66.5% implied probability for the best AI model by end of June, propelled by its Claude Opus 4.6 topping the LMSYS Chatbot Arena leaderboard with superior ELO scores in coding (2,002) and chat arenas (1,476) as of March 28, alongside a March 27 data leak revealing "Claude Mythos"—an unreleased model touted internally as dramatically outperforming Opus in reasoning, coding, and cybersecurity benchmarks. Google's Gemini 3.1 Pro trails closely at 23.5%, excelling in GPQA (94.3%) but lacking a comparable near-term upgrade signal. OpenAI's GPT-5.4 lags at 6.5% amid stagnant recent capabilities, while xAI and others remain fringe. Watch for Mythos rollout, Gemini 4 previews, or OpenAI counters before resolution.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题