Trader consensus on Polymarket gives Anthropic a 94.5% implied probability of having the second-best AI model by March 31, driven primarily by the March 4 launch of Claude 3 Opus, which rocketed to #2 on the LMSYS Chatbot Arena leaderboard with an Elo score of 1264—trailing only OpenAI's GPT-4 Turbo. Anthropic's official benchmarks highlighted Opus outperforming GPT-4 in vision tasks, coding, and reasoning (e.g., 86.8% on MMLU), fueling bets amid sparse competition updates. Google's Gemini 1.5 Pro preview lagged in early evals, while xAI's Grok and others trailed further. Realistic challenges include a surprise late-March upgrade from OpenAI or Google displacing Opus, though no such announcements materialized.
Experimental AI-generated summary referencing Polymarket data · UpdatedAnthropic 95%
Google 2.5%
xAI 1.0%
OpenAI <1%
$971,695 Vol.
$971,695 Vol.

Anthropic
95%

2%

xAI
1%

OpenAI
1%

DeepSeek
<1%

Z.ai
<1%

Alibaba
<1%

Baidu
<1%

Moonshot
<1%

Mistral
<1%

Meituan
<1%
Anthropic 95%
Google 2.5%
xAI 1.0%
OpenAI <1%
$971,695 Vol.
$971,695 Vol.

Anthropic
95%

2%

xAI
1%

OpenAI
1%

DeepSeek
<1%

Z.ai
<1%

Alibaba
<1%

Baidu
<1%

Moonshot
<1%

Mistral
<1%

Meituan
<1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the second best arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g. if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Market Opened: Dec 2, 2025, 6:02 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...Trader consensus on Polymarket gives Anthropic a 94.5% implied probability of having the second-best AI model by March 31, driven primarily by the March 4 launch of Claude 3 Opus, which rocketed to #2 on the LMSYS Chatbot Arena leaderboard with an Elo score of 1264—trailing only OpenAI's GPT-4 Turbo. Anthropic's official benchmarks highlighted Opus outperforming GPT-4 in vision tasks, coding, and reasoning (e.g., 86.8% on MMLU), fueling bets amid sparse competition updates. Google's Gemini 1.5 Pro preview lagged in early evals, while xAI's Grok and others trailed further. Realistic challenges include a surprise late-March upgrade from OpenAI or Google displacing Opus, though no such announcements materialized.
Experimental AI-generated summary referencing Polymarket data · Updated


Beware of external links.
Beware of external links.
Frequently Asked Questions