Trader consensus gives Anthropic a 95.5% implied probability of having the second-best AI model by end of March, driven primarily by Claude 3 Opus's March 4 launch, which delivered top-tier benchmark scores rivaling OpenAI's GPT-4 Turbo—surpassing it on GPQA reasoning and coding evals while ranking #2 on the LMSYS Chatbot Arena ELO leaderboard via blind user votes. This positions Claude as the clear runner-up in the frontier model race, bolstered by independent verifications from Artificial Analysis and Scale AI. Challenges could arise from Google's Gemini 1.5 Pro gaining traction in late-month evals or xAI's Grok-1.5 (announced March 28) surging unexpectedly, though trader odds reflect limited evidence of such shifts amid stable leaderboards.
Experimental AI-generated summary referencing Polymarket data · UpdatedAnthropic 96%
Google 2.5%
xAI 1.0%
OpenAI <1%
$972,228 Vol.
$972,228 Vol.

Anthropic
96%

3%

xAI
1%

OpenAI
1%

DeepSeek
<1%

Z.ai
<1%

Alibaba
<1%

Baidu
<1%

Moonshot
<1%

Mistral
<1%

Meituan
<1%
Anthropic 96%
Google 2.5%
xAI 1.0%
OpenAI <1%
$972,228 Vol.
$972,228 Vol.

Anthropic
96%

3%

xAI
1%

OpenAI
1%

DeepSeek
<1%

Z.ai
<1%

Alibaba
<1%

Baidu
<1%

Moonshot
<1%

Mistral
<1%

Meituan
<1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the second best arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g. if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Market Opened: Dec 2, 2025, 6:02 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...Trader consensus gives Anthropic a 95.5% implied probability of having the second-best AI model by end of March, driven primarily by Claude 3 Opus's March 4 launch, which delivered top-tier benchmark scores rivaling OpenAI's GPT-4 Turbo—surpassing it on GPQA reasoning and coding evals while ranking #2 on the LMSYS Chatbot Arena ELO leaderboard via blind user votes. This positions Claude as the clear runner-up in the frontier model race, bolstered by independent verifications from Artificial Analysis and Scale AI. Challenges could arise from Google's Gemini 1.5 Pro gaining traction in late-month evals or xAI's Grok-1.5 (announced March 28) surging unexpectedly, though trader odds reflect limited evidence of such shifts amid stable leaderboards.
Experimental AI-generated summary referencing Polymarket data · Updated


Beware of external links.
Beware of external links.
Frequently Asked Questions