Trader consensus on Polymarket overwhelmingly favors Anthropic at a 99.1% implied probability for possessing the top AI model by March 31, 2026, driven by Claude Opus 4.6's commanding lead on key benchmarks like the LMSYS Chatbot Arena leaderboard (under style control) and SWE-Bench coding evaluations, where it scores 75.6% and higher than rivals. This positioning stems from Anthropic's aggressive early-2026 release cadence, including Claude Sonnet 4.6, a 1M-token context window beta, voice mode, persistent memory, and agentic "computer use" features that solidify its edge in real-world capabilities amid fierce competition from OpenAI's GPT series and Google's Gemini. With mere days to resolution, only an unforeseen last-minute model drop from challengers like Google—potentially leveraging superior compute—or benchmark methodology shifts could realistically disrupt this near-certain outcome, though historical precedents suggest such upsets are rare this close to deadlines.
Experimental AI-generated summary referencing Polymarket data · UpdatedAnthropic 99.0%
Google <1%
xAI <1%
OpenAI <1%
$15,266,903 Vol.
$15,266,903 Vol.

Anthropic
99%

<1%

xAI
<1%

OpenAI
<1%

DeepSeek
<1%

Baidu
<1%

Moonshot
<1%

Meituan
<1%

Alibaba
<1%

Z.ai
<1%

Mistral
<1%
Anthropic 99.0%
Google <1%
xAI <1%
OpenAI <1%
$15,266,903 Vol.
$15,266,903 Vol.

Anthropic
99%

<1%

xAI
<1%

OpenAI
<1%

DeepSeek
<1%

Baidu
<1%

Moonshot
<1%

Meituan
<1%

Alibaba
<1%

Z.ai
<1%

Mistral
<1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the highest arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g., if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Market Opened: Dec 2, 2025, 6:02 PM ET
Resolver
0x2F5e3684c...Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the highest arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g., if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x2F5e3684c...Trader consensus on Polymarket overwhelmingly favors Anthropic at a 99.1% implied probability for possessing the top AI model by March 31, 2026, driven by Claude Opus 4.6's commanding lead on key benchmarks like the LMSYS Chatbot Arena leaderboard (under style control) and SWE-Bench coding evaluations, where it scores 75.6% and higher than rivals. This positioning stems from Anthropic's aggressive early-2026 release cadence, including Claude Sonnet 4.6, a 1M-token context window beta, voice mode, persistent memory, and agentic "computer use" features that solidify its edge in real-world capabilities amid fierce competition from OpenAI's GPT series and Google's Gemini. With mere days to resolution, only an unforeseen last-minute model drop from challengers like Google—potentially leveraging superior compute—or benchmark methodology shifts could realistically disrupt this near-certain outcome, though historical precedents suggest such upsets are rare this close to deadlines.
Experimental AI-generated summary referencing Polymarket data · Updated
Beware of external links.
Beware of external links.
Frequently Asked Questions