Trader consensus on Polymarket assigns an overwhelming 90% implied probability to OpenAI claiming the best AI model for coding by March 31, fueled by anticipation of its next-generation reasoning model—likely a full o1 successor or GPT-5 iteration—building on o1-preview's strong performance in complex coding tasks via extended chain-of-thought processing. Anthropic's 7% share reflects Claude 3.5 Sonnet's current dominance on benchmarks like SWE-Bench (33.4% verified) and HumanEval (92%), but traders doubt it will hold against OpenAI's rapid iteration pace. Google and others trail due to lagging coding-specific advancements amid competitive delays; key watchpoints include OpenAI's Q1 2025 releases and LMSYS leaderboard shifts.
Resumen experimental generado por IA con datos de Polymarket · ActualizadoOpenAI 90%
Anthropic 6.4%
Google 1.0%
DeepSeek <1%
$976,140 Vol.
$976,140 Vol.

OpenAI
90%

Anthropic
6%

1%

DeepSeek
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

xAI
<1%

Moonshot
<1%
OpenAI 90%
Anthropic 6.4%
Google 1.0%
DeepSeek <1%
$976,140 Vol.
$976,140 Vol.

OpenAI
90%

Anthropic
6%

1%

DeepSeek
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

xAI
<1%

Moonshot
<1%
If two models are tied for the top LiveBench coding average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “coding average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Mercado abierto: Dec 12, 2025, 1:29 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...Trader consensus on Polymarket assigns an overwhelming 90% implied probability to OpenAI claiming the best AI model for coding by March 31, fueled by anticipation of its next-generation reasoning model—likely a full o1 successor or GPT-5 iteration—building on o1-preview's strong performance in complex coding tasks via extended chain-of-thought processing. Anthropic's 7% share reflects Claude 3.5 Sonnet's current dominance on benchmarks like SWE-Bench (33.4% verified) and HumanEval (92%), but traders doubt it will hold against OpenAI's rapid iteration pace. Google and others trail due to lagging coding-specific advancements amid competitive delays; key watchpoints include OpenAI's Q1 2025 releases and LMSYS leaderboard shifts.
Resumen experimental generado por IA con datos de Polymarket · Actualizado
Cuidado con los enlaces externos.
Cuidado con los enlaces externos.
Preguntas frecuentes