OpenAI dominates trader sentiment with an 89% implied probability for the best AI model for coding by March 31, fueled by its o1-preview and o1-mini models' top scores on benchmarks like SWE-Bench (surpassing 30% resolution) and LiveCodeBench, leveraging advanced chain-of-thought reasoning for complex programming tasks. Recent LMSYS Chatbot Arena coding rankings reinforce this edge over Anthropic's Claude 3.5 Sonnet (6.4% odds), which excels in frontend coding but trails in agentic software engineering evals. Google's Gemini (1.3%) shows gains via recent fine-tunes yet lags in multi-step debugging, while open-source contenders like DeepSeek (0.4%) face scalability hurdles. Odds reflect shipped capabilities amid anticipation for final March evaluations.
Resumo experimental gerado por IA com dados do Polymarket · AtualizadoQual empresa terá o melhor modelo de IA para codificação em 31 de março?
Qual empresa terá o melhor modelo de IA para codificação em 31 de março?
OpenAI 90%
Anthropic 6.4%
Google 1.4%
DeepSeek <1%
$979,485 Vol.
$979,485 Vol.

OpenAI
90%

Anthropic
6%

1%

DeepSeek
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

xAI
<1%

Moonshot
<1%
OpenAI 90%
Anthropic 6.4%
Google 1.4%
DeepSeek <1%
$979,485 Vol.
$979,485 Vol.

OpenAI
90%

Anthropic
6%

1%

DeepSeek
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

xAI
<1%

Moonshot
<1%
If two models are tied for the top LiveBench coding average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “coding average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Mercado Aberto: Dec 12, 2025, 1:29 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...OpenAI dominates trader sentiment with an 89% implied probability for the best AI model for coding by March 31, fueled by its o1-preview and o1-mini models' top scores on benchmarks like SWE-Bench (surpassing 30% resolution) and LiveCodeBench, leveraging advanced chain-of-thought reasoning for complex programming tasks. Recent LMSYS Chatbot Arena coding rankings reinforce this edge over Anthropic's Claude 3.5 Sonnet (6.4% odds), which excels in frontend coding but trails in agentic software engineering evals. Google's Gemini (1.3%) shows gains via recent fine-tunes yet lags in multi-step debugging, while open-source contenders like DeepSeek (0.4%) face scalability hurdles. Odds reflect shipped capabilities amid anticipation for final March evaluations.
Resumo experimental gerado por IA com dados do Polymarket · Atualizado
Cuidado com os links externos.
Cuidado com os links externos.
Frequently Asked Questions