OpenAI's o1 reasoning models dominate coding benchmarks such as LiveCodeBench and the LMSYS Chatbot Arena coding leaderboard, where they achieve top Elo scores through superior multi-step reasoning, which is critical for complex programming tasks. This drives a 95.5% implied probability that the company holds the best AI model for coding by March 31. The trader consensus reflects real-capital bets on sustained leadership, with rivals such as Anthropic's Claude 3.5 Sonnet (1.9%) trailing in recent evaluations and few announcements of competitive releases from Google DeepMind, xAI, or DeepSeek. Product timelines can slip, however, and potential challenges include surprise Gemini 2.0 or Grok-2 launches with breakthrough coding demos, or revised benchmarks highlighting overlooked capabilities in open-source alternatives before the deadline.
Experimental AI-generated summary based on Polymarket data · Updated
OpenAI 95.6%
Anthropic 2.5%
DeepSeek <1%
Google <1%
$1,016,941 Vol.

OpenAI
96%

Anthropic
2%

DeepSeek
<1%

Google
<1%

xAI
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

Moonshot
<1%
If two models are tied for the top LiveBench coding average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “coding average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
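The resolution rules above can be sketched in code. This is only an illustration under stated assumptions: the function name `resolve_market`, the `(company, model, score)` input shape, and the case-insensitive name comparison are all hypothetical and are not specified by the market, and the scores in the usage example are made up.

```python
from typing import Optional

# Each entry is (company, model, coding_average). This shape is an assumption
# for illustration; the market only names the leaderboard, not a data format.
Entry = tuple[str, str, float]

def resolve_market(entries: list[Entry]) -> Optional[str]:
    """Return the winning company, or None if the leaderboard is unavailable."""
    if not entries:
        # Source unavailable at check time: the market stays open.
        return None
    top_score = max(score for _, _, score in entries)
    # Collect every company that has a model at the top score.
    tied = {company for company, _, score in entries if score == top_score}
    # Tie-break: the company name that comes first alphabetically.
    # Case-insensitive comparison is an assumption, not stated in the rules.
    return min(tied, key=str.casefold)

# Hypothetical scores, not real leaderboard data.
example = [
    ("OpenAI", "model-a", 80.0),
    ("Anthropic", "model-b", 80.0),
    ("Google", "model-c", 75.0),
]
print(resolve_market(example))  # Anthropic
```

In this made-up example two models tie at 80.0, so the alphabetical rule picks Anthropic over OpenAI.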
Market opened: Dec 12, 2025, 1:29 PM ET
Resolver
0x2F5e3684c...
Frequently Asked Questions