OpenAI commands a 94.5% implied probability of having the top AI model for coding by March 31, driven by its o1-preview model's breakthrough chain-of-thought reasoning, which excels on coding benchmarks like HumanEval (90%+ accuracy) and LiveCodeBench, outpacing rivals in complex problem-solving. Traders cite OpenAI's unmatched compute scale, rapid iteration history—from GPT-4o to o1—and anticipated full o1 rollout or successor as locking in dominance. Supporting this, recent LMSYS Arena coding leaderboard positions reinforce trader consensus amid quiet from competitors. Challenges include Anthropic's Claude 3.5 Sonnet surging via quick updates (4.5% odds) or surprise leaps from xAI's Grok or DeepSeek's open-source coders, but execution risks and timelines favor OpenAI's edge.
基于Polymarket数据的AI实验性摘要 · 更新于OpenAI 95%
Anthropic 5%
DeepSeek <1%
谷歌 <1%
$813,351 交易量
$813,351 交易量

OpenAI
95%

Anthropic
5%

DeepSeek
1%

谷歌
1%

xAI
<1%

Z.ai
<1%

Mistral
<1%

阿里巴巴
<1%

Moonshot
<1%
OpenAI 95%
Anthropic 5%
DeepSeek <1%
谷歌 <1%
$813,351 交易量
$813,351 交易量

OpenAI
95%

Anthropic
5%

DeepSeek
1%

谷歌
1%

xAI
<1%

Z.ai
<1%

Mistral
<1%

阿里巴巴
<1%

Moonshot
<1%
If two models are tied for the top LiveBench coding average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “coding average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
市场开放时间: Dec 12, 2025, 1:29 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...OpenAI commands a 94.5% implied probability of having the top AI model for coding by March 31, driven by its o1-preview model's breakthrough chain-of-thought reasoning, which excels on coding benchmarks like HumanEval (90%+ accuracy) and LiveCodeBench, outpacing rivals in complex problem-solving. Traders cite OpenAI's unmatched compute scale, rapid iteration history—from GPT-4o to o1—and anticipated full o1 rollout or successor as locking in dominance. Supporting this, recent LMSYS Arena coding leaderboard positions reinforce trader consensus amid quiet from competitors. Challenges include Anthropic's Claude 3.5 Sonnet surging via quick updates (4.5% odds) or surprise leaps from xAI's Grok or DeepSeek's open-source coders, but execution risks and timelines favor OpenAI's edge.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题