OpenAI commands a 90% implied probability as the frontrunner for the best AI coding model by March 31, driven by its o1-preview and o1-mini models topping key benchmarks like LiveCodeBench and SWE-Bench Verified, where they excel in complex reasoning and code generation. Traders anticipate OpenAI's iterative improvements will sustain this edge ahead of rivals' updates. Anthropic's 8.2% stake reflects Claude 3.5 Sonnet's recent June release, which surpassed GPT-4o in coding evals including HumanEval and front-end tasks, narrowing the gap but not overtaking o1's overall lead. Google's 1.2% trails due to Gemini 1.5 Pro's middling performance, while longshots like DeepSeek and Mistral linger on open-source strengths without broad API access or ecosystem momentum. Upcoming developer previews and benchmark refreshes could shift odds.
基於Polymarket數據的AI實驗性摘要 · 更新於OpenAI 91%
Anthropic 8.0%
Google 1.1%
DeepSeek <1%
$981,739 交易量
$981,739 交易量

OpenAI
91%

Anthropic
8%

1%

DeepSeek
<1%

Z.ai
<1%

Mistral
<1%

阿里巴巴
<1%

xAI
<1%

Moonshot
<1%
OpenAI 91%
Anthropic 8.0%
Google 1.1%
DeepSeek <1%
$981,739 交易量
$981,739 交易量

OpenAI
91%

Anthropic
8%

1%

DeepSeek
<1%

Z.ai
<1%

Mistral
<1%

阿里巴巴
<1%

xAI
<1%

Moonshot
<1%
If two models are tied for the top LiveBench coding average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “coding average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
市場開放時間: Dec 12, 2025, 1:29 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...OpenAI commands a 90% implied probability as the frontrunner for the best AI coding model by March 31, driven by its o1-preview and o1-mini models topping key benchmarks like LiveCodeBench and SWE-Bench Verified, where they excel in complex reasoning and code generation. Traders anticipate OpenAI's iterative improvements will sustain this edge ahead of rivals' updates. Anthropic's 8.2% stake reflects Claude 3.5 Sonnet's recent June release, which surpassed GPT-4o in coding evals including HumanEval and front-end tasks, narrowing the gap but not overtaking o1's overall lead. Google's 1.2% trails due to Gemini 1.5 Pro's middling performance, while longshots like DeepSeek and Mistral linger on open-source strengths without broad API access or ecosystem momentum. Upcoming developer previews and benchmark refreshes could shift odds.
基於Polymarket數據的AI實驗性摘要 · 更新於
警惕外部連結哦。
警惕外部連結哦。
Frequently Asked Questions