OpenAI dominates Polymarket trader consensus at 98.8% implied probability, driven by its o1 model's unrivaled performance on key math benchmarks like MATH (83.3% accuracy) and AIME 2024 (94.8%), leveraging advanced chain-of-thought reasoning and test-time compute that far outpace rivals such as Anthropic's Claude 3.5 Sonnet (50% on MATH) or Google's Gemini 2.0. No competitor releases in the past month have closed the gap, with recent leaderboards from LMSYS and Artificial Analysis affirming o1's lead amid a quiet period for AI math advancements. As the March 31 snapshot nears, historical product timelines suggest limited upset potential, though an unannounced model launch from DeepMind or xAI could realistically challenge if it demonstrates superior capabilities.
基于Polymarket数据的AI实验性摘要 · 更新于OpenAI 98.8%
谷歌 <1%
DeepSeek <1%
Anthropic <1%
$460,776 交易量
$460,776 交易量

OpenAI
99%

谷歌
1%

DeepSeek
<1%

Anthropic
<1%

xAI
<1%

Moonshot
<1%

Z.ai
<1%

Mistral
<1%

阿里巴巴
<1%
OpenAI 98.8%
谷歌 <1%
DeepSeek <1%
Anthropic <1%
$460,776 交易量
$460,776 交易量

OpenAI
99%

谷歌
1%

DeepSeek
<1%

Anthropic
<1%

xAI
<1%

Moonshot
<1%

Z.ai
<1%

Mistral
<1%

阿里巴巴
<1%
If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
市场开放时间: Dec 12, 2025, 1:25 PM ET
Resolver
0x2F5e3684c...If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x2F5e3684c...OpenAI dominates Polymarket trader consensus at 98.8% implied probability, driven by its o1 model's unrivaled performance on key math benchmarks like MATH (83.3% accuracy) and AIME 2024 (94.8%), leveraging advanced chain-of-thought reasoning and test-time compute that far outpace rivals such as Anthropic's Claude 3.5 Sonnet (50% on MATH) or Google's Gemini 2.0. No competitor releases in the past month have closed the gap, with recent leaderboards from LMSYS and Artificial Analysis affirming o1's lead amid a quiet period for AI math advancements. As the March 31 snapshot nears, historical product timelines suggest limited upset potential, though an unannounced model launch from DeepMind or xAI could realistically challenge if it demonstrates superior capabilities.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题