OpenAI's latest models, including GPT-5.4 Thinking at 94.15% on LiveBench's Mathematics Average leaderboard as of March 28, command a commanding 99.4% implied probability among traders, reflecting their unchallenged lead in advanced math reasoning benchmarks like competition-level algebra and proofs. This dominance stems from recent optimizations in chain-of-thought processing and specialized training data, widening the gap over Google's Gemini 3.1 Pro Preview (91.04%) and Anthropic's Claude 4.6 Opus (89.32%). With resolution hinging on the March 31 snapshot of LiveBench.ai, trader consensus anticipates no upsets; however, a surprise model release from rivals—such as an enhanced Grok or DeepSeek variant—could realistically shift standings if benchmarked higher before deadline.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten · AktualisiertOpenAI 99.4%
xAI <1%
Google <1%
DeepSeek <1%
$474,205 Vol.
$474,205 Vol.

OpenAI
99%

xAI
<1%

<1%

DeepSeek
<1%

Anthropic
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

Moonshot
<1%
OpenAI 99.4%
xAI <1%
Google <1%
DeepSeek <1%
$474,205 Vol.
$474,205 Vol.

OpenAI
99%

xAI
<1%

<1%

DeepSeek
<1%

Anthropic
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

Moonshot
<1%
If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Markt eröffnet: Dec 12, 2025, 1:25 PM ET
Resolver
0x2F5e3684c...If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x2F5e3684c...OpenAI's latest models, including GPT-5.4 Thinking at 94.15% on LiveBench's Mathematics Average leaderboard as of March 28, command a commanding 99.4% implied probability among traders, reflecting their unchallenged lead in advanced math reasoning benchmarks like competition-level algebra and proofs. This dominance stems from recent optimizations in chain-of-thought processing and specialized training data, widening the gap over Google's Gemini 3.1 Pro Preview (91.04%) and Anthropic's Claude 4.6 Opus (89.32%). With resolution hinging on the March 31 snapshot of LiveBench.ai, trader consensus anticipates no upsets; however, a surprise model release from rivals—such as an enhanced Grok or DeepSeek variant—could realistically shift standings if benchmarked higher before deadline.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten · Aktualisiert
Vorsicht bei externen Links.
Vorsicht bei externen Links.
Häufig gestellte Fragen