OpenAI's o1 reasoning models command a 94.5% implied probability of having the top AI math model by March 31, driven by their current dominance on benchmarks like MATH (94.8% accuracy) and AIME 2024 (83.3%), where chain-of-thought techniques enable superior problem-solving over rivals like Claude 3.5 Sonnet or Gemini 1.5 Pro. Trader consensus reflects OpenAI's rapid iteration pace, massive compute resources, and track record of benchmark leadership since the September 2024 launch. Challenges could arise from xAI's Grok-3 release (expected late 2024 with 100k H100s training), DeepSeek's open-source math specialists iterating quickly, or surprise leaps from Google DeepMind ahead of key evals, though historical slippage tempers these odds.
Resumo experimental gerado por IA com dados do Polymarket · AtualizadoQual empresa terá o melhor modelo de IA para matemática em 31 de março?
Qual empresa terá o melhor modelo de IA para matemática em 31 de março?
OpenAI 95%
xAI 1.9%
DeepSeek 1.4%
Anthropic 1.1%
$173,267 Vol.
$173,267 Vol.

OpenAI
95%

xAI
2%

DeepSeek
1%

Anthropic
1%

1%

Moonshot
1%

Z.ai
<1%

Alibaba
<1%

Mistral
<1%
OpenAI 95%
xAI 1.9%
DeepSeek 1.4%
Anthropic 1.1%
$173,267 Vol.
$173,267 Vol.

OpenAI
95%

xAI
2%

DeepSeek
1%

Anthropic
1%

1%

Moonshot
1%

Z.ai
<1%

Alibaba
<1%

Mistral
<1%
If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Mercado Aberto: Dec 12, 2025, 1:25 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...OpenAI's o1 reasoning models command a 94.5% implied probability of having the top AI math model by March 31, driven by their current dominance on benchmarks like MATH (94.8% accuracy) and AIME 2024 (83.3%), where chain-of-thought techniques enable superior problem-solving over rivals like Claude 3.5 Sonnet or Gemini 1.5 Pro. Trader consensus reflects OpenAI's rapid iteration pace, massive compute resources, and track record of benchmark leadership since the September 2024 launch. Challenges could arise from xAI's Grok-3 release (expected late 2024 with 100k H100s training), DeepSeek's open-source math specialists iterating quickly, or surprise leaps from Google DeepMind ahead of key evals, though historical slippage tempers these odds.
Resumo experimental gerado por IA com dados do Polymarket · Atualizado
Cuidado com os links externos.
Cuidado com os links externos.
Frequently Asked Questions