OpenAI's o1 reasoning model dominates trader consensus at 93.5% implied probability for the best AI math performance by March 31, driven by its state-of-the-art scores on key benchmarks like MATH (83% accuracy) and AIME, outpacing rivals through chain-of-thought reasoning that excels in complex problem-solving. Recent LMSYS Arena leaderboards and independent evals confirm o1's lead, with no shipped competitors closing the gap amid OpenAI's rapid iteration post-launch. Trader skepticism toward challengers stems from delayed timelines: xAI's Grok-3 training is underway but unproven, DeepSeek's open-source math models trail in closed evals, and Google's Gemini or Anthropic's Claude updates lack firm pre-deadline commitments. Upsets could arise from surprise releases or benchmark breakthroughs, but current dynamics favor OpenAI's entrenched edge.
Resumo experimental gerado por IA com dados do Polymarket · AtualizadoQual empresa terá o melhor modelo de IA para matemática em 31 de março?
Qual empresa terá o melhor modelo de IA para matemática em 31 de março?
OpenAI 94%
xAI 1.9%
DeepSeek 1.3%
Google 1.0%
$145,484 Vol.
$145,484 Vol.

OpenAI
94%

xAI
2%

DeepSeek
1%

1%

Anthropic
1%

Moonshot
1%

Z.ai
<1%

Alibaba
<1%

Mistral
<1%
OpenAI 94%
xAI 1.9%
DeepSeek 1.3%
Google 1.0%
$145,484 Vol.
$145,484 Vol.

OpenAI
94%

xAI
2%

DeepSeek
1%

1%

Anthropic
1%

Moonshot
1%

Z.ai
<1%

Alibaba
<1%

Mistral
<1%
If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Mercado Aberto: Dec 12, 2025, 1:25 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...OpenAI's o1 reasoning model dominates trader consensus at 93.5% implied probability for the best AI math performance by March 31, driven by its state-of-the-art scores on key benchmarks like MATH (83% accuracy) and AIME, outpacing rivals through chain-of-thought reasoning that excels in complex problem-solving. Recent LMSYS Arena leaderboards and independent evals confirm o1's lead, with no shipped competitors closing the gap amid OpenAI's rapid iteration post-launch. Trader skepticism toward challengers stems from delayed timelines: xAI's Grok-3 training is underway but unproven, DeepSeek's open-source math models trail in closed evals, and Google's Gemini or Anthropic's Claude updates lack firm pre-deadline commitments. Upsets could arise from surprise releases or benchmark breakthroughs, but current dynamics favor OpenAI's entrenched edge.
Resumo experimental gerado por IA com dados do Polymarket · Atualizado
Cuidado com os links externos.
Cuidado com os links externos.
Frequently Asked Questions