OpenAI's GPT-5 series commands a commanding 98.9% implied probability on Polymarket for the best AI model in math reasoning as of March 31, driven by its dominant performance on key benchmarks like MATH-500, where it scored 99.4% as recently as March 27—far surpassing rivals such as Google's Gemini 3 or Anthropic's Claude variants. This trader consensus reflects OpenAI's sustained lead in chain-of-thought reasoning and complex problem-solving, bolstered by no credible challenger releases in the past week amid typical multi-month training cycles for frontier large language models. While technical glitches in evaluations or a surprise preview from competitors like DeepSeek could theoretically shift standings, such disruptions remain improbable given the short timeline to resolution.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket · ОбновленоВ какой компании 31 марта будет лучшая модель искусственного интеллекта для математики?
В какой компании 31 марта будет лучшая модель искусственного интеллекта для математики?
OpenAI 98.8%
Google <1%
xAI <1%
DeepSeek <1%
$475,969 Объем
$475,969 Объем

OpenAI
99%

<1%

xAI
<1%

DeepSeek
<1%

Anthropic
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

Moonshot
<1%
OpenAI 98.8%
Google <1%
xAI <1%
DeepSeek <1%
$475,969 Объем
$475,969 Объем

OpenAI
99%

<1%

xAI
<1%

DeepSeek
<1%

Anthropic
<1%

Z.ai
<1%

Mistral
<1%

Alibaba
<1%

Moonshot
<1%
If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Открытие рынка: Dec 12, 2025, 1:25 PM ET
Resolver
0x2F5e3684c...If two models are tied for the highest LiveBench Mathematics Average score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order.
The primary source of resolution for this market will be LiveBench’s AI leaderboard, specifically the “Mathematics Average” category, found at livebench.ai. If this resolution source is unavailable at check time, this market will remain open until the LiveBench AI leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x2F5e3684c...OpenAI's GPT-5 series commands a commanding 98.9% implied probability on Polymarket for the best AI model in math reasoning as of March 31, driven by its dominant performance on key benchmarks like MATH-500, where it scored 99.4% as recently as March 27—far surpassing rivals such as Google's Gemini 3 or Anthropic's Claude variants. This trader consensus reflects OpenAI's sustained lead in chain-of-thought reasoning and complex problem-solving, bolstered by no credible challenger releases in the past week amid typical multi-month training cycles for frontier large language models. While technical glitches in evaluations or a surprise preview from competitors like DeepSeek could theoretically shift standings, such disruptions remain improbable given the short timeline to resolution.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket · Обновлено
Не доверяй внешним ссылкам.
Не доверяй внешним ссылкам.
Часто задаваемые вопросы