Gemini models currently lag on the FrontierMath benchmark, with Gemini 3.1 Pro scoring around 38% on Tiers 1–3 and 19% on the ultra-challenging Tier 4 as of February 2026 evaluations by Epoch AI, trailing OpenAI's GPT-5.4 at 47.6% overall. This reflects Google's focus on efficiency upgrades like Gemini 3.1 Flash-Lite in March 2026, prioritizing speed and cost over raw mathematical reasoning gains amid competitive pressure from reasoning-specialized models. Trader consensus hinges on potential Gemini 4 announcements at Google I/O in May, which could introduce advanced chain-of-thought enhancements to close the gap before the June 30 deadline; however, historical delays in frontier math breakthroughs temper expectations for rapid leaps.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket · ОбновленоОценка Google Gemini в FrontierMath Benchmark к 30 июня?
Оценка Google Gemini в FrontierMath Benchmark к 30 июня?
$48,923 Объем
40%+
94%
45%+
65%
50%+
42%
60%+
11%
$48,923 Объем
40%+
94%
45%+
65%
50%+
42%
60%+
11%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Открытие рынка: Feb 6, 2026, 6:03 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...Gemini models currently lag on the FrontierMath benchmark, with Gemini 3.1 Pro scoring around 38% on Tiers 1–3 and 19% on the ultra-challenging Tier 4 as of February 2026 evaluations by Epoch AI, trailing OpenAI's GPT-5.4 at 47.6% overall. This reflects Google's focus on efficiency upgrades like Gemini 3.1 Flash-Lite in March 2026, prioritizing speed and cost over raw mathematical reasoning gains amid competitive pressure from reasoning-specialized models. Trader consensus hinges on potential Gemini 4 announcements at Google I/O in May, which could introduce advanced chain-of-thought enhancements to close the gap before the June 30 deadline; however, historical delays in frontier math breakthroughs temper expectations for rapid leaps.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket · Обновлено
Не доверяй внешним ссылкам.
Не доверяй внешним ссылкам.
Часто задаваемые вопросы