Google's Gemini 3.1 Pro currently scores 36.9% on the FrontierMath Tiers 1-4 benchmark—a suite of hundreds of unpublished, research-level math problems vetted by expert mathematicians—trailing OpenAI models exceeding 50%, according to Epoch AI's latest evaluations. The February 2026 release of the Gemini 3 series, including 3 Pro Preview at 37.6% and specialized variants like Deep Think, drove these gains through advanced chain-of-thought reasoning and tool integration, narrowing the gap from prior single-digit performances. Competitive pressure from GPT-5.2 Pro's 31% on Tier 4 underscores the arms race in AI mathematical capabilities. Traders eye Google I/O on May 19-20 for potential Gemini 4 announcements or benchmark updates that could push scores higher by the June 30 deadline, though timelines often slip amid scaling challenges.
Résumé expérimental généré par IA à partir des données Polymarket. Ceci n'est pas un conseil de trading et ne joue aucun rôle dans la résolution de ce marché. · Mis à jour$127,692 Vol.
40 %+
92%
45 %+
41%
50%+
36%
60 %+
18%
$127,692 Vol.
40 %+
92%
45 %+
41%
50%+
36%
60 %+
18%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Marché ouvert : Feb 6, 2026, 6:03 PM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...Google's Gemini 3.1 Pro currently scores 36.9% on the FrontierMath Tiers 1-4 benchmark—a suite of hundreds of unpublished, research-level math problems vetted by expert mathematicians—trailing OpenAI models exceeding 50%, according to Epoch AI's latest evaluations. The February 2026 release of the Gemini 3 series, including 3 Pro Preview at 37.6% and specialized variants like Deep Think, drove these gains through advanced chain-of-thought reasoning and tool integration, narrowing the gap from prior single-digit performances. Competitive pressure from GPT-5.2 Pro's 31% on Tier 4 underscores the arms race in AI mathematical capabilities. Traders eye Google I/O on May 19-20 for potential Gemini 4 announcements or benchmark updates that could push scores higher by the June 30 deadline, though timelines often slip amid scaling challenges.
Résumé expérimental généré par IA à partir des données Polymarket. Ceci n'est pas un conseil de trading et ne joue aucun rôle dans la résolution de ce marché. · Mis à jour
Méfiez-vous des liens externes.
Méfiez-vous des liens externes.
Questions fréquentes