xAI's Grok models lag behind frontier competitors on the FrontierMath benchmark, with Grok-4 scoring just 12-14% overall and 2% on Tier 4 in Epoch AI's July 2025 evaluation, well below OpenAI's GPT-5.4 at 47.6% and recent Tier 4 records like GPT-5.4 Pro's 38%. No public Grok scores have emerged since, despite xAI's rapid iterations—Grok 4.3 Beta in April 2026 topped AIME math at 100% and other reasoning benchmarks, signaling strong mathematical capabilities elsewhere via Colossus supercluster scaling to hundreds of thousands of GPUs. Traders eye potential pre-June 30 model releases or Epoch evals amid intensifying AI compute races, though OpenAI's verifier access to FrontierMath extensions bolsters its lead.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves. · Updated$19,331 Vol.
25%+
59%
30%+
55%
40%+
48%
50%+
23%
$19,331 Vol.
25%+
59%
30%+
55%
40%+
48%
50%+
23%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Market Opened: Jan 30, 2026, 12:01 AM ET
Resolver
0x65070BE91...This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Resolver
0x65070BE91...xAI's Grok models lag behind frontier competitors on the FrontierMath benchmark, with Grok-4 scoring just 12-14% overall and 2% on Tier 4 in Epoch AI's July 2025 evaluation, well below OpenAI's GPT-5.4 at 47.6% and recent Tier 4 records like GPT-5.4 Pro's 38%. No public Grok scores have emerged since, despite xAI's rapid iterations—Grok 4.3 Beta in April 2026 topped AIME math at 100% and other reasoning benchmarks, signaling strong mathematical capabilities elsewhere via Colossus supercluster scaling to hundreds of thousands of GPUs. Traders eye potential pre-June 30 model releases or Epoch evals amid intensifying AI compute races, though OpenAI's verifier access to FrontierMath extensions bolsters its lead.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves. · Updated
Beware of external links.
Beware of external links.
Frequently Asked Questions