Trader sentiment on xAI's Grok achieving a notable score on the FrontierMath benchmark—testing frontier models on 100 novel, PhD-level math problems—leans bearish due to the absence of any published Grok results despite low industry baselines, with OpenAI's o1-preview at just 10.4% and Anthropic's Claude 3.5 Sonnet at 1.9%. xAI's rapid scaling via the Colossus supercluster in Memphis positions Grok-3 for December training completion, potentially enabling competitive math prowess by June 30, but tight timelines and unverified leaks fuel doubt. Watch for Grok-2 evals or Elon Musk announcements at upcoming xAI updates, as benchmark submission lags model releases amid intensifying AI math race.
Experimental AI-generated summary referencing Polymarket data · Updated25%+
77%
30%+
74%
40%+
59%
50%+
27%
$0.00 Vol.
25%+
77%
30%+
74%
40%+
59%
50%+
27%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Market Opened: Jan 30, 2026, 12:01 AM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...Trader sentiment on xAI's Grok achieving a notable score on the FrontierMath benchmark—testing frontier models on 100 novel, PhD-level math problems—leans bearish due to the absence of any published Grok results despite low industry baselines, with OpenAI's o1-preview at just 10.4% and Anthropic's Claude 3.5 Sonnet at 1.9%. xAI's rapid scaling via the Colossus supercluster in Memphis positions Grok-3 for December training completion, potentially enabling competitive math prowess by June 30, but tight timelines and unverified leaks fuel doubt. Watch for Grok-2 evals or Elon Musk announcements at upcoming xAI updates, as benchmark submission lags model releases amid intensifying AI math race.
Experimental AI-generated summary referencing Polymarket data · Updated
Beware of external links.
Beware of external links.
Frequently Asked Questions