FrontierMath, Epoch AI's benchmark of expert-level math problems ranging from undergraduate to research-level difficulty, shows OpenAI's GPT-5.4 Pro leading at 38% on the hardest tier, Tier 4, as of early March 2026, far ahead of prior records, while xAI's Grok 4 lags at just 2% from its July 2025 evaluation. Trader sentiment likely reflects xAI's muted math progress amid its focus on multimodal advances, such as Grok Imagine 1.0's video generation in February and the weekly capability tweaks announced by Elon Musk in late March, with no recent FrontierMath-specific evaluations or announcements. Upcoming catalysts include a potential Grok 5 release leveraging xAI's Colossus supercluster, though rapid competitor iterations from OpenAI and Anthropic heighten uncertainty about a breakthrough by June 30.
Experimental AI-generated summary referencing Polymarket data · Updated
$15,883 Vol.
25% or higher: 73%
30% or higher: 69%
40% or higher: 57%
50% or higher: 27%
This market will resolve according to Epoch AI's FrontierMath benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies that are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Market opened: Jan 30, 2026, 12:01 AM ET
Resolver
0x65070BE91...
Frequently Asked Questions