xAI traders are closely watching Grok's potential performance on the FrontierMath benchmark—a set of 35 ultra-challenging math problems released in June 2024 that expose limits in frontier artificial intelligence models, where even OpenAI's o1-preview scores just 2.2% and Anthropic's Claude 3.5 Sonnet hits 1.5%. No public Grok scores exist yet, as prior models like Grok-1.5 trail leaders on similar reasoning evals, but xAI's aggressive scaling with a 100,000-GPU Colossus cluster positions it for gains. Elon Musk recently confirmed Grok-2 training complete, with a preview rollout imminent in early August, potentially showcasing improved math capabilities amid intensifying AI lab competition. Upcoming catalysts include benchmark disclosures post-release and xAI's API expansions, with the June 30 deadline looming as a test of near-term progress.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket · Обновленооценка xAI Grok по FrontierMath Benchmark к 30 июня?
оценка xAI Grok по FrontierMath Benchmark к 30 июня?
25%+
78%
30%+
73%
40%+
60%
50%+
26%
$3,171 Объем
25%+
78%
30%+
73%
40%+
60%
50%+
26%
This market will resolve according to the Epoch AI’s Frontier Math benchmarking leaderboard (https://epoch.ai/frontiermath) for Tier 1-3. Studies which are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from EpochAI; however, a consensus of credible reporting may also be used.
Открытие рынка: Jan 30, 2026, 12:01 AM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...xAI traders are closely watching Grok's potential performance on the FrontierMath benchmark—a set of 35 ultra-challenging math problems released in June 2024 that expose limits in frontier artificial intelligence models, where even OpenAI's o1-preview scores just 2.2% and Anthropic's Claude 3.5 Sonnet hits 1.5%. No public Grok scores exist yet, as prior models like Grok-1.5 trail leaders on similar reasoning evals, but xAI's aggressive scaling with a 100,000-GPU Colossus cluster positions it for gains. Elon Musk recently confirmed Grok-2 training complete, with a preview rollout imminent in early August, potentially showcasing improved math capabilities amid intensifying AI lab competition. Upcoming catalysts include benchmark disclosures post-release and xAI's API expansions, with the June 30 deadline looming as a test of near-term progress.
Экспериментальная сводка, созданная ИИ на основе данных Polymarket · Обновлено
Не доверяй внешним ссылкам.
Не доверяй внешним ссылкам.
Часто задаваемые вопросы