Trader sentiment on Anthropic's Claude achieving a strong score on the FrontierMath benchmark—a rigorous test of PhD-level math problems from recent arXiv papers—by June 30 largely pivots on the imminent Claude 4 release, teased by CEO Dario Amodei as arriving in weeks with 10x more training compute than Claude 3.5 Sonnet. Current leaderboards show Claude 3.5 Sonnet at a mere 1.6%, trailing slightly behind OpenAI's o1-preview (2.3%) and Gemini 2.0 variants, underscoring broad frontier model struggles but highlighting scaling potential. Competitive pressure intensifies with rivals' math-focused updates, while no firm Claude 4 date exists; traders eye Q1 2025 launches amid slipping AI timelines.
Experimental AI-generated summary based on Polymarket data · Updated
Anthropic's Claude score on the FrontierMath Benchmark by June 30?
$47,034 Volume
50%+
54%
This market will resolve according to Epoch AI's FrontierMath benchmark leaderboard (https://epoch.ai/frontiermath) for Tiers 1-3. Studies that are not included in the leaderboard (e.g. https://x.com/EpochAIResearch/status/1945905796904005720) will not be considered.
The primary resolution source will be information from Epoch AI; however, a consensus of credible reporting may also be used.
Market opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...
Proposed outcome: Yes
No dispute
Final outcome: Yes
Frequently Asked Questions