Anthropic's April 2026 launch of Claude Opus 4.7 and the high-performing Mythos Preview has driven Claude to frontier-leading scores of 35-65% on Humanity's Last Exam, a rigorous benchmark of 2,500 expert-vetted questions probing advanced AI reasoning in math, science, and humanities. These results, often obtained with tools and extended chain-of-thought prompting, surpass earlier models and edge out competitors such as OpenAI's GPT-5 series (41-59%) and Google's Gemini 3.1 Pro (up to 47%), reflecting Anthropic's focus on scalable oversight and safety-aligned scaling. Trader consensus weighs the likelihood of a public Claude variant hitting market-specific thresholds by June 30 against evaluation discrepancies and rapid competitive releases; key catalysts include potential Claude 5 previews or benchmark updates at upcoming AI conferences.
Experimental AI-generated summary based on Polymarket data. This is not trading advice and does not influence how this market is resolved. · Updated
$283,093 Vol.
45%+: 18%
50%+: 9%
55%+: 4%
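The three quoted prices can be read as cumulative probabilities that the score reaches at least each threshold. As a rough sketch of that arithmetic, assuming the outcomes are nested (a score of 55%+ also counts as 50%+ and 45%+) and ignoring spreads and fees, the implied probability for each score band is the difference between adjacent prices. The function name `implied_buckets` and the band labels are illustrative, not part of the listing.

```python
# Quoted prices from the listing, in percentage points, read as
# P(score >= threshold). Assumption: the outcomes are nested, so the
# prices behave like a survival function. Real prices include spread.
thresholds = [45, 50, 55]
pct_at_least = [18, 9, 4]


def implied_buckets(thresholds, pct_at_least):
    """Return (label, percentage points) for each implied score band."""
    # Everything not covered by the lowest threshold falls below it.
    buckets = [(f"below {thresholds[0]}%", 100 - pct_at_least[0])]
    for i, t in enumerate(thresholds):
        if i + 1 < len(thresholds):
            # Band between two adjacent thresholds: difference of prices.
            label = f"{t}% to below {thresholds[i + 1]}%"
            pct = pct_at_least[i] - pct_at_least[i + 1]
        else:
            # Top band is just the price of the highest threshold.
            label = f"{t}% or higher"
            pct = pct_at_least[i]
        buckets.append((label, pct))
    return buckets


for label, pct in implied_buckets(thresholds, pct_at_least):
    print(f"{label}: {pct}%")
```

Under these assumptions the bands sum to 100 percentage points; a negative band would indicate that the quoted prices are not mutually consistent.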
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market opened: Jan 30, 2026, 12:00 AM ET
Resolver: 0x65070BE91...
Frequently asked questions