Anthropic's Claude Opus 4.6, released February 5, 2026, leads Humanity's Last Exam (HLE) leaderboards with 34.4% accuracy in "thinking" mode on this 2,500-question frontier benchmark probing expert-level reasoning across multimodal domains, up sharply from prior models under 10%. A March 27 leak of "Claude Mythos"—Anthropic's largest model with unprecedented reasoning capabilities—has sparked trader optimism for breakthroughs before June 30, amid the firm's rapid cadence seen with Sonnet 4.6 shortly after. OpenAI's GPT-5.4 edges ahead at 41.6% in some evals, intensifying competition, while HLE's design anticipates 50% thresholds soon; key catalysts include model launches or third-party benchmarks, though scaling hurdles and unconfirmed timelines introduce uncertainty.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten · Aktualisiert$187,338 Vol.
35 %+
94%
45 %+
49%
$187,338 Vol.
35 %+
94%
45 %+
49%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Markt eröffnet: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Anthropic's Claude Opus 4.6, released February 5, 2026, leads Humanity's Last Exam (HLE) leaderboards with 34.4% accuracy in "thinking" mode on this 2,500-question frontier benchmark probing expert-level reasoning across multimodal domains, up sharply from prior models under 10%. A March 27 leak of "Claude Mythos"—Anthropic's largest model with unprecedented reasoning capabilities—has sparked trader optimism for breakthroughs before June 30, amid the firm's rapid cadence seen with Sonnet 4.6 shortly after. OpenAI's GPT-5.4 edges ahead at 41.6% in some evals, intensifying competition, while HLE's design anticipates 50% thresholds soon; key catalysts include model launches or third-party benchmarks, though scaling hurdles and unconfirmed timelines introduce uncertainty.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten · Aktualisiert
Vorsicht bei externen Links.
Vorsicht bei externen Links.
Häufig gestellte Fragen