Anthropic's Claude Opus 4.6, released February 5, 2026, has propelled trader sentiment by securing a leading 34.4% score on Humanity's Last Exam (HLE)—a rigorous 2,500-question benchmark spanning expert-level topics in math, sciences, and humanities—using extended "thinking-max" configurations on Scale AI's leaderboard, though trailing OpenAI's GPT-5.4 at 44%. This marks a leap from prior Claude models under 14%, underscoring advances in agentic reasoning and tool use amid benchmark saturation challenges. Competitive pressure from Google's Gemini 3 and xAI intensifies, with Claude 5 anticipated in Q2 2026 potentially elevating scores toward 45% thresholds. Key watchpoints include Anthropic's next capability demos or API updates before June 30 resolution.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten · Aktualisiert$187,563 Vol.
35 %+
94%
45 %+
40%
$187,563 Vol.
35 %+
94%
45 %+
40%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Markt eröffnet: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...Vorgeschlagenes Ergebnis: Ja
Kein Einspruch
Endgültiges Ergebnis: Ja
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Vorgeschlagenes Ergebnis: Ja
Kein Einspruch
Endgültiges Ergebnis: Ja
Anthropic's Claude Opus 4.6, released February 5, 2026, has propelled trader sentiment by securing a leading 34.4% score on Humanity's Last Exam (HLE)—a rigorous 2,500-question benchmark spanning expert-level topics in math, sciences, and humanities—using extended "thinking-max" configurations on Scale AI's leaderboard, though trailing OpenAI's GPT-5.4 at 44%. This marks a leap from prior Claude models under 14%, underscoring advances in agentic reasoning and tool use amid benchmark saturation challenges. Competitive pressure from Google's Gemini 3 and xAI intensifies, with Claude 5 anticipated in Q2 2026 potentially elevating scores toward 45% thresholds. Key watchpoints include Anthropic's next capability demos or API updates before June 30 resolution.
Experimentelle KI-generierte Zusammenfassung mit Polymarket-Daten · Aktualisiert
Vorsicht bei externen Links.
Vorsicht bei externen Links.
Häufig gestellte Fragen