Claude Fable 5’s recent deployment with adaptive reasoning and fallback mechanisms has lifted Anthropic’s top score on the Humanity’s Last Exam leaderboard to 53.3 percent as of mid-June 2026, ahead of Gemini 3.1 Pro Preview and GPT-5.4 variants. This positions the model well for the 50 percent threshold while leaving the 55 percent outcome more contested ahead of the June 30 cutoff. Traders are watching for any final leaderboard updates, extended-thinking optimizations, or decontamination adjustments Anthropic may submit in the remaining days, alongside potential competitive pushes from Google or OpenAI that could influence relative standings on the 2,500-question expert benchmark. The tight timeline and history of incremental gains on this saturated evaluation mean small technical tweaks could still shift whether higher brackets clear before resolution.
Experimental AI-generated summary that references Polymarket data. This is not trading advice and plays no role in how this market is resolved. · Updated
Claude score on Humanity’s Last Exam by June 30?
$367,786 Vol.
45%+: 39%
50%+: 17%
55%+: 7%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market Opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...
Be careful with external links.
Frequently Asked Questions