OpenAI's GPT-5.4, released in early March 2026, achieved 39.8% accuracy on Humanity’s Last Exam—a 2,500-question benchmark of expert-level challenges across math, sciences, and humanities—without tools, marking an 8% gain over GPT-5.2 in two months and surpassing prior state-of-the-art models like Claude Opus 4.6 at around 34%. This leap underscores accelerating AI scaling in reasoning and knowledge synthesis, fueling trader consensus on potential 50%+ thresholds by June 30 amid competitive pressure from Anthropic, Google DeepMind, and xAI. However, benchmark saturation risks and unproven scaling limits introduce uncertainty, with no confirmed GPT-5.5 timeline; watch for developer previews or capability demos that could shift market-implied odds.
Experimental AI-generated summary referencing Polymarket data · Updated50%+
31%
$3,406 Vol.
50%+
31%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market Opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...Outcome proposed: Yes
No dispute
Final outcome: Yes
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...Outcome proposed: Yes
No dispute
Final outcome: Yes
OpenAI's GPT-5.4, released in early March 2026, achieved 39.8% accuracy on Humanity’s Last Exam—a 2,500-question benchmark of expert-level challenges across math, sciences, and humanities—without tools, marking an 8% gain over GPT-5.2 in two months and surpassing prior state-of-the-art models like Claude Opus 4.6 at around 34%. This leap underscores accelerating AI scaling in reasoning and knowledge synthesis, fueling trader consensus on potential 50%+ thresholds by June 30 amid competitive pressure from Anthropic, Google DeepMind, and xAI. However, benchmark saturation risks and unproven scaling limits introduce uncertainty, with no confirmed GPT-5.5 timeline; watch for developer previews or capability demos that could shift market-implied odds.
Experimental AI-generated summary referencing Polymarket data · Updated
Beware of external links.
Beware of external links.
Frequently Asked Questions