Anthropic’s May 28, 2026 release of Claude Opus 4.8, which posted a leading 45.7% score on Humanity’s Last Exam under adaptive reasoning and max-effort settings, is the main factor shaping trader sentiment. The 2,500-question benchmark, developed by the Center for AI Safety and Scale AI, draws from expert contributors across mathematics, physics, biology, and other fields to test frontier large language models well beyond saturated evaluations. Claude’s rapid Opus iteration cycle has lifted results from single digits in early 2025 into the mid-40s, edging out recent Gemini 3.1 Pro Preview and GPT-5 variants. With the June 30 cutoff approaching, traders are monitoring for additional reasoning enhancements or new leaderboard entries, though the benchmark’s design keeps even top models far below human-expert levels near 90%.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and plays no role in how this market resolves.
$293,935 Vol.
45%+ · 44%
50%+ · 17%
55%+ · 7%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Market Opened: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...
Outcome proposed: Yes
No dispute
Final outcome: Yes

Frequently Asked Questions