OpenAI's GPT-5.4 currently sits near the top of Humanity's Last Exam leaderboards with scores ranging from 41.6% to 58.7% across independent evaluations, trailing select Gemini 3.1 and Claude variants that have reached 44-64% on the 2,500-question expert benchmark. The short window to June 30 limits scope for major new model releases or fine-tuning cycles, though incremental updates or inference optimizations could shift relative positioning before resolution. Traders monitor official OpenAI announcements and third-party benchmark runs for any capability gains in reasoning-heavy categories, where frontier large language models still show substantial headroom against the test's design to resist saturation.
Riepilogo sperimentale generato dall'AI con riferimento ai dati di Polymarket. Questo non è un consiglio di trading e non ha alcun ruolo nella risoluzione di questo mercato. · AggiornatoPunteggio OpenAI GPT all'ultimo esame dell'umanità entro il 30 giugno?
$25,191 Vol.
50%+
8%
$25,191 Vol.
50%+
8%
The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Mercato aperto: Jan 30, 2026, 12:00 AM ET
Resolver
0x65070BE91...The resolution source will be the official Humanity’s Last Exam leaderboard https://scale.com/leaderboard/humanitys_last_exam.
Resolver
0x65070BE91...OpenAI's GPT-5.4 currently sits near the top of Humanity's Last Exam leaderboards with scores ranging from 41.6% to 58.7% across independent evaluations, trailing select Gemini 3.1 and Claude variants that have reached 44-64% on the 2,500-question expert benchmark. The short window to June 30 limits scope for major new model releases or fine-tuning cycles, though incremental updates or inference optimizations could shift relative positioning before resolution. Traders monitor official OpenAI announcements and third-party benchmark runs for any capability gains in reasoning-heavy categories, where frontier large language models still show substantial headroom against the test's design to resist saturation.
Riepilogo sperimentale generato dall'AI con riferimento ai dati di Polymarket. Questo non è un consiglio di trading e non ha alcun ruolo nella risoluzione di questo mercato. · Aggiornato
Fai attenzione ai link esterni.
Fai attenzione ai link esterni.
Domande frequenti