Google DeepMind's Gemini Deep Think and OpenAI's advanced reasoning models achieved gold medal-level performance on the 2025 International Mathematical Olympiad (IMO) problems, scoring 35 out of 42 points by solving five of the six problems. The result was certified by the IMO committee and marks a step up from DeepMind's silver medal-level result in 2024. This rapid progress in AI mathematical reasoning, which blends large language models with formal verification techniques such as successors to AlphaProof, has pushed the trader-implied probability of an AI securing a full gold medal at IMO 2026 to 72%. Labs including OpenAI continue to push benchmarks, though increasingly difficult problems and verification hurdles temper certainty ahead of the July event.
Experimental AI summary based on Polymarket data · Updated
Yes
The resolution source is the IMO Grand Challenge (https://imo-grand-challenge.github.io/) and the Artificial Intelligence Math Olympiad (AIMO, https://aimoprize.com/). If either source demonstrates that an AI has won the challenge/prize before the resolution date, this market will resolve to "Yes".
Market opened: Nov 12, 2025, 5:08 PM ET
Resolver
0x65070BE91...
Beware of external links.
Frequently Asked Questions