Google DeepMind's experimental Gemini 2.0 Flash model surged to the top of the LMSYS Chatbot Arena leaderboard last week, achieving a record Elo score above 1330 through enhanced reasoning and speed, intensifying the large language model race. This follows OpenAI's November GPT-4o update (now ~1290 Elo) and Anthropic's Claude 3.5 Sonnet refinements, with iterative releases driving ~50-point gains in the past month via better benchmark performance in blind user votes. Traders eye year-end catalysts like xAI's Grok-3 rollout this month, potential OpenAI Orion previews, and further Gemini iterations, which could elevate the top score toward 1350+ by December 31—though integration hurdles or AI safety pauses pose downside risks to this trajectory.
基于Polymarket数据的AI实验性摘要 · 更新于$65,335 交易量
↑ 1550
57%
↑ 1600
30%
↑ 1650
13%
↑ 1700
11%
$65,335 交易量
↑ 1550
57%
↑ 1600
30%
↑ 1650
13%
↑ 1700
11%
Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market.
The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".
市场开放时间: Jan 2, 2026, 1:29 PM ET
Resolver
0x65070BE91...Resolver
0x65070BE91...Google DeepMind's experimental Gemini 2.0 Flash model surged to the top of the LMSYS Chatbot Arena leaderboard last week, achieving a record Elo score above 1330 through enhanced reasoning and speed, intensifying the large language model race. This follows OpenAI's November GPT-4o update (now ~1290 Elo) and Anthropic's Claude 3.5 Sonnet refinements, with iterative releases driving ~50-point gains in the past month via better benchmark performance in blind user votes. Traders eye year-end catalysts like xAI's Grok-3 rollout this month, potential OpenAI Orion previews, and further Gemini iterations, which could elevate the top score toward 1350+ by December 31—though integration hurdles or AI safety pauses pose downside risks to this trajectory.
基于Polymarket数据的AI实验性摘要 · 更新于
警惕外部链接哦。
警惕外部链接哦。
常见问题