Market icon

聊天机器人竞技场:到12月31日,人工智能得分会有多高?

Market icon

聊天机器人竞技场:到12月31日,人工智能得分会有多高?

Dec 31

Dec 31

$65,865 交易量

Dec 31, 2026
Polymarket

$65,865 交易量

Polymarket

↑ 1550

$21,587 交易量

59%

↑ 1600

$10,523 交易量

27%

↑ 1650

$0 交易量

13%

↑ 1700

$0 交易量

11%

This market will resolve to "Yes" if any model on the Chatbot Arena LLM Leaderboard (https://lmarena.ai/) reaches at least the specified Arena Score by December 31, 2026, 11:59 PM ET. Otherwise, this market will resolve to "No". Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market. The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".Anthropic's Claude Opus 4.6 Thinking model recently surged to the top of Arena.ai's Text leaderboard with a 1504 Elo score, surpassing Google's Gemini 3.1 Pro Preview (1493) and xAI's new Grok 4.20 Beta (1491 preliminary), based on millions of crowdsourced blind battles evaluating large language model capabilities in reasoning, creative writing, and expert prompts. This reflects rapid iterative progress from frontier AI labs since early 2026, including OpenAI's GPT-5.4 High entering the top 10 in March, amid competitive dynamics pressuring higher benchmarks. Traders should monitor Q2 announcements like potential GPT-5.5 or Gemini 4.0 releases, regulatory scrutiny on AI safety, and risks of leaderboard saturation, as scores could climb toward 1550+ by year-end resolution on the style-unchecked Overall category.

Anthropic's Claude Opus 4.6 Thinking model recently surged to the top of Arena.ai's Text leaderboard with a 1504 Elo score, surpassing Google's Gemini 3.1 Pro Preview (1493) and xAI's new Grok 4.20 Beta (1491 preliminary), based on millions of crowdsourced blind battles evaluating large language model capabilities in reasoning, creative writing, and expert prompts. This reflects rapid iterative progress from frontier AI labs since early 2026, including OpenAI's GPT-5.4 High entering the top 10 in March, amid competitive dynamics pressuring higher benchmarks. Traders should monitor Q2 announcements like potential GPT-5.5 or Gemini 4.0 releases, regulatory scrutiny on AI safety, and risks of leaderboard saturation, as scores could climb toward 1550+ by year-end resolution on the style-unchecked Overall category.

基于Polymarket数据的AI实验性摘要 · 更新于
This market will resolve to "Yes" if any model on the Chatbot Arena LLM Leaderboard (https://lmarena.ai/) reaches at least the specified Arena Score by December 31, 2026, 11:59 PM ET. Otherwise, this market will resolve to "No". Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market. The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".Anthropic's Claude Opus 4.6 Thinking model recently surged to the top of Arena.ai's Text leaderboard with a 1504 Elo score, surpassing Google's Gemini 3.1 Pro Preview (1493) and xAI's new Grok 4.20 Beta (1491 preliminary), based on millions of crowdsourced blind battles evaluating large language model capabilities in reasoning, creative writing, and expert prompts. This reflects rapid iterative progress from frontier AI labs since early 2026, including OpenAI's GPT-5.4 High entering the top 10 in March, amid competitive dynamics pressuring higher benchmarks. Traders should monitor Q2 announcements like potential GPT-5.5 or Gemini 4.0 releases, regulatory scrutiny on AI safety, and risks of leaderboard saturation, as scores could climb toward 1550+ by year-end resolution on the style-unchecked Overall category.

Anthropic's Claude Opus 4.6 Thinking model recently surged to the top of Arena.ai's Text leaderboard with a 1504 Elo score, surpassing Google's Gemini 3.1 Pro Preview (1493) and xAI's new Grok 4.20 Beta (1491 preliminary), based on millions of crowdsourced blind battles evaluating large language model capabilities in reasoning, creative writing, and expert prompts. This reflects rapid iterative progress from frontier AI labs since early 2026, including OpenAI's GPT-5.4 High entering the top 10 in March, amid competitive dynamics pressuring higher benchmarks. Traders should monitor Q2 announcements like potential GPT-5.5 or Gemini 4.0 releases, regulatory scrutiny on AI safety, and risks of leaderboard saturation, as scores could climb toward 1550+ by year-end resolution on the style-unchecked Overall category.

基于Polymarket数据的AI实验性摘要 · 更新于

警惕外部链接哦。

常见问题

"聊天机器人竞技场:到12月31日,人工智能得分会有多高?"是 Polymarket 上一个拥有 5 个可能结果的预测市场,交易者根据自己的判断买卖份额。当前领先结果为"↑ 1500",概率为 100%,其次是"↑ 1550",概率为 59%。价格反映社区的实时概率。例如,价格为 100¢ 的份额意味着市场集体认为该结果的概率为 100%。这些赔率会随着交易者的反应而不断变化。正确结果的份额在市场结算时可兑换为每份 $1。

截至目前,"聊天机器人竞技场:到12月31日,人工智能得分会有多高?"已产生 $65.9K 的总交易量(自Jan 2, 2026市场上线以来)。这一活跃度反映了 Polymarket 社区的高度参与,并确保当前赔率由广泛的市场参与者共同形成。你可以直接在本页追踪实时价格变动并交易任何结果。

要在"聊天机器人竞技场:到12月31日,人工智能得分会有多高?"上交易,浏览本页上列出的 5 个可用结果。每个结果显示一个代表市场隐含概率的当前价格。要建仓,选择你认为最可能的结果,选择"是"支持或"否"反对,输入金额并点击"交易"。如果你选择的结果在市场结算时正确,你的"是"份额每份支付 $1。如果不正确,支付 $0。你也可以在结算前随时卖出份额。

"聊天机器人竞技场:到12月31日,人工智能得分会有多高?"的当前领先者是"↑ 1500",概率为 100%,意味着市场对该结果的概率评估为 100%。紧随其后的结果是"↑ 1550",概率为 59%。这些赔率随着交易者买卖份额而实时更新。请经常回来查看或将本页加入书签。

"聊天机器人竞技场:到12月31日,人工智能得分会有多高?"的结算规则明确定义了每个结果被宣布为获胜者所需满足的条件——包括用于确定结果的官方数据来源。你可以在本页评论上方的"规则"部分查看完整的结算标准。我们建议在交易前仔细阅读规则,因为它们规定了精确的条件、特殊情况和数据来源。