Market icon

Chatbot Arena: How high will AI score by December 31?

Market icon

Chatbot Arena: How high will AI score by December 31?

Dec 31

Dec 31

$65,865 Vol.

Dec 31, 2026
Polymarket

$65,865 Vol.

Polymarket

↑ 1550

$21,587 Vol.

59%

↑ 1600

$10,523 Vol.

27%

↑ 1650

$0 Vol.

13%

↑ 1700

$0 Vol.

11%

This market will resolve to "Yes" if any model on the Chatbot Arena LLM Leaderboard (https://lmarena.ai/) reaches at least the specified Arena Score by December 31, 2026, 11:59 PM ET. Otherwise, this market will resolve to "No". Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market. The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".Gemini 3.1 Pro holds the top spot on the LMSYS Chatbot Arena leaderboard with an Elo rating of 1505 as of late March 2026, edging out Anthropic's Claude Opus 4.6 at 1503, reflecting trader consensus on accelerating large language model (LLM) capabilities driven by recent releases like OpenAI's GPT-5.2-chat-latest (top 5 at 1478) and Claude Sonnet 4.6's coding surge. Intense competition among OpenAI, Anthropic, Google DeepMind, and xAI has fueled Elo gains of 20-40 points per iteration through enhanced instruction-following, multi-turn reasoning, and hard prompts, validated by millions of crowdsourced battles. Upcoming catalysts include full GPT-5 and Claude 4 Opus launches, potential developer conferences, and benchmark updates, though leaderboard volatility and scaling plateaus introduce uncertainty for year-end peaks.

Gemini 3.1 Pro holds the top spot on the LMSYS Chatbot Arena leaderboard with an Elo rating of 1505 as of late March 2026, edging out Anthropic's Claude Opus 4.6 at 1503, reflecting trader consensus on accelerating large language model (LLM) capabilities driven by recent releases like OpenAI's GPT-5.2-chat-latest (top 5 at 1478) and Claude Sonnet 4.6's coding surge. Intense competition among OpenAI, Anthropic, Google DeepMind, and xAI has fueled Elo gains of 20-40 points per iteration through enhanced instruction-following, multi-turn reasoning, and hard prompts, validated by millions of crowdsourced battles. Upcoming catalysts include full GPT-5 and Claude 4 Opus launches, potential developer conferences, and benchmark updates, though leaderboard volatility and scaling plateaus introduce uncertainty for year-end peaks.

Experimental AI-generated summary referencing Polymarket data · Updated
This market will resolve to "Yes" if any model on the Chatbot Arena LLM Leaderboard (https://lmarena.ai/) reaches at least the specified Arena Score by December 31, 2026, 11:59 PM ET. Otherwise, this market will resolve to "No". Results from the 'Score' section on the 'Text Arena' Leaderboard tab (https://lmarena.ai/leaderboard/text), with the style control unchecked, will be used to resolve this market. The resolution source is the Chatbot Arena LLM Leaderboard (https://lmarena.ai/). If this source is temporarily unavailable, the market remains open until it is accessible again; if permanently unavailable, this market will resolve to "No".Gemini 3.1 Pro holds the top spot on the LMSYS Chatbot Arena leaderboard with an Elo rating of 1505 as of late March 2026, edging out Anthropic's Claude Opus 4.6 at 1503, reflecting trader consensus on accelerating large language model (LLM) capabilities driven by recent releases like OpenAI's GPT-5.2-chat-latest (top 5 at 1478) and Claude Sonnet 4.6's coding surge. Intense competition among OpenAI, Anthropic, Google DeepMind, and xAI has fueled Elo gains of 20-40 points per iteration through enhanced instruction-following, multi-turn reasoning, and hard prompts, validated by millions of crowdsourced battles. Upcoming catalysts include full GPT-5 and Claude 4 Opus launches, potential developer conferences, and benchmark updates, though leaderboard volatility and scaling plateaus introduce uncertainty for year-end peaks.

Gemini 3.1 Pro holds the top spot on the LMSYS Chatbot Arena leaderboard with an Elo rating of 1505 as of late March 2026, edging out Anthropic's Claude Opus 4.6 at 1503, reflecting trader consensus on accelerating large language model (LLM) capabilities driven by recent releases like OpenAI's GPT-5.2-chat-latest (top 5 at 1478) and Claude Sonnet 4.6's coding surge. Intense competition among OpenAI, Anthropic, Google DeepMind, and xAI has fueled Elo gains of 20-40 points per iteration through enhanced instruction-following, multi-turn reasoning, and hard prompts, validated by millions of crowdsourced battles. Upcoming catalysts include full GPT-5 and Claude 4 Opus launches, potential developer conferences, and benchmark updates, though leaderboard volatility and scaling plateaus introduce uncertainty for year-end peaks.

Experimental AI-generated summary referencing Polymarket data · Updated

Beware of external links.

Frequently Asked Questions

"Chatbot Arena: How high will AI score by December 31?" is a prediction market on Polymarket with 5 possible outcomes where traders buy and sell shares based on what they believe will happen. The current leading outcome is "↑ 1500" at 100%, followed by "↑ 1550" at 59%. Prices reflect real-time crowd-sourced probabilities. For example, a share priced at 100¢ implies that the market collectively assigns a 100% chance to that outcome. These odds shift continuously as traders react to new developments and information. Shares in the correct outcome are redeemable for $1 each upon market resolution.

As of today, "Chatbot Arena: How high will AI score by December 31?" has generated $65.9K in total trading volume since the market launched on Jan 2, 2026. This level of trading activity reflects strong engagement from the Polymarket community and helps ensure that the current odds are informed by a deep pool of market participants. You can track live price movements and trade on any outcome directly on this page.

To trade on "Chatbot Arena: How high will AI score by December 31?," browse the 5 available outcomes listed on this page. Each outcome displays a current price representing the market's implied probability. To take a position, select the outcome you believe is most likely, choose "Yes" to trade in favor of it or "No" to trade against it, enter your amount, and click "Trade." If your chosen outcome is correct when the market resolves, your "Yes" shares pay out $1 each. If it's incorrect, they pay out $0. You can also sell your shares at any time before resolution if you want to lock in a profit or cut a loss.

The current frontrunner for "Chatbot Arena: How high will AI score by December 31?" is "↑ 1500" at 100%, meaning the market assigns a 100% chance to that outcome. The next closest outcome is "↑ 1550" at 59%. These odds update in real-time as traders buy and sell shares, so they reflect the latest collective view of what's most likely to happen. Check back frequently or bookmark this page to follow how the odds shift as new information emerges.

The resolution rules for "Chatbot Arena: How high will AI score by December 31?" define exactly what needs to happen for each outcome to be declared a winner — including the official data sources used to determine the result. You can review the complete resolution criteria in the "Rules" section on this page above the comments. We recommend reading the rules carefully before trading, as they specify the precise conditions, edge cases, and sources that govern how this market is settled.