Anthropic's Claude 3 Opus model solidified its position as the second-best large language model on the LMSYS Chatbot Arena leaderboard by the end of March 2024, trailing only OpenAI's GPT-4 Turbo Preview with an Elo score around 1270. That standing drives the market's 97.5% implied probability, as traders anchor on this real-money consensus benchmark for AI capabilities. The model's March 4 launch showcased superior reasoning, coding, and multimodal performance across evaluations, outpacing Google's Gemini 1.5 Pro and others such as Mistral and DeepSeek, with no significant leaderboard shifts in the following weeks. This positioning reflects demonstrated benchmark results rather than marketing claims, though rare scenarios such as revised Elo calculations, alternative benchmarks (e.g., MMLU), or late-month model updates from competitors could theoretically challenge it before resolution.
Experimental AI summary based on Polymarket data · Updated
Anthropic 97.4%
xAI <1%
Google <1%
DeepSeek <1%
$525,509 volume

Anthropic 97%
xAI 1%
Google <1%
DeepSeek <1%
OpenAI <1%
Alibaba <1%
Baidu <1%
Moonshot <1%
Z.ai <1%
Mistral <1%
Meituan <1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the second-best arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g. if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No").
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
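The tie-break rule above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the `Entry` type, the function name `second_best_company`, the model labels, and the scores are hypothetical placeholders, not leaderboard data. The sketch also assumes that "second best" means the second-highest distinct score and that alphabetical order is case-insensitive; the rules do not spell out either point.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    company: str       # company name as written in the market group
    model: str         # hypothetical model label
    arena_score: float  # Arena Score with style control off


def second_best_company(entries):
    """Return the company that would resolve to "Yes" under the tie rule.

    Assumptions (not stated in the market rules):
    - "second best" is the second-highest distinct score;
    - alphabetical comparison ignores letter case.
    """
    distinct = sorted({e.arena_score for e in entries}, reverse=True)
    if len(distinct) < 2:
        raise ValueError("need at least two distinct arena scores")
    second = distinct[1]
    # Every company with a model sitting exactly on the second-best score.
    tied = [e.company for e in entries if e.arena_score == second]
    # Tie-break: the name that comes first alphabetically wins.
    return min(tied, key=str.casefold)


# Placeholder rows; the scores are invented to exercise the tie rule.
rows = [
    Entry("OpenAI", "model-a", 1500),
    Entry("Google", "model-b", 1490),
    Entry("xAI", "model-c", 1490),
    Entry("Anthropic", "model-d", 1480),
]
print(second_best_company(rows))  # prints "Google"
```

With these placeholder rows, "Google" and "xAI" share the second-best score, so "Google" wins the tie, matching the example given in the rules.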
Market opened: Dec 2, 2025, 6:02 PM ET
Resolver
0x2F5e3684c...
Frequently asked questions