Anthropic's Claude Opus 4.6 (thinking) dominates trader consensus at 100% implied probability as the top large language model on the LMSYS Chatbot Arena leaderboard under Style Control Off, holding the highest Elo score of 1504 as of April 10. This positioning reflects its February 2026 launch advantages—adaptive extended thinking for complex reasoning, a 1M-token context window, and leading benchmarks in coding (e.g., SWE-Bench) and agentic tasks—sustained through consistent battle wins against rivals like Gemini 3.1 Pro preview (1493 Elo) and Grok-4.20-beta1 despite user reports of minor post-release degradation. Real capital at stake underscores crowd wisdom, though a late surge in rival votes or unexpected leaderboard update could theoretically challenge it before resolution.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizadoclaude-opus-4-6-thinking 100.0%
claude-opus-4-6 <1%
gemini-3.1-pro-preview <1%
gemini-3-pro <1%
$64,801 Vol.
$64,801 Vol.
claude-opus-4-6
No
claude-opus-4-6-thinking
Yes
gemini-3.1-pro-preview
No
gemini-3-pro
No
gemini-2.5-pro
No
grok-4.20-beta1
No
gemini-3-flash
No
dola-seed-2.0-preview
No
qwen3.5-max-preview
No
kimi-k2.5-thinking
No
gpt-5.4-high
No
claude-opus-4-6-thinking 100.0%
claude-opus-4-6 <1%
gemini-3.1-pro-preview <1%
gemini-3-pro <1%
$64,801 Vol.
$64,801 Vol.
claude-opus-4-6
No
claude-opus-4-6-thinking
Yes
gemini-3.1-pro-preview
No
gemini-3-pro
No
gemini-2.5-pro
No
grok-4.20-beta1
No
gemini-3-flash
No
dola-seed-2.0-preview
No
qwen3.5-max-preview
No
kimi-k2.5-thinking
No
gpt-5.4-high
No
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) used as a tiebreaker (e.g., if the two models are tied by arena score, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve based on the model that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Mercado abierto: Apr 2, 2026, 5:57 PM ET
Resolver
0x69c47De9D...Resultado propuesto: No
Sin disputa
Resultado final: No
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of model names as listed in this market group (full string, including suffixes such as “-thinking”) used as a tiebreaker (e.g., if the two models are tied by arena score, “claude-opus-4-6” would be ranked ahead of “claude-opus-4-6-thinking”). This market will resolve based on the model that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...Resultado propuesto: No
Sin disputa
Resultado final: No
Anthropic's Claude Opus 4.6 (thinking) dominates trader consensus at 100% implied probability as the top large language model on the LMSYS Chatbot Arena leaderboard under Style Control Off, holding the highest Elo score of 1504 as of April 10. This positioning reflects its February 2026 launch advantages—adaptive extended thinking for complex reasoning, a 1M-token context window, and leading benchmarks in coding (e.g., SWE-Bench) and agentic tasks—sustained through consistent battle wins against rivals like Gemini 3.1 Pro preview (1493 Elo) and Grok-4.20-beta1 despite user reports of minor post-release degradation. Real capital at stake underscores crowd wisdom, though a late surge in rival votes or unexpected leaderboard update could theoretically challenge it before resolution.
Resumen experimental generado por IA con datos de Polymarket. Esto no es asesoramiento de trading y no influye en cómo se resuelve este mercado. · Actualizado
Cuidado con los enlaces externos.
Cuidado con los enlaces externos.
Preguntas frecuentes