**Anthropic's Claude Opus update released April 16 has cemented its dominance in coding benchmarks like SWE-bench Verified, where it achieves the top score of 80.9%, outpacing OpenAI's GPT-5.4 and others amid trader consensus implying 94.5% odds of retaining the lead by April's end.** This follows the April 7 Claude Mythos preview, which shattered records at 93.9% on SWE-bench—though limited to select partners—highlighting superior agentic coding capabilities for complex, multi-file tasks via tools like Claude Code. No rival releases have closed the gap in the past week, with OpenAI's Codex trailing significantly. Realistic challenges include a surprise OpenAI or Google model launch with verified benchmark gains before resolution, though historical timelines suggest low likelihood given development cycles.
Polymarket verilerine atıfta bulunan deneysel AI tarafından oluşturulmuş özet. Bu bir işlem tavsiyesi değildir ve bu piyasanın nasıl çözümlendiğinde hiçbir rolü yoktur. · GüncellendiAnthropic 95%
OpenAI 4.3%
DeepSeek <1%
Alibaba <1%
$122,914 Hac.
$122,914 Hac.

Anthropic
95%

OpenAI
4%

DeepSeek
1%

Alibaba
<1%

Z.ai
<1%

xAI
<1%

ByteDance
<1%

Baidu
<1%

Moonshot
<1%

Mistral
<1%

Meituan
<1%

<1%

Amazon
<1%
Anthropic 95%
OpenAI 4.3%
DeepSeek <1%
Alibaba <1%
$122,914 Hac.
$122,914 Hac.

Anthropic
95%

OpenAI
4%

DeepSeek
1%

Alibaba
<1%

Z.ai
<1%

xAI
<1%

ByteDance
<1%

Baidu
<1%

Moonshot
<1%

Mistral
<1%

Meituan
<1%

<1%

Amazon
<1%
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of company names as listed in this market group used as a tiebreaker (e.g., if the two models are tied by arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Piyasa Açıldı: Apr 2, 2026, 5:39 PM ET
Resolver
0x69c47De9D...Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of company names as listed in this market group used as a tiebreaker (e.g., if the two models are tied by arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...**Anthropic's Claude Opus update released April 16 has cemented its dominance in coding benchmarks like SWE-bench Verified, where it achieves the top score of 80.9%, outpacing OpenAI's GPT-5.4 and others amid trader consensus implying 94.5% odds of retaining the lead by April's end.** This follows the April 7 Claude Mythos preview, which shattered records at 93.9% on SWE-bench—though limited to select partners—highlighting superior agentic coding capabilities for complex, multi-file tasks via tools like Claude Code. No rival releases have closed the gap in the past week, with OpenAI's Codex trailing significantly. Realistic challenges include a surprise OpenAI or Google model launch with verified benchmark gains before resolution, though historical timelines suggest low likelihood given development cycles.
Polymarket verilerine atıfta bulunan deneysel AI tarafından oluşturulmuş özet. Bu bir işlem tavsiyesi değildir ve bu piyasanın nasıl çözümlendiğinde hiçbir rolü yoktur. · Güncellendi
Harici bağlantılara dikkat edin.
Harici bağlantılara dikkat edin.
Sıkça Sorulan Sorular