**Anthropic's Claude Opus update released April 16 has cemented its dominance in coding benchmarks like SWE-bench Verified, where it achieves the top score of 80.9%, outpacing OpenAI's GPT-5.4 and others amid trader consensus implying 94.5% odds of retaining the lead by April's end.** This follows the April 7 Claude Mythos preview, which shattered records at 93.9% on SWE-bench—though limited to select partners—highlighting superior agentic coding capabilities for complex, multi-file tasks via tools like Claude Code. No rival releases have closed the gap in the past week, with OpenAI's Codex trailing significantly. Realistic challenges include a surprise OpenAI or Google model launch with verified benchmark gains before resolution, though historical timelines suggest low likelihood given development cycles.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트앤트로픽 95%
OpenAI 4.3%
DeepSeek <1%
알리바바 <1%
$122,914 거래량
$122,914 거래량

앤트로픽
95%

OpenAI
4%

DeepSeek
1%

알리바바
<1%

Z.ai
<1%

xAI
<1%

바이트댄스
<1%

바이두
<1%

문샷
<1%

미스트랄
<1%

메이투안
<1%

구글
<1%

아마존
<1%
앤트로픽 95%
OpenAI 4.3%
DeepSeek <1%
알리바바 <1%
$122,914 거래량
$122,914 거래량

앤트로픽
95%

OpenAI
4%

DeepSeek
1%

알리바바
<1%

Z.ai
<1%

xAI
<1%

바이트댄스
<1%

바이두
<1%

문샷
<1%

미스트랄
<1%

메이투안
<1%

구글
<1%

아마존
<1%
Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of company names as listed in this market group used as a tiebreaker (e.g., if the two models are tied by arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
마켓 개설일: Apr 2, 2026, 5:39 PM ET
Resolver
0x69c47De9D...Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market.
Models will be ranked primarily by their arena score at this market’s check time, with alphabetical order of company names as listed in this market group used as a tiebreaker (e.g., if the two models are tied by arena score, “Google” would be ranked ahead of “xAI”). This market will resolve based on the company that occupies first place under this ranking.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and will resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
Resolver
0x69c47De9D...**Anthropic's Claude Opus update released April 16 has cemented its dominance in coding benchmarks like SWE-bench Verified, where it achieves the top score of 80.9%, outpacing OpenAI's GPT-5.4 and others amid trader consensus implying 94.5% odds of retaining the lead by April's end.** This follows the April 7 Claude Mythos preview, which shattered records at 93.9% on SWE-bench—though limited to select partners—highlighting superior agentic coding capabilities for complex, multi-file tasks via tools like Claude Code. No rival releases have closed the gap in the past week, with OpenAI's Codex trailing significantly. Realistic challenges include a surprise OpenAI or Google model launch with verified benchmark gains before resolution, though historical timelines suggest low likelihood given development cycles.
Polymarket 데이터를 참조하는 실험적 AI 생성 요약입니다. 이것은 거래 조언이 아니며 이 마켓의 정산에 영향을 미치지 않습니다. · 업데이트
외부 링크에 주의하세요.
외부 링크에 주의하세요.
자주 묻는 질문