xAI launched Grok 4.3 on May 1. The 500-billion-parameter large language model leads the CaseLaw v2 and CorpFin benchmarks while cutting input/output costs by 40-60%, and the launch highlighted its agentic reasoning gains and the Colossus supercluster advantage, focusing traders on an LMSYS Chatbot Arena debut. With Anthropic's Claude Opus 4.6 atop the Arena at 1504 Elo and the prior Grok 4.20 at #4 overall, the market hinges on whether this compact frontier model or its successor appears on the crowdsourced leaderboard, a key blind benchmark for real-world capabilities. xAI's rapid release cadence amid rivalry with OpenAI and Google keeps uncertainty high, and rumors of Grok-5 training are a potential 2026 catalyst.
Experimental AI-generated summary referencing Polymarket data. This is not trading advice and has no bearing on how this market resolves.
$28,291 volume
1440+: 62%
1460+: 43%
1480+: 20%
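The three quotes above are cumulative, so the probability the market implies for each score band can be recovered by subtraction. The sketch below does that arithmetic. It assumes "N+" means a score of at least N and that the three prices are mutually consistent; separately traded outcomes need not be, so the results are approximate. The function name `implied_buckets` is hypothetical.

```python
# Quoted prices for each cumulative threshold, copied from the listing above.
# Assumption: "1440+" means score >= 1440, and so on.
cumulative = {1440: 0.62, 1460: 0.43, 1480: 0.20}

def implied_buckets(cum):
    """Convert P(score >= t) quotes into probabilities for disjoint score bands."""
    thresholds = sorted(cum)
    buckets = {}
    # Everything not covered by the lowest threshold, including the case
    # where no qualifying score appears and every outcome resolves "No".
    buckets[f"below {thresholds[0]}"] = 1.0 - cum[thresholds[0]]
    # Each middle band is the difference between adjacent cumulative quotes.
    for lo, hi in zip(thresholds, thresholds[1:]):
        buckets[f"{lo} to <{hi}"] = cum[lo] - cum[hi]
    # The top band is simply the highest threshold's own quote.
    buckets[f"{thresholds[-1]}+"] = cum[thresholds[-1]]
    return {name: round(p, 2) for name, p in buckets.items()}

bands = implied_buckets(cumulative)
for name, p in bands.items():
    print(f"{name}: {p:.0%}")
```

Read this way, the quotes put roughly 19% on a score from 1440 to just under 1460, 23% on 1460 to just under 1480, 20% on 1480 or higher, and the remaining 38% on a lower score or no qualifying score at all.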
Results from the "Score" column under the "Text Arena | Overall" Leaderboard tab at https://lmarena.ai/leaderboard/text with style control off will be used to resolve this market.
If no qualifying score for the specified model is available on the Arena.AI Leaderboard at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If no qualifying score becomes available by the end of the seventh day following the day of the model’s release, or if no qualifying model release occurs by December 31, 2026, 11:59 PM ET, this market will resolve to "No".
If multiple models are released on the same calendar date or if multiple variants of the specified model appear on the Arena.AI Leaderboard at the relevant check time (e.g., base, “Thinking,” or “Instant”), the highest-scoring variant will be used for resolution.
A qualifying model must be launched and publicly accessible, including via open beta or open rolling waitlist signups. A closed beta or any form of private access will not suffice. The release must be either clearly defined and publicly announced as being accessible to the general public or otherwise made publicly accessible and explicitly labeled within the company’s official website. Labeling errors, placeholder text, or version names displayed on the website that do not correspond to a model that is actually accessible to the general public will not qualify.
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text. If this resolution source is unavailable at 12:00 PM ET following the date of the release, this market will resolve based on the first subsequent instance at which such a score becomes available on the leaderboard. If it remains unavailable through the end of the seventh day after a qualifying release, it will resolve to "No".
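The timing and tie-breaking rules above can be sketched as a small function. This is an illustrative reading of the rules, not Polymarket's actual resolution procedure. It assumes that "12:00 PM ET following the date of the release" means noon on the day after release, that an outcome such as "1460+" resolves "Yes" when the score is at least 1460, and that all timestamps are naive Eastern Time wall-clock values. The `resolve` function, the snapshot structure, and the variant names and scores in the example are hypothetical.

```python
from datetime import date, datetime, time, timedelta

RELEASE_CUTOFF = date(2026, 12, 31)  # last day on which a qualifying release counts

def resolve(release_day, threshold, snapshots):
    """Return "Yes" or "No" for one threshold outcome.

    release_day: date of the qualifying public release, or None if there was none.
    snapshots:   list of (timestamp, {variant_name: score}) leaderboard readings;
                 an empty dict means no qualifying score was listed at that time.
    """
    if release_day is None or release_day > RELEASE_CUTOFF:
        return "No"
    # Assumption: the first check is at noon ET on the day after release.
    first_check = datetime.combine(release_day + timedelta(days=1), time(12, 0))
    # The window closes at the end of the seventh day after the release day.
    deadline = datetime.combine(release_day + timedelta(days=8), time(0, 0))
    for ts, scores in sorted(snapshots, key=lambda s: s[0]):
        if ts < first_check or not scores:
            continue  # too early, or nothing listed yet
        if ts >= deadline:
            break     # the seven-day window has passed
        # If several variants are listed, the highest-scoring one is used.
        return "Yes" if max(scores.values()) >= threshold else "No"
    return "No"

# Hypothetical readings: no score at the first check, two variants a day later.
example = [
    (datetime(2026, 5, 2, 12, 0), {}),
    (datetime(2026, 5, 3, 12, 0), {"base": 1455, "thinking": 1462}),
]
outcomes = {t: resolve(date(2026, 5, 1), t, example) for t in (1440, 1460, 1480)}
print(outcomes)
```

In this made-up example the higher-scoring variant (1462) decides the result, so the 1440+ and 1460+ outcomes would resolve "Yes" and 1480+ would resolve "No".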
Market opened: May 1, 2026, 6:09 PM ET
Resolver
0x65070BE91...
Frequently Asked Questions