Anthropic's release of Claude 3.5 Sonnet on June 20 has driven its 39% implied probability as the trader consensus for the second-best AI model by end of June, topping the LMSYS Chatbot Arena leaderboard with superior coding (SWE-Bench) and reasoning benchmarks that briefly eclipsed OpenAI's GPT-4o. Google trails closely at 31.5% thanks to Gemini 1.5 Pro's advantages in long-context processing and multimodal capabilities, positioning it as a resilient contender amid competitive frontier large language model races. The narrow lead underscores uncertainty in volatile user-voted rankings and aggregate metrics, with xAI's Grok gaining niche traction via real-time data access but lacking broad benchmark dominance; resolution hinges on the June 30 LMSYS snapshot and any late tweaks from labs.
基於Polymarket數據的AI實驗性摘要 · 更新於Anthropic 39%
Google 32%
xAI 14%
OpenAI 11%

Anthropic
39%

32%

xAI
14%

OpenAI
11%

Mistral
6%

DeepSeek
5%

美團
2%

阿里巴巴
1%

Moonshot
1%

Z.ai
<1%
Anthropic 39%
Google 32%
xAI 14%
OpenAI 11%

Anthropic
39%

32%

xAI
14%

OpenAI
11%

Mistral
6%

DeepSeek
5%

美團
2%

阿里巴巴
1%

Moonshot
1%

Z.ai
<1%
Results from the "Arena Score" section on the Leaderboard tab of https://lmarena.ai/leaderboard/text with the style control off will be used to resolve this market.
If two models are tied for the second best arena score at this market's check time, resolution will be based on whichever company's name, as it is described in this market group, comes first in alphabetical order (e.g. if both were tied, "Google" would resolve to "Yes", and "xAI" would resolve to "No")
The resolution source for this market is the Chatbot Arena LLM Leaderboard found at https://lmarena.ai/. If this resolution source is unavailable at check time, this market will remain open until the leaderboard comes back online and resolve based on the first check after it becomes available. If it becomes permanently unavailable, this market will resolve based on another resolution source.
市場開放時間: Oct 10, 2025, 5:27 PM ET
Resolver
0x2F5e3684c...Resolver
0x2F5e3684c...Anthropic's release of Claude 3.5 Sonnet on June 20 has driven its 39% implied probability as the trader consensus for the second-best AI model by end of June, topping the LMSYS Chatbot Arena leaderboard with superior coding (SWE-Bench) and reasoning benchmarks that briefly eclipsed OpenAI's GPT-4o. Google trails closely at 31.5% thanks to Gemini 1.5 Pro's advantages in long-context processing and multimodal capabilities, positioning it as a resilient contender amid competitive frontier large language model races. The narrow lead underscores uncertainty in volatile user-voted rankings and aggregate metrics, with xAI's Grok gaining niche traction via real-time data access but lacking broad benchmark dominance; resolution hinges on the June 30 LMSYS snapshot and any late tweaks from labs.
基於Polymarket數據的AI實驗性摘要 · 更新於
警惕外部連結哦。
警惕外部連結哦。
Frequently Asked Questions