编程权威榜单:千问3.7仅次于Claude,阿里全球第二
Quick Take
The latest Code Arena programming benchmark reveals Alibaba's Qwen3.7-Max scoring 1541, placing it just behind Claude and surpassing models like GPT-5.5 and Gemini-3.5-Flash. This positions Alibaba as a leading player in AI development, showcasing significant advancements in their flagship model.
Key Points
- Qwen3.7-Max scores 1541 in the Code Arena benchmark.
- Model ranks second globally, just behind Claude.
- Surpasses notable competitors like GPT-5.5 and Gemini-3.5-Flash.
- Highlights Alibaba's advancements in AI technology.
- Strengthens Alibaba's position in the global AI landscape.
Article Excerpt
From source RSS / original summary5月26日凌晨,全球权威三方编程榜单Code Arena放榜,阿里最新旗舰模型Qwen3. 7-Max得分1541,超越GPT-5. 5、Gemini-3. 5-Flash、GLM-5. 1、Kimi-K2. 6等一众
Reader Mode unavailable (could not extract clean content).
Want this in your inbox every morning?
Daily brief at your local 8am — bilingual EN/中文, free.
More from WebSearch (Tavily)
See more →Anthropic releases new model, Opus 4.8 - Axios
Anthropic has launched Claude Opus 4.8, an upgrade to its AI model that enhances coding and knowledge work capabilities while maintaining the same price. Although it still trails behind the upcoming Mythos-class models, Opus 4.8 outperformed competitors in key benchmarks such as agentic coding and financial analysis.