Qwen2.5-Max is now ranked #7 OVERALL in the Chatbot Arena, ranked 1st in math and coding, and 2nd in hard prompts

Alibaba's Qwen2.5-Max is now ranked #7 OVERALL in the Chatbot Arena, surpassing DeepSeek V3, o1-mini and Claude-3.5-Sonnet.

Qwen2.5-Max is strong across domains, especially in technical ones (Coding, Math, Hard Prompts). It is ranked 1st in math and coding, and 2nd in hard prompts.

Also, the Qwen devs are working on reasoning models!! Stay tuned 🔥