Home ยป Revealed by Alibaba: Qwen2.5-Max, a Large-Scale AI MoE Model with Higher Test Scores than DeepSeek V3.

Revealed by Alibaba: Qwen2.5-Max, a Large-Scale AI MoE Model with Higher Test Scores than DeepSeek V3.

Alibaba has unveiled the Qwen2.5-Max artificial intelligence model, a large-scale MoE (Mixture-of-Expert) language model similar to DeepSeek V3. It has been pre-trained with over 20 trillion tokens and post-trained using SFT (Supervised Fine-Tuning) and RLHF (Reinforcement Learning from Human Feedback) methods.

In testing, Qwen2.5-Max outperformed DeepSeek-V3, GPT-4o, and Claude-3.5-Sonnet in categories like Arena-Hard and LiveBench. It scored higher than DeepSeek-V3 but lower than Claude-3.5-Sonnet in MMLU-Pro and LiveCodeBench categories.

Qwen2.5-Max is now available for use through Alibaba Cloud’s API and via the Qwen Chat service.

TLDR: Alibaba introduces Qwen2.5-Max AI model, surpassing competitors in various testing categories and now accessible through Alibaba Cloud API and Qwen Chat.

More Reading

Post navigation

Leave a Comment

Leave a Reply

Your email address will not be published. Required fields are marked *

GraphCast: Cutting-Edge AI Model by DeepMind Unveils Revolutionary Weather Forecasting Capabilities

Liang Wenfeng, Founder of DeepSeek, hailed as a hero upon returning home to celebrate Chinese New Year, as villagers applaud his achievements.

Advanced Artificial Intelligence poised to secure additional funding of up to $10 billion for a business valuation of $75 billion.