Home » Revealed by Alibaba: Qwen2.5-Max, a Large-Scale AI MoE Model with Higher Test Scores than DeepSeek V3.

Posted inin Technology

Revealed by Alibaba: Qwen2.5-Max, a Large-Scale AI MoE Model with Higher Test Scores than DeepSeek V3.

Posted byby
1 year ago

Alibaba has unveiled the Qwen2.5-Max artificial intelligence model, a large-scale MoE (Mixture-of-Expert) language model similar to DeepSeek V3. It has been pre-trained with over 20 trillion tokens and post-trained using SFT (Supervised Fine-Tuning) and RLHF (Reinforcement Learning from Human Feedback) methods.

In testing, Qwen2.5-Max outperformed DeepSeek-V3, GPT-4o, and Claude-3.5-Sonnet in categories like Arena-Hard and LiveBench. It scored higher than DeepSeek-V3 but lower than Claude-3.5-Sonnet in MMLU-Pro and LiveCodeBench categories.

Qwen2.5-Max is now available for use through Alibaba Cloud’s API and via the Qwen Chat service.

TLDR: Alibaba introduces Qwen2.5-Max AI model, surpassing competitors in various testing categories and now accessible through Alibaba Cloud API and Qwen Chat.

Revealed by Alibaba: Qwen2.5-Max, a Large-Scale AI MoE Model with Higher Test Scores than DeepSeek V3.

More Reading

AI Agent Goose Unveiled: Running Locally with Open-Source Flexibility, Opt for LLM Autonomy

Government of the United Kingdom proposes collecting fees for watching BBC from households without a television, while subscribing to streaming services like Netflix.

Leave a Comment

Leave a Reply Cancel reply

More Reading

Post navigation

Leave a Comment

Leave a Reply Cancel reply

DALL·E Unveiled by OpenAI: An Enigmatic AI Mastermind Transforming Imagery to Astonishing Precision While Seamlessly Embedding Textual Descriptions

OpenAI Unveils Continual Innovations in 12-Day Press Event Starting Tomorrow.

Unveiling Janus-Pro: The Cutting-Edge AI Model for Analyzing and Generating New Images