Alibaba Unveils Qwen2.5-Max, a Large-Scale MoE AI Model That Outscores DeepSeek V3 on Benchmarks
Alibaba has unveiled Qwen2.5-Max, a large-scale Mixture-of-Experts (MoE) language model similar to DeepSeek V3. It has been pre-trained on over 20 trillion tokens and post-trained using...