At its Build 2024 event last night, Microsoft unveiled the complete Phi-3 family of compact models, following the launch of Phi-3-mini in April.
The Phi-3 family comprises four models in total: three small language models (SLMs) plus Phi-3-vision, the first multimodal model in Microsoft’s open-source Phi series.
Phi-3-vision accepts both image and text input. With 4.2B parameters, it is tuned for reading charts and diagrams and for answering detailed questions about them. Microsoft’s own benchmarking found that it outperforms larger models such as Claude 3 Haiku and Gemini 1.0 Pro V across multiple test sets.
Anyone interested in trying Phi-3-vision can test it through the Azure AI Studio web interface.
An example of Phi-3-vision reading a chart and providing an explanation.
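For readers who prefer code over the web UI, a minimal sketch of the same chart question-answering task with the Phi-3-vision checkpoint published on Hugging Face might look like the following (the model ID matches Microsoft’s public repo; the local `chart.png` path and the prompt are illustrative assumptions):

```python
# Minimal sketch: asking Phi-3-vision about a chart via Hugging Face transformers.
# Assumes the transformers and pillow packages are installed; "chart.png" is a
# placeholder for any local chart image.
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

model_id = "microsoft/Phi-3-vision-128k-instruct"
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto", trust_remote_code=True
)
processor = AutoProcessor.from_pretrained(model_id, trust_remote_code=True)

# The <|image_1|> tag tells the processor where the attached image belongs.
messages = [{"role": "user", "content": "<|image_1|>\nWhat trend does this chart show?"}]
prompt = processor.tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

image = Image.open("chart.png")  # placeholder image path
inputs = processor(prompt, [image], return_tensors="pt").to(model.device)

output_ids = model.generate(**inputs, max_new_tokens=128)
# Strip the prompt tokens and decode only the model's answer.
answer = processor.decode(
    output_ids[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
)
print(answer)
```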
The Phi-3 language models are likewise designed to be compact, running on limited resources while remaining highly capable. They come in three sizes:
– Phi-3-mini with 3.8B parameters
– Phi-3-small with 7B parameters
– Phi-3-medium with 14B parameters
In Microsoft’s demonstrations, the mid-size Phi-3-small, at 7B parameters, outperformed the much larger GPT-3.5 Turbo, while the top-tier Phi-3-medium, at 14B parameters, surpassed Gemini 1.0 Pro.
Another intriguing point is that Microsoft says the Phi series is tuned to run on a wide range of hardware, not just NVIDIA GPUs: a partnership with Intel makes the models compatible with Intel hardware (Xeon, Gaudi, Arc, Core Ultra), and AMD is supported as well. The models also work with popular frameworks such as ONNX Runtime and DirectML, opening up deployment on portable devices and on the web.
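As an illustration of the ONNX Runtime path, a minimal sketch using the onnxruntime-genai package might look like this (it assumes a Phi-3 ONNX export has already been downloaded to a local folder; the path and prompt are placeholders, and the API shown matches the launch-era releases of the package):

```python
# Minimal sketch: running a locally downloaded Phi-3 ONNX export with the
# onnxruntime-genai package. The model folder path is a placeholder.
import onnxruntime_genai as og

model = og.Model("./phi-3-mini-4k-instruct-onnx")  # placeholder local path
tokenizer = og.Tokenizer(model)

params = og.GeneratorParams(model)
params.set_search_options(max_length=256)
params.input_ids = tokenizer.encode(
    "<|user|>\nSummarize the Phi-3 announcement in one sentence.<|end|>\n<|assistant|>"
)

generator = og.Generator(model, params)
while not generator.is_done():
    generator.compute_logits()
    generator.generate_next_token()

print(tokenizer.decode(generator.get_sequence(0)))
```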
The Phi-3 model family is now available through the Azure AI and Hugging Face platforms.
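As a quick illustration of the Hugging Face route, loading the Phi-3-mini instruct checkpoint with the transformers library might look like this (the repo ID matches Microsoft’s published model; the prompt is just an example):

```python
# Minimal sketch: chatting with Phi-3-mini via Hugging Face transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/Phi-3-mini-4k-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto", trust_remote_code=True
)

messages = [{"role": "user", "content": "Why do small language models matter?"}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=128)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```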
TLDR: Microsoft’s Phi-3 series delivers compact, high-performing language and multimodal models, tuned for broad hardware compatibility and available through Azure AI and Hugging Face.