Meet Microsoft's Phi-3 Vision, a small multimodal AI model capable of processing images and videos

Phi-3 Vision has 4.2 billion parameters.

Published on May 21, 2024

by Flavius Floare

Earlier this year, Microsoft released Phi-3 Mini, an AI model with 3.8 billion parameters that can run locally rather than in the cloud, and it’s quite impressive.

Today, at Microsoft Build 2024, the Redmond-based tech giant announced the new Phi-3 Vision, which accepts images and videos as prompts and generates answers based on them.

Microsoft made Phi-3 Vision bigger than its Mini version: it has 4.2 billion parameters, and it’s now available in a private preview.

There is no word yet on when it will become generally available, but the Redmond-based tech giant will most likely release it later this year.

Speaking of AI, Microsoft Teams will also get a Team Copilot model. Similar to Google’s AI Teammate, it will help users access information more easily and quickly.

Microsoft wasn’t joking when it said this year’s Build is all about AI.
