Gemini 1.5 Pro is here! Get ready to experience the next level of audio recognition

The new AI model is available in 180 different countries around the globe

News

Milan Stanojevic

Windows Toubleshooting Expert

News

2 min. read

Published on April 10, 2024

All major tech giants are working on AI models, and speaking of which, it seems that Google has released a new version of Gemini.

The Gemini 1.5 Pro has been released, and it offers some interesting features, so let’s dive into it and see what’s new.

Gemini 1.5 Pro is here, comes with an audio recognition feature

Google has recently updated its AI model, and Gemini 1.5 Pro is available in more than 180 countries through Google AI Studio’s public preview, as MSPowerUser writes.

Gemini now has a 1 million context window that allows developers to better analyze and understand information.

That’s not all, this version also has an audio recognition feature, so it can process spoken language. File upload is also supported, so you can upload an audio file and Gemini will analyze it.

Here’s what developers had to say about this feature:

You can upload a recording of a lecture, like 117,000+ token lecture from Jeff Dean, and Gemini 1.5 Pro can turn it into a quiz with an answer key.

The update also brings greater control and functionality to the developers, and there’s also support for system instructions so you can easily specify roles, formats, and goals.

Lastly, there’s JSON mode available that allows structured data extraction from both images and text. cURL is currently supported, and Python SDK support is coming soon, according to developers.

That’s not all from Google, there are also reports that Gemini will bring replay suggestions to Gmail for Android soon, so stay tuned.

More about the topics: Gemini, Google

Milan Stanojevic

Windows Toubleshooting Expert

Milan has been enthusiastic about technology ever since his childhood days, and this led him to take interest in all PC-related technologies. He's a PC enthusiast and he spends most of his time learning about computers and technology. Before joining WindowsReport, he worked as a front-end web developer. Now, he's one of the Troubleshooting experts in our worldwide team, specializing in Windows errors & software issues.

Readers help support Windows Report. We may get a commission if you buy through our links.

Improve this guide

User forum

0 messages

Sort by:

Gemini 1.5 Pro is here, comes with an audio recognition feature

Leave a Reply Cancel reply