Google has unveiled Gemini Nano, an artificial intelligence model for the latest smartphones that supports multiple input formats, including images, sound, and text. It also announced new capabilities for the Gemini app, which now integrates more closely with the Android system as a whole.
Gemini Nano can read images and describe them in detail without any prior data about the image. Google has built this ability into TalkBack, the accessibility feature for visually impaired users, so that it can describe images in detail and play the descriptions back as audio. Gemini Nano can also listen to audio continuously and notify the user immediately if a conversation seems deceptive. This alert feature is opt-in and must be activated manually. It will be available later this year, though Google has not specified which countries will be supported.
The Gemini app for Android will integrate further with the operating system, allowing users to summon Gemini on top of other apps. Because it recognizes what the user is currently doing, it can help with tasks such as creating images or answering questions about a video or document.
Lastly, the Circle to Search service can now understand new kinds of visual content, such as mathematical equations, graphs, and charts. Originally available only on the Samsung Galaxy S24, the service has since expanded to 100 million devices and aims to reach 200 million by the end of this year.
Source – Google Blog
**TLDR:** Google introduces Gemini Nano, an AI model for smartphones that supports image, sound, and text input. Features include detailed image descriptions in TalkBack, opt-in alerts for deceptive conversations, deeper Gemini app integration with Android, and an expanded Circle to Search that understands equations, graphs, and charts.