Google’s Pixel Recorder leverages Gemini Nano with Multimodality to enhance AI-powered summaries, boosting app engagement and user retention.

Google’s Pixel Recorder on the Pixel 9 series integrates Gemini Nano with Multimodality, a genAI model nearly twice as large as its predecessor. This model enables the Recorder to process image, audio, and text inputs, significantly improving its capabilities and accuracy. The AI-powered summarization feature has led to a 24% increase in saved recordings, with users engaging with the AI-powered summarization feature 2 to 5 times daily.

Additionally, Gemini Nano with Multimodality model’s larger size and improved functionality eliminate the need for extensive fine-tuning, simplifying the development process and supporting more creative applications. It also features expanded token support, allowing for the summarization of longer transcripts and the inclusion of grammar assessments for better inference quality.

Challenges in integrating this model were addressed by leveraging existing fine-tuning datasets, streamlining the process. The enhanced token support allows for summarizing longer transcripts and assessing grammar quality. This model’s capabilities reduce the need for extensive fine-tuning, enabling more creative applications.

Google is already developing additional GenAI features to further enhance user experience, with early internal testing underway. The focus remains on delivering advanced AI functionalities that simplify tasks and improve usability.