Multi-modal Support on Google Gemini Live Now Accessible to Pixel 9 Users



Alongside Samsung's launch of the Galaxy S25 series, Google also updated its AI chatbot, Gemini. Gemini Live now supports multi-modal input: users can attach PDF files, images or YouTube video links and then ask Gemini questions about them in conversation.


The feature was first rolled out to Galaxy S25, Galaxy S25+ and Galaxy S25 Ultra users. The Pixel 9 then began supporting Gemini Live with the ability to attach images, and most recently it added support for PDF files and YouTube videos. For PDF files, it only works if the user opens the file with the Files by Google app. For images, simply invoke Gemini, add the images and continue the conversation.


Finally, for YouTube videos, it only supports English-language videos or videos for which an English text transcript can be generated. Malay-language videos, such as those on our YouTube channel, still cannot be used.


In my testing, asking about images was quite interesting, and Gemini picked up many details that I had overlooked. For videos, it was impressive that even with a video over 10 minutes long, Gemini could answer a variety of questions, including details about the host, in the blink of an eye. For PDFs, the option still doesn't appear on my device, although it used to. It works the same way as uploading a file to the Gemini chatbot, only now in conversation mode instead of generating text.


This feature is now available on all Galaxy S25 devices. Once all Pixel 9 devices have received it, the Galaxy S24 will be next in line.
