Google Teases Real-Time Multimodal AI Camera Feature Ahead of I/O Conference

Updated: May 13 2024 20:17

Google has shared a sneak peek of a new multimodal AI-powered camera feature, just one day before its I/O developer conference and right after OpenAI announced its new GPT-4o model. The teaser video, shared on X (formerly Twitter), showcases a prototype that could change the way we interact with our smartphone cameras.


Here is the OpenAI GPT-4o video for comparison. Which demo do you think is better?


A Glimpse into the Future of AI-Powered Cameras

In the brief video, we see what appears to be a Pixel device with its camera pointed at the keynote stage at I/O. The user asks, "Hey, what do you think is happening here?" and a voice responds with a surprisingly accurate description of the scene:

It looks like people are setting up for a large event, perhaps a conference or presentation.


The AI not only recognizes the "IO" letters on the stage but also ties them to Google's developer conference, adding:

I am always excited to learn about new advancements in artificial intelligence, and how they can help people in their daily lives.


Is This the Next Generation of Google Lens?

While the exact nature of this feature remains unclear, it bears a striking resemblance to Google Lens, the company's existing camera-powered search tool. However, the teaser suggests that this new AI feature takes things to the next level by: