Google’s Gemini Omni AI Model Promises to Create 'Anything' From Any Type of Input

Google just announced Gemini Omni, a new AI model that it claims can “create anything from any input,” at its annual I/O developer conference on Tuesday. The company said the model is starting off with just video generation and editing capabilities. On its website, Google says to think of it like “Nano Banana — but for video,” referencing the company’s image model that came out last year. Gemini Omni Flash, the first model in the Omni family, can edit existing videos and generate new ones using plain-language prompts. It’s already available to try on the Gemini app, Google Flow AI studio, and YouTube Shorts. “With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini’s real-world knowledge. You can also easily edit your videos through conversation,” wrote Google DeepMind Chief Technology Officer Koray Kavukcuoglu in a blog post.

As with Nano Banana, users can make edits that build off each other through natural conversation. The model is designed to keep characters and environments consistent across edits and use its knowledge of the real world including history, biology, physics, and narrative logic to make clips that actually make sense.

Google’s Gemini Omni AI Model Promises to Create 'Anything' From Any Type of Input

Google’s Gemini Omni AI Model Promises to Create 'Anything' From Any Type of Input

Other newsrooms on this story

Related reading

Gemini Omni is a new family of AI models meant to ‘create anything’

Google unveils Gemini Omni 'any-to-any' AI model: what enterprises should know

Gemini 'Omni' Will Generate Media From Any Input, Starting With Video

Google Unveils Gemini Omni—A Next-Gen AI Video Builder That Can 'Simulate the…

Introducing Gemini Omni

Google's Nano Banana just got a powerful sibling called Gemini Omni Flash

Related reading

Gemini Omni is a new family of AI models meant to ‘create anything’

Google unveils Gemini Omni 'any-to-any' AI model: what enterprises should know

Gemini 'Omni' Will Generate Media From Any Input, Starting With Video

Google Unveils Gemini Omni—A Next-Gen AI Video Builder That Can 'Simulate the…

Introducing Gemini Omni

Google's Nano Banana just got a powerful sibling called Gemini Omni Flash

Other newsrooms on this story