Google just announced Gemini Omni, a new AI model that it claims can “create anything from any input,” at its annual I/O developer conference on Tuesday. The company said the model is starting off with just video generation and editing capabilities. On its website, Google says to think of it like “Nano Banana — but for video,” referencing the company’s image model that came out last year. Gemini Omni Flash, the first model in the Omni family, can edit existing videos and generate new ones using plain-language prompts. It’s already available to try on the Gemini app, Google Flow AI studio, and YouTube Shorts. “With Omni, you can combine images, audio, video and text as input and generate high-quality videos grounded in Gemini’s real-world knowledge. You can also easily edit your videos through conversation,” wrote Google DeepMind Chief Technology Officer Koray Kavukcuoglu in a blog post.

As with Nano Banana, users can make edits that build off each other through natural conversation. The model is designed to keep characters and environments consistent across edits and use its knowledge of the real world including history, biology, physics, and narrative logic to make clips that actually make sense.