This is a submission for the Google I/O 2026 Challenge: Explore Google I/O 2026
At Google I/O 2026, Google announced Gemini Omni: a unified multimodal model that generates ~10-second video clips with synchronized audio from text, image, and audio inputs.
Every tech company has a video generation model now. That's not the story.
The story is conversational editing — and if you actually sit with what it means, it changes how you think about the entire creative workflow.
What Conversational Editing Is











