This is a submission for the Google I/O 2026 Challenge: Explore Google I/O 2026

At Google I/O 2026, Google announced Gemini Omni: a unified multimodal model that generates ~10-second video clips with synchronized audio from text, image, and audio inputs.

Every tech company has a video generation model now. That's not the story.

The story is conversational editing — and if you actually sit with what it means, it changes how you think about the entire creative workflow.

What Conversational Editing Is