Google Launches Gemini Omni: A Revolutionary Multimodal AI for Video Creation
Gemini Omni: How To Turn Image, Text, Video And Audio Into Single Output

Image: Ndtv
At Google I/O 2026, Google introduced Gemini Omni, a multimodal AI model that allows users to create and edit videos using text, images, audio, and video prompts. The model's standout feature is its ability to edit videos through conversational commands, making video creation more intuitive and accessible.
- 01Gemini Omni can generate videos from various inputs including photos, drawings, and voice references.
- 02The first version, Gemini Omni Flash, is being rolled out through the Gemini app, Google Flow, and YouTube Shorts.
- 03Users can edit videos by simply describing changes in plain language, enhancing user interaction.
- 04Every AI-generated video includes an invisible SynthID watermark for verification and transparency.
- 05Currently, only voice references are supported for audio inputs, with plans for future enhancements.
Advertisement
In-Article Ad
At the Google I/O 2026 event, Google unveiled Gemini Omni, a groundbreaking multimodal AI model that merges text, images, audio, and video into cohesive video outputs. This innovative tool allows users to upload various media types, including photos, drawings, and existing videos, and combine them into a single video. A notable feature of Gemini Omni is its conversational editing capability, enabling users to describe desired changes in simple language, such as 'turn a mirror into liquid' or 'change the background to a futuristic city.' This user-friendly approach aims to make video creation more accessible to everyone. The first iteration, named Gemini Omni Flash, is currently being rolled out through platforms like the Gemini app, Google Flow, and YouTube Shorts. Additionally, every video produced by Gemini Omni is embedded with an invisible SynthID watermark for verification purposes. While the initial version supports only voice references for audio input, Google has plans to expand audio capabilities in future updates, further enhancing the platform's versatility.
Advertisement
In-Article Ad
Advertisement
In-Article Ad
Reader Poll
How do you feel about AI tools for video creation?
Connecting to poll...
More about Google

Google presenta su nuevo chat de IA para crear documentos y más en segundos
La Republica • May 22, 2026

Trump Postpones Executive Order on AI Amid Competitiveness Concerns
Upi • May 22, 2026

Google Unveils $15 Billion Investment in Missouri Data Center Expansion with Energy Commitments
Power Magazine • May 21, 2026
Read the original article
Visit the source for the complete story.



