Technology / Mon, 25 May 2026 Adgully.com

Google unveils Gemini Omni, a multimodal AI model for advanced video creation and editing

Adgully Bureau | 11 hours ago Views: 241Google has officially launched Gemini Omni, a next-generation multimodal artificial intelligence model engineered for advanced video creation and editing. The model allows users to manipulate and generate video content by seamlessly combining inputs across text, images, video clips, and audio. The debut model of the new lineup, Gemini Omni Flash, is currently rolling out across the Gemini app, Google Flow, and YouTube Shorts. To address safety and authenticity concerns regarding AI-generated media, Google confirmed that all video content produced via Gemini Omni will automatically embed SynthID digital watermarks. Audiences and platforms can check the provenance of Omni-generated videos directly through the Gemini app, Gemini in Chrome, and Google Search.

Adgully Bureau | 11 hours ago Views: 241

Google has officially launched Gemini Omni, a next-generation multimodal artificial intelligence model engineered for advanced video creation and editing. The model allows users to manipulate and generate video content by seamlessly combining inputs across text, images, video clips, and audio.

The debut model of the new lineup, Gemini Omni Flash, is currently rolling out across the Gemini app, Google Flow, and YouTube Shorts.

A core capability of the Gemini Omni architecture is its conversational editing engine. According to Google, the model allows creators to modify videos using sequential, natural language prompts. The system retains context from previous instructions, ensuring structural continuity across scenes, recurring characters, and complex visual elements throughout the editing timeline.

The model also supports highly flexible generative inputs, allowing users to direct video production using:

Text prompts and structural scripts

Images, hand-drawn sketches, and illustrations

Existing video clips as style or structural references

Direct voice inputs (with compatibility for additional audio input formats scheduled for a later release)

By merging Gemini’s advanced reasoning capabilities with dedicated video generation functions, the model produces scenes informed by real-world concepts, including physics, historical accuracy, and persistent visual consistency.

Alongside the core model, Google introduced a personalized avatar feature. This tool enables creators to generate synthetic video content utilizing a highly precise digital version of themselves, synthesized alongside a clone of their own voice.

To address safety and authenticity concerns regarding AI-generated media, Google confirmed that all video content produced via Gemini Omni will automatically embed SynthID digital watermarks. These imperceptible metadata markers ensure that generated content remains verifiable. Audiences and platforms can check the provenance of Omni-generated videos directly through the Gemini app, Gemini in Chrome, and Google Search.

The launch represents a significant expansion of Google’s creative AI ecosystem, which previously focused heavily on specialized image-generation and static editing tools like Nano Banana.