How to Use Gemini Omni in 2026: A Step-by-Step Guide for the Gemini App and Google Flow
A practical 2026 walkthrough for using Gemini Omni — including how to enable the model, prompt for video, layer references, edit in chat, and export with the SynthID watermark.
What you need before you start
Gemini Omni Flash is live as of Google I/O 2026 (May 20, 2026). Before you generate your first video, make sure you have:
- A Google Account in good standing, age 18 or older (Omni Flash has age gating).
- One of the following subscriptions if you want to use the Gemini app or Google Flow:
- Google AI Plus
- Google AI Pro
- Google AI Ultra
- If you only have a free Google account, you can still use Omni Flash inside the YouTube Shorts app or the YouTube Create app, which now expose Omni for free.
A modern browser (Chrome, Edge, Safari, Firefox) and a stable connection are enough — nothing else needs installing. Google Flow runs in the browser, and the Gemini app is available on web, Android and iOS.
Step 1 · Open the right surface
Where you open Omni decides the workflow. The three production paths:
- Gemini app (gemini.google.com or mobile app). Best for fast, conversational generation and quick edits — ask in chat, get a clip back, iterate by chat.
- Google Flow (labs.google/flow). Best for filmmakers, agencies and anyone storyboarding multiple clips. Flow exposes the model with explicit credit costs, model picker and scene-level controls.
- YouTube Shorts / YouTube Create App. Best for free, casual social creation. Omni shows up as a generation option inside the creator surface.
Pick one. The exact same model — Gemini Omni Flash — powers all three; only the UI differs.
Step 2 · Confirm Omni is the active model
Inside the Gemini app, look in the prompt-bar settings: a model picker should let you choose Gemini Omni (currently equivalent to Omni Flash). If you don’t see it, your account region or subscription tier may not yet have it enabled — try logging out and back in, or open Google Flow where the model picker is more prominent.
Inside Google Flow, open the prompt panel and confirm:
- Active model: Gemini Omni Flash
- Aspect ratio: 16:9, 9:16 or 1:1
- Duration: typically 5 / 8 / 10 seconds per generation
- Credit cost shown: always live, and may change between rollouts
Google’s own Flow Help docs explicitly recommend checking the active model and credit cost in settings before each generation — limits and pricing can move.
Step 3 · Write a brief, not a sentence
Omni is multimodal in both input and output, so the strongest prompts read like a one-paragraph creative brief, not a tagline. A reliable template:
You are [PERSONA]. Generate a [DURATION] [ASPECT-RATIO] clip of [SUBJECT] [ACTION] in [SETTING]. Camera: [lens / movement / framing]. Lighting: [direction / colour temperature / contrast]. Audio: [ambient bed], [music style], [key sound cue at timecode], dialogue: [short line]. Reference: [attach image / video / audio].
Concrete example:
You are a luxury cinematographer in the vein of Wong Kar-wai. Generate a 10-second 16:9 clip of a matte-black wireless headphone resting on a textured concrete plinth. Camera: slow 35mm tracking shot camera-left to camera-right, shallow depth of field. Lighting: soft golden-hour back-light, warm 3200K, low contrast. Audio: low atmospheric drone with a single soft bell strike at 0:07. No dialogue. Reference image: attached product photo for exact colour, stitching and brand mark.
The longer prompt is not extra work — it actively reduces the number of regenerations you need.
Step 4 · Layer your references aggressively
The single biggest 2026 unlock is that Omni accepts images, video clips and audio tracks in the same instruction. Use that:
- Character lock: drop a reference photo so the protagonist stays consistent across clips.
- Style lock: drop a frame from an existing piece you want to imitate.
- Motion lock: drop a short reference video to mimic a camera move.
- Beat lock: drop a music track and ask Omni to cut visuals to the beat.
A 30-word prompt with three solid references almost always outperforms a 300-word prompt with no references.
Step 5 · Generate, then edit in chat (don’t re-roll)
Once your first clip comes back, resist the urge to immediately re-run with tweaked text. Omni’s headline workflow is conversational editing. Examples that work:
“Swap the watch on the model’s wrist for a brushed-silver chronograph. Keep all other framing, lighting and audio exactly the same.”
“Slow the camera move by 30% and warm the colour temperature by 200 K.”
“Remove the bell strike at 0:07 and add a soft ambient swell from 0:08 to 0:10 instead.”
“Change the background to a futuristic cityscape but keep the subject identical.”
Conversational edits are cheaper than fresh generations, they protect the audio bed you already love, and they preserve character identity. Treat your first generation as a base; iterate from there.
Step 6 · Chain clips for longer sequences
Omni Flash caps individual clips at 10 seconds. To produce a longer piece, generate multiple omni-clips, attach the previous clip as reference and instruct Omni to maintain continuity:
“Generate a 10-second clip that continues from the attached clip. Keep the subject, wardrobe and lighting identical. Hold the audio bed. Begin the camera move where the previous clip ended.”
Google Flow makes this easier by exposing scene-level chaining; in the Gemini app you can do the same manually using the attached-reference pattern.
Step 7 · Export, watermark and provenance
Every Omni export carries:
- SynthID: an imperceptible Google watermark embedded in pixels and audio.
- C2PA Content Credentials: open-standard metadata identifying the clip as Gemini-generated.
You can verify any Omni output via the Gemini app, with verification rolling out to Chrome and Google Search next. From a creator’s point of view there’s nothing extra to do — the marks are added automatically — but it’s worth knowing they’re there, especially if you license clips to clients who care about AI-content disclosure.
Step 8 · Set up your AI Avatar (optional)
If you want Gemini Omni to consistently render you across multiple videos, set up an AI Avatar once:
- Open the Avatar section in the Gemini app or Google Flow.
- Follow the on-screen prompts for facial and voice capture.
- Confirm the consent flow — only you can use your avatar.
- From then on, prompts can reference “me” or “the avatar” without re-uploading photos.
Google has explicitly said it’s still tightening the avatar safeguards, so expect a few country and feature restrictions during the rollout window.
Common mistakes to avoid
- Skipping references. Text-only prompts are the fastest way to waste credits.
- Re-rolling instead of editing. Each fresh generation costs more and discards a working audio bed.
- Ignoring duration. A 10-second clip is not always the right answer — a tight 5-second product hero is cheaper, faster and often stronger.
- Forgetting the active model. If you switch to Veo for one shot and back to Omni for the next, your style consistency will drift.
Bottom line
Using Gemini Omni well in 2026 is less about discovering hidden settings and more about treating the model like a director would treat a crew: write a clear brief, give it strong references, ask for incremental tweaks instead of full re-shoots, and chain clips to grow the runtime. Do those four things consistently and Omni Flash quickly becomes the fastest way to ship publishable video on the web.