Will video generation work in the Gemini mobile app?

Yes. In Seedance, you can generate and manage videos on mobile as well. Use the same create flow and model selection from your phone browser.

Who can access video generation with Veo models?

Access depends on plan tier. Veo 3.1 Fast maps to Pro-level access, and Veo 3.1 Pro is targeted for higher-tier rollout as it becomes available.

How do you approach safety for AI video generation?

We apply policy controls, moderation checks, and visible AI-generation labeling workflows. Final outputs still depend on user prompts and should be used responsibly.

Veo 3.1 · Google DeepMind

Break the silence with Veo 3.1

Upload multiple reference images to guide characters, objects, and style. Generate vertical outputs for mobile-first storytelling.

Max clip duration: 8s
Reference images: up to 3
Audio sample rate: 48kHz
Frame rate: 24fps

Try Veo · See Pricing

720p & 1080p · 16:9 · 9:16 · SynthID watermarked · $0.30/s via Seedance

Reference images

Character and style direction

Use multi-image references to direct scene consistency across characters, props, and tone.

Native Audio Generation

Video that sounds as real as it looks

Veo 3.1 does not bolt audio on after generation. Dialogue, music, ambient sound effects, and environmental layers are all synthesised in a single unified pass — timed at the model level with 10ms audio-video synchronisation precision. The result is sound that feels physically present in the scene, not layered on top of it.

48kHz stereo · AAC 192kbps · 10ms AV sync · Dialogue generation
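To put the 10ms figure in context, the arithmetic below converts it into video frames and audio samples using the 24fps and 48kHz values quoted on this page. It is a back-of-the-envelope sketch; the variable names are illustrative and not part of any Veo interface.

```python
# Back-of-the-envelope check using figures quoted on this page.
FRAME_RATE_FPS = 24        # frames per second
SAMPLE_RATE_HZ = 48_000    # audio samples per second
SYNC_PRECISION_MS = 10     # quoted audio-video sync precision

# Duration of a single video frame in milliseconds.
frame_duration_ms = 1000 / FRAME_RATE_FPS

# The sync tolerance expressed as a fraction of one frame.
sync_in_frames = SYNC_PRECISION_MS / frame_duration_ms

# The sync tolerance expressed as a count of audio samples.
sync_in_samples = SAMPLE_RATE_HZ * SYNC_PRECISION_MS // 1000

print(f"One frame lasts {frame_duration_ms:.2f} ms")
print(f"10 ms is {sync_in_frames:.2f} of a frame")
print(f"10 ms spans {sync_in_samples} audio samples")
```

At 24fps a frame lasts roughly 41.7 ms, so a 10 ms tolerance is about a quarter of a frame, or 480 samples at 48kHz.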

Dialogue & Voice

Realistic speech with proper lip sync and intonation.

Music & Score

Generative background music matched to scene mood.

Ambient SFX

Environmental soundscapes layered with precise timing.

10ms Sync

Industry-leading audio-to-video synchronisation precision.

Generation Modes

Three ways to create

Choose the creation path that fits your workflow — start from pure text, anchor with a reference image, or interpolate between two keyframes.

Generate from a text prompt

Describe your scene in natural language. Veo 3.1 interprets camera direction, motion style, character behaviour, and atmosphere from a single prompt — producing a complete 8-second clip with native audio in one pass.

No image required · 8s output · Native audio

Reference Images

Up to 3 reference images

Lock character identity, visual style, and environment simultaneously. The more references you provide, the tighter the consistency across every frame.

Single reference

Lock a character or object. The model anchors identity and appearance across the full clip.

Works with photographs, illustrations, and renders
Recognised across all 8 seconds of output
Compatible with all generation modes
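The limits described on this page (up to 3 reference images, 720p or 1080p, 16:9 or 9:16) can be checked before a job is submitted. The sketch below is one way to do that under stated assumptions: `GenerationRequest` and its fields are hypothetical names invented for illustration, not a real Veo or Seedance API, and requiring a non-empty prompt is also an assumption.

```python
from dataclasses import dataclass, field

# Limits taken from this page; everything else here is hypothetical.
MAX_REFERENCE_IMAGES = 3
ALLOWED_RESOLUTIONS = ("720p", "1080p")
ALLOWED_ASPECT_RATIOS = ("16:9", "9:16")


@dataclass
class GenerationRequest:
    """Hypothetical request shape, not a real Veo or Seedance API."""
    prompt: str
    reference_images: list = field(default_factory=list)
    resolution: str = "1080p"
    aspect_ratio: str = "16:9"

    def validate(self):
        """Return a list of problems; an empty list means the limits are met."""
        problems = []
        if not self.prompt.strip():
            problems.append("prompt is empty")  # assumed requirement
        if len(self.reference_images) > MAX_REFERENCE_IMAGES:
            problems.append(
                f"at most {MAX_REFERENCE_IMAGES} reference images are supported"
            )
        if self.resolution not in ALLOWED_RESOLUTIONS:
            problems.append(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
        if self.aspect_ratio not in ALLOWED_ASPECT_RATIOS:
            problems.append(f"aspect ratio must be one of {ALLOWED_ASPECT_RATIOS}")
        return problems


ok = GenerationRequest(
    prompt="A follow shot of a wise old owl above moonlit clouds",
    reference_images=["owl.png", "badger.png"],
    aspect_ratio="9:16",
)
too_many = GenerationRequest(
    prompt="A cat singing opera with full orchestra",
    reference_images=["a.png", "b.png", "c.png", "d.png"],
)

print(ok.validate())        # no problems reported
print(too_many.validate())  # one problem: too many reference images
```

Returning a list of problems rather than raising on the first one lets a caller show every issue at once.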

Veo 3.1 speaks for itself.

Prompt-driven examples inspired by the storytelling style shown in the Gemini Veo experience.

Example Prompt

Moonlit forest follow shot

A follow shot of a wise old owl above moonlit clouds, circling a forest clearing, then diving beside a nervous badger on a quiet path. Include layered ambient audio.

Result: coherent camera motion, atmospheric woodland tone, and dialogue-friendly scene continuity.

Example Prompt

Opera cat with full orchestra

A cat singing opera with full orchestra, expressive close-up performance, dramatic lighting, and playful emotional beats.

Result: stylized, meme-ready musical motion with character focus and synchronized energy.

Example Prompt

Elderly sailor spaghetti sequence

Eye-level medium shot of an elderly sailor by the pier, lifting spaghetti from a white ceramic plate in warm natural light with subtle ocean ambience.

Result: cinematic realism, consistent framing, and strong continuity across multi-step action.

Dream it. Describe it. Done.

For Exploring

Play with styles, animate characters, and combine ideas you normally could not test quickly.

Model Comparison

Learn more about our Veo Models

Common specs

Duration: 8s per clip
Chain to: 60s+
FPS: 24
Resolutions: 720p · 1080p
Ratios: 16:9 · 9:16
Gen time: 150–180s
Watermark: SynthID
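These specs imply some simple planning arithmetic for chained videos. The sketch below estimates clip count, generation time, and cost for a target length. It assumes that clips are generated one after another, that chained clips do not overlap, and that the $0.30/s rate listed on this page applies to every second of output; none of these is confirmed here, and `plan_chain` is an illustrative helper, not a product feature.

```python
import math

CLIP_SECONDS = 8                 # duration per clip
GEN_TIME_RANGE_S = (150, 180)    # quoted generation time per clip
PRICE_PER_SECOND_USD = 0.30      # rate listed on this page (assumed per output second)


def plan_chain(target_seconds):
    """Estimate clips, wait time, and cost for a chained video (assumptions above)."""
    clips = math.ceil(target_seconds / CLIP_SECONDS)
    output_seconds = clips * CLIP_SECONDS
    return {
        "clips": clips,
        "output_seconds": output_seconds,
        # Sequential generation assumed, so per-clip times simply add up.
        "gen_minutes_min": clips * GEN_TIME_RANGE_S[0] / 60,
        "gen_minutes_max": clips * GEN_TIME_RANGE_S[1] / 60,
        "cost_usd": round(output_seconds * PRICE_PER_SECOND_USD, 2),
    }


print(plan_chain(60))
```

Under these assumptions, a 60-second target needs 8 clips (64 seconds of output), takes 20 to 24 minutes to generate, and costs $19.20.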

Veo 3.1 Fast

Google AI Pro

Create videos with sound using a model optimized for speed while keeping high visual quality.

Create 8-second videos
High quality optimized for speed
Native audio generation
Turn a photo into a video

Use Veo 3.1 Fast in Create

SynthID watermarking — AI generation transparency

Every clip produced by Veo 3.1 carries an invisible SynthID watermark embedded by Google DeepMind. The watermark is imperceptible to viewers but machine-readable, enabling provenance verification at any point in the content lifecycle.

Google AI Safety

Frequently asked questions

Veo 3.1 Pro — Available now

Start generating with Google's most advanced video model

Native audio. Up to 3 reference images. First + last frame mode. 1080p output. Every generation ready to publish.
