Veo logo
Veo
  • Features
  • Pricing
Veo logo
Veo

A controllable multi-modal AI video platform

X (Twitter)X (Twitter)DiscordYouTubeYouTubeEmail
support@veopro.netOfficial X: @veonano
Product
  • Features
  • Pricing
  • FAQ
  • AI Tools
  • Answer Guides
Resources
  • Blog
  • Changelog
  • Roadmap
Company
  • About
  • Contact
  • Gallery
Legal
  • Cookie Policy
  • Privacy Policy
  • Terms of Service
© 2026 Veo All Rights Reserved.
Dang.ai
Veo 3.1Google DeepMind

Break the silence with Veo 3.1

Upload multiple reference images to guide characters, objects, and style. Generate vertical outputs for mobile-first storytelling.

8s

Max clip duration

3

Reference images

48kHz

Audio sample rate

24fps

Frame rate

Try VeoSee Pricing
720p & 1080p16:9 · 9:16SynthID watermarked$0.30/s via Seedance

Reference images

Character and style direction

Use multi-image references to direct scene consistency across characters, props, and tone.

Native Audio Generation

Video that sounds as real as it looks

Veo 3.1 does not bolt audio on after generation. Dialogue, music, ambient sound effects, and environmental layers are all synthesised in a single unified pass — timed at the model level with 10ms audio-video synchronisation precision. The result is sound that feels physically present in the scene, not layered on top of it.

48kHz stereoAAC 192kbps10ms AV syncDialogue generation

Dialogue & Voice

Realistic speech with proper lip sync and intonation.

Music & Score

Generative background music matched to scene mood.

Ambient SFX

Environmental soundscapes layered with precise timing.

10ms Sync

Industry-leading audio-to-video synchronisation precision.

Audio timeline — Scene 01

48kHz · Stereo · AAC 192kbps

LIVE
Generation Modes

Three ways to create

Choose the creation path that fits your workflow — start from pure text, anchor with a reference image, or interpolate between two keyframes.

Generate from a text prompt

Describe your scene in natural language. Veo 3.1 interprets camera direction, motion style, character behaviour, and atmosphere from a single prompt — producing a complete 8-second clip with native audio in one pass.

No image required8s outputNative audio
Try this mode
Reference Images

Up to 3 reference images

Lock character identity, visual style, and environment simultaneously. The more references you provide, the tighter the consistency across every frame.

Ref 1
Video output

Single reference

Lock a character or object. The model anchors identity and appearance across the full clip.

  • Works with photographs, illustrations, and renders
  • Recognised across all 8 seconds of output
  • Compatible with all generation modes

Veo 3.1 speaks for itself.

Prompt-driven examples inspired by the same storytelling style shown on the Gemini Veo experience.

Example Prompt

Moonlit forest follow shot

A follow shot of a wise old owl above moonlit clouds, circling a forest clearing, then diving beside a nervous badger on a quiet path. Include layered ambient audio.

Result: coherent camera motion, atmospheric woodland tone, and dialogue-friendly scene continuity.

Example Prompt

Opera cat with full orchestra

A cat singing opera with full orchestra, expressive close-up performance, dramatic lighting, and playful emotional beats.

Result: stylized, meme-ready musical motion with character focus and synchronized energy.

Example Prompt

Elderly sailor spaghetti sequence

Eye-level medium shot of an elderly sailor by the pier, lifting spaghetti from a white ceramic plate in warm natural light with subtle ocean ambience.

Result: cinematic realism, consistent framing, and strong continuity across multi-step action.

Dream it. Describe it. Done.

For Exploring

Play with styles, animate characters, and combine ideas you normally could not test quickly.

Model Comparison

Learn more about our Veo Models

Common specs

Duration8s per clip
Chain to60s+
FPS24
Resolutions720p · 1080p
Ratios16:9 · 9:16
Gen time150–180s
WatermarkSynthID

Veo 3.1 Fast

Google AI Pro

Create videos with sound using a model optimized for speed while keeping high visual quality.

  • Create 8-second videos
  • High quality optimized for speed
  • Native audio generation
  • Turn a photo into a video
Use Veo 3.1 Fast in Create

SynthID watermarking — AI generation transparency

Every clip produced by Veo 3.1 carries an invisible SynthID watermark embedded by Google DeepMind. The watermark is imperceptible to viewers but machine-readable, enabling provenance verification at any point in the content lifecycle.

Google AI Safety

Frequently asked questions

Veo 3.1 Pro — Available now

Start generating with Google's most advanced video model

Native audio. Up to 3 reference images. First + last frame mode. 1080p output. Every generation ready to publish.

Try VeoSee Pricing