HR Limon
Background

The Ultimate VEO 3 Prompt Playbook: High-Impact Prompt Formulas for Viral Videos!

ByHabibur RahmanHabibur Rahman

A passionate SEO professional with 8+ years of experience helping businesses grow their online presence.

October 15, 2025

In today’s fast-paced digital world, content creation is evolving at lightning speed. AI-driven video tools are opening up entirely new possibilities for storytelling, empowering creators across the globe to craft high-quality videos without the need for expensive cameras or production equipment. Whether you're a filmmaker, digital marketer, YouTuber, or just starting out, Google VEO 3 is a game-changer for creating cinematic videos using nothing but text prompts.

VEO 3 isn’t just another text-to-video tool. It’s Google’s most advanced generative video model to date, capable of producing hyper-realistic movement, dynamic camera angles, immersive environments, and even synced voice audio — all from a simple description. Gone are the days when you needed a crew and a studio to produce professional-looking content.

However, as with any powerful tool, the quality of the output depends on how well you use it. That’s why I’ve created this comprehensive guide to help you master Google VEO 3 — even if you’ve never used an AI video tool before. Whether you're looking to create:

  • Cinematic short films
  • Vlog-style talking characters
  • Engaging social media content
  • Stunning ad creatives
  • Fantasy-inspired scenes
  • Or even animated loops for music videos or ASMR...

This guide will walk you through every step of the process — from setting up your account to writing advanced prompts that bring your creative vision to life.

What You’ll Learn in This Guide:

Google VEO 3 is a powerful tool that opens up endless possibilities for creators, and this guide will walk you through everything you need to know to unlock its full potential. Here’s what you can expect to learn:

  • How to Access VEO 3 Through Google Flow (And Get Free Credits)
  • The Key Differences Between VEO 2 and VEO 3
  • What Makes Flow the Better Option for Serious Creators
  • How to Write Structured Prompts That Get Great Results
  • How to Generate Realistic Dialogue, Sound, Music, and Motion
  • How to Keep Characters and Styles Consistent Across Multiple Scenes
  • How to Build Content for TikTok, YouTube Shorts, or Full-Length Films
  • Prompt Formulas, Cheatsheets, and Real Examples Included

Whether you’re experimenting with AI video for the first time, building content for clients, or launching a full YouTube channel, this guide will be your shortcut to success with Google VEO 3.

1. What is Google VEO 3?

Imagine being able to bring your wildest ideas to life in a cinematic short film just by describing them. With Google VEO 3, that’s no longer a dream — it’s a reality.

VEO 3 is Google's most powerful text-to-video AI model, designed to turn your written prompts into hyper-realistic, cinematic-quality videos. Whether you're envisioning a knight battling a snowstorm, a cat riding a skateboard, or a robot walking through a futuristic cyberpunk city, VEO 3 can transform your ideas into video — complete with dynamic lighting, camera movement, detailed environments, and even synchronized voice dialogue.

What Makes VEO 3 So Powerful?

Here are some of the standout features that set VEO 3 apart from previous video models and its competitors:

  • Realistic Motion & Cinematic: Camera Work VEO 3 supports dynamic camera angles, such as panning, zooming, and bird’s-eye views. The movement feels incredibly natural — like something straight out of a real movie. Whether it’s a close-up shot or a sweeping aerial view, the camera transitions smoothly to make your scenes feel lifelike.
  • Built-in Audio Generation: Unlike older models, VEO 3 doesn’t just generate visuals. It also creates realistic character dialogue, ambient sounds, and even background music, all synced to the action. Say goodbye to the need for separate audio tools (unless you’re using reference images, which we’ll get into later).
  • Photorealistic Visuals: The quality of visuals in VEO 3 is unparalleled. Characters, objects, and environments are rendered with stunning photorealism. Whether you're creating historical scenes or natural landscapes, VEO 3 beautifully handles lighting, weather effects, and intricate texture details, making every frame come alive.
  • Genre & Style Awareness: Whether you're aiming for a Pixar-style animation, a gritty war film, or a fantasy short, VEO 3 can adapt to the style and genre you envision. Just describe the aesthetic you’re after, and VEO 3 will deliver, ensuring that your video matches the tone and atmosphere you want.

VEO 3 vs VEO 2 — What’s the Difference?

When comparing VEO 2 to the current model VEO 3, it’s clear that VEO 3 offers several significant upgrades that make it the go-to choice for serious creators. Here’s a quick look at how the two versions stack up:

Feature VEO 2 VEO 3 (Current Model)
Video Resolution 720p (only) 720p + Upscale to 1080p
Audio Support ❌ None ✅ Dialogue + Ambient Sound
Character Dialogue
Prompt Accuracy Basic Advanced
Credit Cost (per gen) ~10 credits (Fast) ~100 credits (High Quality) or 20 credits VEO 3 Fast Gen
Use Case Suitability Demos, tests Social content, filmmaking

Key Differences Explained:

  • Video Resolution: While VEO 2 is limited to 720p resolution, VEO 3 allows you to upscale to 1080p, delivering crisper, high-definition visuals perfect for social media or filmmaking.
  • Audio Support: VEO 2 doesn’t include audio generation, but VEO 3 introduces built-in audio capabilities, including character dialogue and ambient sound, adding an extra layer of realism to your videos.
  • Character Dialogue: VEO 2 lacks character dialogue features, whereas VEO 3 generates realistic character dialogue, making it ideal for storytelling and scene-based content.
  • Prompt Accuracy: VEO 3 offers advanced prompt accuracy, ensuring that your videos match your vision more closely compared to VEO 2’s basic prompt system.
  • Credit Cost: VEO 3 operates on a tiered credit system. Fast generation costs 20 credits, while high-quality videos require around 100 credits, giving you more control over the quality and speed of your video generation.
  • Use Case Suitability: VEO 2 is mostly suited for basic demos and tests, while VEO 3 is geared towards social content, product mockups, short films, and professional-quality filmmaking.

Limitations of VEO 3:

While VEO 3 is a remarkable tool, it’s still evolving. Some limitations include:

  • Occasional Subtitle Overlays: Generated videos may sometimes include subtitles, which can be fixed with simple editing.
  • Character Consistency: Ensuring consistency in character appearance can be tricky, but with the right techniques, this can be managed.
  • Video Game-Like Aesthetics: Certain fantasy or high-energy scenes might have a video game-like aesthetic that might need additional refinement.
  • Short Video Duration: Videos typically last under 10 seconds unless extended using editing tricks.

What Can You Use VEO 3 For?

With its powerful features, VEO 3 is suitable for creating a wide variety of content, including:

  • AI Short Films
  • Character Vlogs (Bigfoot, Stormtrooper, Anime characters)
  • TikTok/YouTube Shorts
  • Product Ad Mockups
  • Fantasy World-Building
  • POV Narratives
  • ASMR Loops and Storytelling Scenes

2. How to Access VEO 3?

Google VEO 3, one of the most powerful AI video tools, isn’t available on a standalone platform. Instead, you access it through Google’s AI ecosystem, specifically via Gemini or Google Flow. While both have their benefits, Google Flow is the recommended option if you want maximum control, creative freedom, and the ability to build scenes more effectively.

Here’s how you can get access to Google VEO 3:

Option 1: Access via Gemini (Quick & Simple)

If you’re already using Gemini Advanced, you might have access to VEO 3 through Gemini Labs.

Steps to Access VEO 3 via Gemini:

  1. Visit Google’s VEO 3 page
  2. Click on "Try in Gemini".

Limitations:

  • Gemini offers limited functionality with preset demos and simplified prompt formats.
  • There’s no timeline control or project management options.
  • You can’t use camera or model adjustments.
  • Best for quick tests rather than full creative projects.

Free Daily Generations in Gemini:

  • You can generate up to 3 videos per day at no extra cost.
  • Great for testing ideas and generating quick visuals.

For more serious creators, Google Flow is where the real magic happens. It offers a visual editor and creative suite, allowing you to have complete control over your project.

With Flow, you can:

  • Write advanced prompts.
  • Choose VEO 3 or earlier models.
  • Upload reference images and assets.
  • Organize your projects efficiently.
  • Access Flow TV, which is a hub for inspiration and prompts.

How to Get Started with Google Flow:

  1. Visit Flow: Go to Google Flow's VEO page and click “Try with Flow” to launch the interface.
  2. Create or Sign in with a Google Account: You’ll need a Gmail or Workspace account to sign up.
  3. Start Your Free Trial: Click on “Get Google AI Pro”.
    1. Plan: $19.99/month
    2. Includes 1,000 credits.
    3. First 30 days are free.
  4. Open Your Dashboard: Once logged in, click “Create with Flow” → “New Project” to begin your first video project.

Gemini vs. Flow — What’s the Difference?

Here’s a breakdown of the differences between Gemini and Flow:

Feature Gemini Google Flow (Recommended)
Prompt Control Basic Full scene control
Use of VEO 3 Limited access Full access
Reference Images ❌ Not supported ✅ Upload + animate
Scene Builder ❌ Not available ✅ Interactive workspace
Dialogue Generation Basic ✅ Detailed character dialogue
Access to Prompt ❌ No access ✅ Built-in via Flow TV

3. Why Use Google Flow?

Google Flow is your ultimate creative control center for professional-level VEO 3 video creation. While Gemini is great for quick demos and beginner experimentation, Google Flow is where the magic happens for serious creators looking to make high-quality, cinematic videos.

Think of Flow as your personal AI production studio — a visual workspace where you can organize ideas, generate videos with precision, and control every aspect of the creative process, from camera motion to dialogue to sound. If you're planning to create:

  • Multi-scene video stories
  • Consistent characters
  • Product ads or vertical reels
  • YouTube videos or TikToks
  • High-quality cinematic content

Then Google Flow is the platform you need.

What Makes Flow So Powerful?

Google Flow offers several features that empower creators to produce professional-grade content with full creative freedom:

  • Full Prompt Control Unlike GeminiFlow allows you to write complex prompts that include detailed instructions for lightingenvironmentcharacter motiongenrecamera angles, and more. You have the flexibility to be as specific as you want, ensuring your videos match your exact vision.
  • Scene-by-Scene Organization In Flow, you can organize your video into multiple scenes, making it easy to build stories frame by frame. You also have room to test different variations of each scene, giving you the flexibility to tweak and refine until it’s perfect.
  • Upload Reference Images Use the “Frames to Video” feature to animate your own images, control aspect ratios, and preserve character consistency across scenes. You can even use a green screen for smoother transitions and better continuity in your videos.
  • Model Selection Choose between different models depending on your needs:
    • VEO 3 for high-quality, cinematic results with full audio capabilities.
    • Fast mode (VEO 2) for quicker, more credit-efficient tests.
    • Ingredients to Video for multi-element animation (though with limited support).
  • Flow TV – Your Built-In Prompt Playground Flow TV is a prompt inspiration hub where you can browse videos made by other creators. You can even click on “Show Prompt” to see how they generated their content. Categories available in Flow TV include:
    • Short Films
    • Fantasy Scenes
    • Sci-Fi Environments
    • Vlogs & POVs
  • You can also view videos in Grid View for fast browsing and inspiration.
  • Audio Capabilities Flow supports generating:
    • Spoken dialogue (text-to-video).
    • Ambient sounds like wind, footsteps, or nature sounds.
    • Background music which you can customize in your prompt.

A Quick Tour of the Flow Interface

Here’s a breakdown of the main features of the Flow interface and their functions:

Feature Purpose
New Project Create, label, and organize your video scenes.
Prompt Box Write detailed text prompts with full customization.
Settings Panel Choose your model (VEO 2/3), resolution, and output options.
Image Upload Add reference images to use with the Frames-to-Video feature.
Flow TV Browse example videos and view their prompts for inspiration.
Credits Display Shows remaining credits and the cost per video generation.

Why Google Flow is Perfect for Serious Creators?

If you’re looking to have full control over your video projects, manage multi-scene creations, and fine-tune every aspect of your videos, Google Flow is the tool you need. Whether you're working on cinematic short films, consistent character vlogs, or social media content like TikToks or YouTube Shorts, Flow allows you to craft high-quality videos that stand out.

4. Understanding VEO 3 Capabilities

Before diving into the world of Google VEO 3, it’s important to understand its capabilities and limitations. Knowing what VEO 3 can—and can’t—do will help you craft the best prompts from day one.

1. Resolution & Upscaling

  • VEO 3 generates native videos at 720p, but the good news is you can upscale to 1080p using Flow's download options.
  • While the video quality is quite good, be aware that slight graininess can occur in fine details, especially in fast-moving scenes.

2. Video Duration

  • Currently, the maximum length per clip is 8 seconds.
  • For longer videos, you can use features like “Add to Scene” in Flow or stitch multiple clips together externally.
  • This short format is perfect for TikTok-style content, intros, loops, or quick scene snippets.

3. Native Audio Generation

One of VEO 3’s standout features is true audio integration. It can generate:

  • Spoken dialogue (with lip-syncing)
  • Ambient sounds (like wind, traffic, or nature sounds)
  • Music or sound effects

You can also customize the mood-specific audio, such as “soft piano music” or “wind rustling through leaves,” to enhance the emotional impact of your scenes.

4. Cinematic Camera & Motion

VEO 3 supports advanced camera directions, including:

  • Pantiltzoombird’s-eye, and POV (point-of-view) shots.
  • The movement of characters and objects feels incredibly natural due to physics-aware animation, almost like watching real footage.
  • However, the prompt structure is crucial—without specifying how the camera or subjects should move, you may end up with static shots instead of dynamic motion.

5. Genres, Styles & Visual Control

VEO 3 gives you control over the style of your video, allowing you to request:

  • Cinematic film
  • Anime
  • Pixar-3D animation
  • Sci-fi aesthetics, and more!

You can also adjust:

  • Color grading (muted, pastel, gritty)
  • Lighting (harsh moonlight, soft sunlight)
  • Genre conventions (horror, comedy, epic)

To get the best results, ensure you place style-related prompts early in your description for better weighting.

6. Credits & Cost

  • VEO 3 high-quality video generation uses around 100 credits per generation.
  • Flow also offers Fast Mode (VEO 2) for quicker, more credit-efficient tests at approximately 10 credits per generation.
  • Tip: Start with Fast Mode for rough drafts before investing in high-quality VEO 3 outputs.

7. Character Consistency & Limitations

  • While VEO 3 can produce relatively consistent faces and characters, it’s not perfect.
  • Visual consistency improves with techniques like:
    • Detailed character descriptions
    • Reference image uploads (green-screen or Frame-to-Video)
    • Feeding in consistent image assets via Flow
  • Expect some variation—you’ll learn how to manage this in later sections of the guide.

8. Known Limitations

  • Audio sync can sometimes be imperfect.
  • Some fantasy scenes may appear with a video-game-like aesthetic, even with cinematic prompts.
  • Every frame of the video includes an invisible SynthID watermark, which is useful for ethical purposes but may affect commercial use.
  • Non-U.S. users might experience delays in full feature access depending on regional rollouts.

Who Should Use VEO 3?

VEO 3 is ideal for creators who want:

  • Short, visually rich clips with dialogue and ambient sound
  • Cinematic styles with smooth camera movement
  • Quick ads, intros, product promos, or ASMR loops
  • First-person or POV storytelling

If you're creating longer narratives, multi-character scenes, or product demosVEO 3 + Flow will allow you to push your creative limits while staying within production constraints.

Quick Reference Table

Here’s a quick reference guide to help you understand VEO 3's features at a glance:

Feature Support in VEO 3
Resolution 720p native → 1080p upscaling
Max Duration 8 seconds per clip
Audio ✅ Dialogue, ambiance, music
Camera Control ✅ Pan, tilt, zoom, POV angles
Style Flexibility ✅ Cinema, anime, 3D, etc.
Cost (High Quality) ~100 credits per gen
Fast Mode Available ✅ VEO 2 for drafts (~10 credits)
Character Consistency 🟡 Possible with prompt/image tech
Watermarking ✅ SynthID digital watermark

5. Writing Prompts — The Basics

Mastering the Language That Controls the Visuals

In Google VEO 3, a prompt is more than just a description. It’s a creative blueprint — telling the AI what to show, how to shoot it, what mood to create, and even what sounds to include.

If you’ve ever used image generators, you might be familiar with prompts like:

"A dragon in the clouds, cinematic, dark tones."

But video prompts — especially in VEO 3 — require more clarity, more structure, and more direction. Why? Because now, you're not just creating a still image. You're asking the AI to animate, light, direct, and even perform the scene

A Prompt in VEO 3 Does All This

A well-crafted prompt in VEO 3 will:

  • Choose your subject and define their appearance.
  • Direct the camera (angle, motion, zoom).
  • Set the lighting and time of day.
  • Define the environment and mood.
  • Control movement and emotion.
  • Add voice, background sound, and music.

This is why writing a detailed prompt is crucial for generating high-quality videos that match your vision.

Why Weak Prompts Fail?

Let’s take a look at how a weak prompt compares to a strong one:

Basic Prompt:

"A warrior on a battlefield."

While this may generate a scene, the result will likely be stiff or awkward, and it might even look like a video game cutscene.

Here’s how that might look in the video:

Battlefield_Warrior_Video_Ready.mp4

As you can see, it’s animated and looks like a video game.

Better Prompt:

"A wounded warrior kneeling on a scorched battlefield. The camera slowly zooms in from a high angle. Smoke rises in the background. The atmosphere is tense, lit by fading orange twilight. The video has a gritty cinematic war style. The warrior breathes heavily and looks around in pain. The color palette is dark, with muted oranges and blackened grays."

This gives VEO 3 something to work with — characters, emotion, visuals, movement, tone, and environment.

Here’s how this would look in the video:

Video_Ready_Scorched_Battlefield.mp4 

Now, you can clearly see the difference. It’s more realistic and looks like a real movie scene.

Think of a Prompt As:

  • Your script: The foundation of your video.
  • Your director’s vision: What you want the audience to feel and experience.
  • Your cinematographer’s shot list: How each shot should be framed and what movement is needed.
  • Your composer’s mood board: The ambiance, tone, and emotion conveyed through sound.

6. Text to Video Prompt Formula Guide: Build Cinematic Prompts

To master AI filmmaking with Google VEO 3, you need to think like a director. Crafting the right prompts is essential to bringing your creative vision to life. This chapter will guide you through each part of the VEO 3 prompt formula, helping you build cinematic scenes that feel real and immersive.

Formula Reminder:

The basic formula for writing a successful VEO 3 prompt is: 

[CHARACTER] + [ACTION] + [CAMERA ANGLE] + [LIGHTING] + [GENRE/STYLE] + [MOTION] + [ENVIRONMENT] + [COLOR PALETTE]

SECTION 6.1: CHARACTER

What It Does: The character defines your subject. VEO 3 will shape the entire video around the character description you provide.

Prompt Tips:

  • Use distinct physical traits like hair, skin, height, and outfit.
  • Define the expression, vibe, and posture of your character.
  • Optionally, add cultural or historical context for more depth.

Examples:

  • A muscular Viking warrior with a braided beard and fur cloak.
  • A schoolgirl with short blue hair, wearing a Japanese uniform.
  • A robot with glowing blue eyes and a sleek chrome body.
  • An elderly man with sun-weathered skin in desert robes.

SECTION 6.2: ACTION

What It Does: 

The action specifies what the character is doing or how they are positioned in the scene.

Prompt Tips:

  • Think in verbs (e.g., walking, kneeling, holding, shouting).
  • Add emotional context or pacing to create more drama.

Examples:

  • Raises his axe and shouts toward the sky.
  • Drawing a sword while stepping forward.
  • Running across a rooftop under moonlight.
  • Floating weightlessly in space.

SECTION 6.3: CAMERA ANGLE

What It Does:

Camera angle controls where the viewer is watching from. It's what makes a scene cinematic instead of flat.

Common Camera Angles to Use:

Camera Angle Prompt Keyword Use For
Close-up close-up shot of Facial expressions, emotion
Wide shot wide-angle view Full character + setting
Low angle shot from below Power, intimidation
High angle shot from above Vulnerability, smallness
Over-the-shoulder OTS shot of Conversations, POV effect
Bird’s-eye view top-down shot Map-like or godlike views
First-person first-person POV Immersive, vlog-style clips
Dolly/Tracking camera follows from behind Movement-based sequences

Examples:

  • A low-angle shot from behind the Viking, emphasizing his strength.
  • A bird’s-eye view of the battlefield from above.
  • First-person POV running through a rainy alley.
  • Over-the-shoulder shot of a detective looking at a clue.

SECTION 6.4: LIGHTING

What It Does: 

Lighting sets the mood, time of day, and overall tone of your video.

Lighting Types:

  • Soft golden sunlight at dawn.
  • Cold blue moonlight.
  • Harsh industrial fluorescent lighting.
  • Warm candlelight from a fireplace.
  • Flickering neon reflections in puddles.

Examples:

  • Harsh golden sunlight spills through storm clouds.
  • Bright sunlight streams through broken windows.
  • A dimly lit tavern with a single lantern swaying.
  • Lightning flashes illuminate the mountains briefly.

SECTION 6.5: GENRE / STYLE

What It Does: 

Tells VEO 3 the visual tone and cinematic style to follow.

Common Styles:

  • Cinematic film style
  • Fantasy epic style
  • 2D anime with expressive features
  • Pixar-like 3D animation
  • Noir detective film
  • Gritty war drama
  • Retro sci-fi VHS look
  • Documentary realism

Examples:

  • Styled like an epic historical battle sequence.
  • Shot like a post-apocalyptic anime opening.
  • A dramatic war film with handheld-style camera movements.
  • Looks like a 1980s horror flick with grainy effects.

SECTION 6.6: MOTION

What It Does:

Motion adds life to your scene, making it feel more dynamic and real.

Examples of Motion:

  • His fur cloak flutters in the wind.
  • Hair moves slowly in underwater currents.
  • Rain pelts against the glass window.
  • Leaves swirl around their feet.
  • Dust rises as the knight walks forward.

Examples:

  • His fur cloak flutters in the wind.
  • Rain pours down and splashes on the stone floor.
  • Sparks fly as the machine powers up.
  • The wolf lowers its head and snarls.

SECTION 6.7: ENVIRONMENT

What It Does:

The environment builds the world around your character, making it essential for immersion.

Prompt Tips:

  • Describe geography, weather, and structures.
  • Use emotional adjectives (e.g., abandoned, vibrant, haunted).

Examples:

  • A crumbling stone fortress on a snowy cliff.
  • A narrow Tokyo alley glowing with neon signs.
  • A misty forest filled with glowing mushrooms.
  • A battlefield littered with broken armor.

SECTION 6.8: COLOR PALETTE

What It Does: The color palette influences the emotional tone through visuals.

Common Palettes:

  • Warm golden tones (hopeful, nostalgic)
  • Cool blues and grays (cold, futuristic)
  • Desaturated/monochrome (gritty, serious)
  • Pastel colors (dreamy, lighthearted)
  • High contrast neon (cyberpunk, energy)

Examples:

  • Dark earth tones and muted steel with crimson highlights.
  • Rich golden sunlight with red-brown earth tones.
  • Neon pink and purple dominate the cityscape.
  • Muted olive and gray tones give it a war documentary feel.

Final Prompt Example (All Elements Combined):

Now, let’s combine everything into a complete prompt:

“[A young woman in a flowing white dress] [stands barefoot at the edge of a cliff]. [A low-angle shot from behind her with the camera slowly pushing forward]. The golden sunrise casts soft light over the ocean waves below. The scene is styled like an indie drama film. Her dress billows gently in the wind. The cliffs are covered in tall grass and blooming wildflowers. The color palette includes warm yellows, faded greens, and soft peach tones.”

 
Now, as you can see this video looks like it was shot by a pro videographer. This is the power of writing good prompts.

VEO 3 Complete Prompt Formula [Audio & Dialogue Scenes]

Teach Your Characters to Speak and Your World to Sound Alive

With Google VEO 3, you can do more than just generate visuals; you can create fully immersive audio-visual experiences, complete with dialogueambient sounds, and music. However, to truly harness this capability, your prompts need to be structured to communicate exactly what VEO 3 should generate.

Why This Formula Is Different?

VEO 3 can generate dialogue, ambient sounds, and music — but these only work when you're using Text-to-Video (not Frames-to-Video). That means your prompts need to be written in a way that clearly tells VEO 3:

  • Who is speaking
  • What they’re saying
  • How they’re saying it
  • What background audio should exist

Just like directing an actor, you need to tell the AI both the lines and the delivery.

Prompt Formula for Dialogue Scenes

Here’s the basic formula for crafting a dialogue scene in VEO 3:

[CHARACTER] + [ACTION] + [CAMERA ANGLE] + [ENVIRONMENT/LIGHTING] + [DIALOGUE] + [TONE OF VOICE] + [AMBIENT SOUND or MUSIC (Optional)]

Or:

[CHARACTER] + [ACTION] + [CAMERA ANGLE] + [ENVIRONMENT/LIGHTING]. [Character] says in [Tone of Voice] + [Dialogue]

Example Breakdown:

[CHARACTER = Bigfoot, a towering, ancient-looking bipedal ape, with matted, deep forest-green fur and piercing amber eyes] + [Action = walking in a forest holding a selfie stick and talking to the camera] + [CAMERA ANGLE= selfie camera angle shot] + [ENVIRONMENT]/LIGHTING = a dense forest during a severe blizzard, with heavy snowfall reducing visibility to almost zero, wind whipping through the trees, and branches creaking under the weight of accumulating snow]. [Bigfoot] says in [Tone of Voice= A deep, gravely, and highly resonant Scottish male voice] + [Dialogue Guys, It's too damn cold today and my legs are freezing, honestly]

Bigfoot, a towering, ancient-looking bipedal ape, with matted, deep forest-green fur and piercing amber eyes walking in a forest holding a selfie stick and talking to the camera selfie camera angle shot a dense forest during a severe blizzard, with heavy snowfall reducing visibility to almost zero, wind whipping through the trees, and branches creaking under the weight of accumulating snow. Bigfoot says in a deep, gravely, and highly resonant Scottish male voice. Guys, It's too damn cold today and my legs are freezing, honestly.

Video: Bigfoot_a_towering_202506301544_otxha.mp4

Now, let’s change the character, voice type and the scene.

Bigfoot, towering, shaggy dark reddish-brown fur, primate-like face, intelligent dark eyes. running very fast through the forest while holding a selfie stick, looking panicked and slightly out of breath, selfie camera angle shot, shaky and fast-paced, in a dense temperate rainforest, with blurred trees rushing past, A hiker is chasing him with a Gun in his hand. Bigfoot says in a shakily, resonant, and slightly gruff British male voice: Guys, they found me and they are gonna kill me. Shitting in their tent was a bad idea.

Video = Bigfoot_towering_shaggy_202506301605_1yaox.mp4

A big fluffy bear with sunglasses and a GoPro harness bolts out of a tent, holding a half-eaten peanut butter jar and a selfie stick in one paw. Selfie camera angle from left side, as soon as it gets out, 2 hikers on the right side are seen tied with the trees. Bear says in an old southern american male voice, Whew! These generous humans let me sample their entire pantry. Hospitality ain’t dead, boys!’

Video = A_big_fluffy_202506301639_dq4vw.mp4

This formula adds two new elements:

  • [DIALOGUE] — The exact line(s) you want the character to say
  • [TONE OF VOICE] — Emotion, accent, or delivery style (e.g., nervous, whispering, angry)

SECTION 7.1: DIALOGUE

What It Does:

Instructs the AI to generate a speaking character with synced lip movement and matching expression.

Examples: 

  • “Today… we take back the throne!”
  • “I haven’t seen a sunrise like this in 30 years.”
  • “Welcome back to Yeti Adventures — today, we’re ice fishing!”
  • “Are the rumors about war in the Outer Rim true?”

Note: Avoid using quotation marks in the prompt (they sometimes trigger subtitles). Just include the sentence clearly in plain text.

SECTION 7.2: TONE OF VOICE

What It Does:

Sets the emotion, accent, or energy level of the spoken dialogue.

Prompt Tips:

  • Combine tone + character type
  • Add emotion to create nuance
  • Can affect performance more than words

Examples: 

  • Nervous young man stammering his words
  • Confident commander shouting with intensity
  • Elderly woman whispering in a tired voice
  • Child speaking with excitement and curiosity
  • Sarcastic tone with casual delivery 

SECTION 7.3: AMBIENT SOUND / BACKGROUND MUSIC

What It Does:

Makes the world feel immersive by layering background audio.

Sound Options to Prompt:

  • Ambient sounds: birds chirping, waves crashing, wind rustling, footsteps
  • Background music: soft piano, epic orchestral, lo-fi beats, eerie drones
  • Specific sound events: thunderclaps, echoing cave reverb, marketplace chatter 

Examples: 

  • The sound of ocean waves crashing below
  • Calm lo-fi piano playing in the background
  • Thunder rumbles in the distance
  • Street noise and occasional car horns


🧱 Full Prompt Example (With Audio & Dialogue):

Prompt: “A young man in a futuristic flight suit stands on a landing platform at sunset. The camera pushes in slowly from chest height. The scene is lit by the warm glow of a setting alien sun. The video has a sci-fi drama tone. He looks out at the sky and says, ‘I don’t know what’s out there… but I’m ready to find out.’ His tone is quiet, but determined. Wind howls in the background and low ambient synth music plays.”

BONUS TIPS:

  • Place dialogue lines near the middle of your prompt
  • Don’t overload your scene with too many background sounds — choose 1–2 max
  • If the speech sounds off, try rephrasing the line or changing the tone of voice
  • Dialogue only works in text-to-video, not when using reference images 

Prompt: A teenage boy in a gray hoodie falls while skating through an empty underground parking garage. The camera follows in a fast dolly shot from behind, then cuts to a low-angle close-up as he stands up, brushing dust off. Harsh overhead lighting casts sharp shadows. The scene has a gritty, raw urban feel. He grins and says: “That one’s going in the highlight reel.” His tone is sarcastic and confident. You hear echoing footsteps, wheels rolling, and distant bass thumps from a nearby car. Skateboarding_Highlight_Reel_Ready.mp4 Prompt: A mother in a house robe pours tea in the kitchen and leans on the counter. A tripod-style medium shot shows her as she says: “You’ve barely eaten. Is everything okay at school?” Her tone is soft and concerned. And her teenage son, seated at the table with a hoodie on, pushes scrambled eggs around with his fork. The camera cuts to a side-angle close-up as he replies: “It’s just exams. I’m fine.” His tone is tired and dismissive. Lighting: warm interior kitchen lights with morning sun peeking in. Style: family drama. Environment: modest kitchen. Ambient sound: kettle steaming, birds chirping outside. 

Video: Skateboarding_Highlight_Reel_Ready.mp4

Prompt: A mother in a house robe pours tea in the kitchen and leans on the counter. A tripod-style medium shot shows her as she says: “You’ve barely eaten. Is everything okay at school?” Her tone is soft and concerned. And her teenage son, seated at the table with a hoodie on, pushes scrambled eggs around with his fork. The camera cuts to a side-angle close-up as he replies: “It’s just exams. I’m fine.” His tone is tired and dismissive. Lighting: warm interior kitchen lights with morning sun peeking in. Style: family drama. Environment: modest kitchen. Ambient sound: kettle steaming, birds chirping outside.

Video: Exam_Stress_A_Family_Drama.mp4

Prompt: A barista and a regular customer sit across from each other at a small café table. The front-facing camera shot frames them both from the waist up. Late morning sunlight spills through the window behind them. Style: slice-of-life with natural light. Environment: cozy urban café. Ambient sound: soft jazz, milk steaming, light chatter. The barista says: “You always order the same thing.” (Tone: teasing) The customer replies: “And you always judge me for it.” (Tone: playful)

Video: The_Usual_Order_Banter.mp4

Prompt: A young man in a hoodie holding a mic and an older man in a suit stand under a streetlamp in a busy plaza. A medium side-by-side shot captures both. Lighting: cool evening light, flickering neon in the background. Style: fast-cut social media clip. Ambient: city noise, people passing, car horns. The interviewer asks: “Has AI affected your job?” (Tone: curious) Stranger replies: “Yeah, I got replaced by a chatbot that says ‘per my last email’ 10x faster.” (Tone: sarcastic)

Video: A_young_man_202506271533_es7dx.mp4

7. Character Consistency in VEO 3 Videos

Create recognizable, recurring characters that your audience will remember.

Now, keeping the character consistency is much easier than the human characters. For animals simple animal character descriptions work but for human characters, you need to be more detailed.

Why Character Consistency Matters

If your character changes appearance from scene to scene — different fur color, hairstyle, outfit, or face — your video loses continuity and trust. Whether you're building a comedic series, a cinematic short film, or a talking animal vlog, keeping your character visually consistent is key to immersion and storytelling.

Think of it like a movie actor: same costume, same face, same attitude — across all shots.

What Can Break Consistency?

Here’s what often causes inconsistency in VEO 3:

  • Slight changes in phrasing across prompts
  • Over-describing or changing outfits/moods
  • Including too many random traits in every scene
  • Not referencing the same core features every time

Core Traits You Must Lock In

To make your character recognizable, pick and repeat these traits:

Trait What to Specify
Species / Race Human, bear, alien, robot, monkey, etc.
Physical Features Fur color, skin tone, eye color, size
Outfit / Accessories Scarf, backpack, helmet, cloak
Face / Expression Goofy smile, sad eyes, grumpy look
Voice Style Accent, tone, gender, attitude
Posture / Vibe Confident, awkward, clumsy, elegant

🐻 Example: The Bear Influencer

Scene 1: A big fluffy bear with sunglasses and a GoPro harness walks slowly through a dense forest, holding a selfie stick. Selfie-style camera angle from front, slightly tilted. Light rays pierce through trees. Bear says in an old southern American male voice: “I ain’t had a decent snack in three days. If I find one more pinecone, I’m gonna cry.”

Video = A_big_fluffy_202506301836_lljym.mp4

Scene 2: A big fluffy bear with sunglasses and a GoPro harness suddenly stops walking in the forest and points the selfie stick toward a colorful tent visible between the trees. Selfie camera angle from the left, zooming in slightly as he spots it. The bear’s eyes widened with excitement. The big fluffy bear says in an old Southern American male voice: “Oh sweet gravy — is that a tent? Please tell me they packed snacks!”

Video = A_big_fluffy_202506301835_d00l5.mp4

Scene 3: A big fluffy bear with sunglasses and a GoPro harness bolts out of a tent, holding a half-eaten peanut butter jar and a selfie stick in one paw. Selfie camera angle from left side, as soon as it gets out, 2 hikers on the right side are seen tied with the trees. Bear says in an old southern american male voice, “Whew! These generous humans let me sample their entire pantry. Hospitality ain’t dead, boys!’

Video = A_big_fluffy_202506301838_ngcgq.mp4

Scene 4: A big fluffy bear with sunglasses and a GoPro harness walks casually through a forest trail holding a selfie stick. He suddenly stops and tilts the camera slightly to the right, revealing Bigfoot standing behind a tree, also holding a selfie camera and vlogging. Selfie-style camera angle from the bear’s front-left. The big fluffy bear says in an old Southern American male voice: Dang… didn’t think I’d run into the forest’s second-most handsome vlogger today.

Video = A_big_fluffy_202506301820_spxcx.mp4

Pro Tip: Copy your base character prompt and paste it at the start of every new scene you write. Treat it like a casting note — the AI model needs that repetition to stay visually locked in.

Avoid These Mistakes:

  • Changing outfits between scenes (“a red hoodie” ➜ “a camo jacket”)
  • Swapping emotional tone without reason (“awkward” ➜ “angry”)
  • Adding extra accessories or items randomly
  • Rewriting the prompt from scratch instead of pasting the master description

Summary

Do This Avoid This
Lock in physical + outfit traits Changing fur color or gear randomly
Use the same sentence to describe character in every prompt Rewriting from memory each time
Keep voice, tone, and attitude identical Mixing accents, changing personalities

Now, you’re ready to build consistently recognizable characters that will immerse your audience in the story!

8. Mastering Camera Angles, Shots & Movements in VEO 3

Camera angles affect how the viewer feels about the subject. With VEO 3, you can simulate cinematic perspectives by simply describing the angle in your prompt.

General Prompt (Base Scene Used for All Examples)

A young man with a red hiking jacket stands on a grassy hilltop overlooking a vast valley. The wind rustles his hair as he gazes into the distance. Morning fog lingers in the background. Style: cinematic nature documentary. Ambient: soft wind, distant birds.

Now, we’ll reuse this exact prompt and only change the camera angle for each example to demonstrate the difference.

1. Eye-Level Shot (Neutral, Balanced Perspective)

  • Description: Camera at eye level with the subject.
  • Effect: Feels grounded and natural — like the viewer is standing with him.

2. High Angle Shot (From Above, Makes Subject Small)

  • High angle shot looking down at the man from above the hill.
  • Effect: Creates a sense of vulnerability, insignificance compared to nature.

3. Low Angle Shot (From Below, Makes Subject Powerful)

  • Low angle shot from the grass, looking up at the man as he faces the valley
  • Effect: Makes him appear confident, heroic, or awe-inspiring.

4. Worm’s-Eye View (Extreme Low Angle, Emphasizes Scale)

  • Worm’s eye shot from near the ground, framing the man with tall grass in the foreground and the sky above.
  • Effect: Adds drama and grandeur to the landscape and subject.

5. Bird’s-Eye View (Overhead, Tactical, Wide)

  • Bird’s-eye shot directly above the man standing still on the grassy hilltop.
  • Effect: Emphasizes his isolation in the environment and gives geographical context.

6. Over-the-Shoulder Shot (Great for Dialogue or Reaction)

  • Over-the-shoulder shot from behind the man, looking out over the valley.
  • Effect: Creates intimacy — puts the viewer in his shoes, sharing the view.

7. Dutch Angle / Tilted Frame (Unstable, Tense)

  • Tilted shot of the man as he stands unevenly on the hill, horizon slightly slanted.
  • Effect: Adds tension, unease — good for mysterious or off-balance moods

Shot Types – Framing the Story

Shot types define how much of the subject and environment is visible. With VEO 3, you can control this by describing the distance, framing, and what’s in view.

1. Wide Shot (Establishing the Scene)

  • Wide shot of the man standing alone on the hilltop, with the valley stretching out behind him.
  • Effect: Shows scale, environment, and mood. Ideal for setting the scene.

2. Medium Shot (Balanced Human Focus)

  • Medium shot of the man from the waist up, hill and sky in the background.
  • Effect: Adds emotion while maintaining environmental context. Good for monologues or vlogs.

3. Close-Up (Emotional Detail)

  • Close-up shot of the man’s face, eyes focused on the horizon, hair moving in the breeze.
  • Effect: Creates emotional connection. Highlights subtle expressions.

4. Extreme Close-Up (Micro Moments)

  • Extreme close-up of a tear forming at the corner of his eye as wind brushes his cheek.
  • Effect: Intense focus on detail — creates intimacy or drama.

5. Two-Shot (Interaction Focus)

  • Wide two-shot of the man and a woman standing together on the hilltop, both facing the valley.
  • Effect: Perfect for dialogue or shared experience between characters.

6. POV Shot (First-Person View)

  • POV shot walking toward the cliff edge, arms and boots visible below. The valley opens up ahead.
  • Effect: Feels immersive. Puts the viewer inside the character’s shoes.

7. Selfie Shot (Social, Personal Style)

  • Selfie video of the man holding the camera at arm’s length with the valley behind him. He smiles and says: “This was so worth the climb.” (Tone: happy, proud)
  • Effect: Personal and engaging — great for vlog-style content.

8. Tracking / Follow Shot (Dynamic Framing)

  • Tracking shot from behind as the man walks along the hill ridge toward the sunrise.
  • Effect: Adds motion, maintains composition around subject.

Camera Movements – Bringing Scenes to Life

Camera movements add dynamism and realism. They replicate how real-world filmmakers move the camera to follow action, build tension, or reveal beauty.

VEO 3 responds well to movement keywords when you describe how the camera moves, not just what’s in the scene.

  1. Pan Shot (Left or Right): The camera slowly pans right or left across the hilltop, following the character’s gaze as they turn toward the valley.
  2. Tilt Shot (Up or Down): The camera tilts up from the character’s boots to their face as they stand still on the hill.
  3. Tracking Shot (Follow): The camera tracks behind the character as they walk along the ridge, with the valley opening up beside them.
  4. Dolly Shot (In or Out): The camera slowly dollies in toward the character’s face as they look into the horizon.
  5. Crane / Jib Shot (Vertical Move): The camera cranes up from the base of the hill, revealing the character standing at the peak.
  6. Zoom Shot (Simulated Optical Zoom): The camera zooms in slowly from a wide shot to focus on the character’s eyes.
  7. 7. Handheld Motion: Description: Handheld-style shot as the character adjusts their backpack, with the camera swaying gently with each motion.

9. Google VEO 3 Viral Video Prompts & Templates

Welcome to the Viral Content Lab

This chapter features a curated set of viral video formats specifically designed for Google VEO 3 — including their exact prompt structures and fully written example prompts. You’ll find most of the most categories of the viral videos you probably seen on the internet by now.

From ASMR slicing videos and selfie-style vlogs to street interviews and cinematic animal adventures, each format includes:

  • A consistent prompt template
  • Three full-length example prompts to generate high-engagement clips
  • Categories designed for viralityrelatabilityhumor, or shock value

Whether you're creating content for YouTube Shorts, TikTok, Instagram Reels, or viral ad reels, this chapter gives you copy-paste prompt formulas and creative fuel to turn simple AI clips into scroll-stopping gold.

Section 8.1: Bigfoot & Yeti Vlogs

These prompts are built for hilarious, chaotic, and surprisingly human-feeling vlogs from two mythical creatures: Bigfoot and his questionable sidekick, Yeti.

  • Tone: Comedic, expressive, unpredictable
  • Style: Handheld selfie-style vlog camera
  • Goal: Blend realism with absurd humor for viral short-form content
  • Use Case: TikTok, YouTube Shorts, Reels

Prompt Structure:

A selfie video of [CREATURE NAME & DESCRIPTION] in [LOCATION/Environment]. He holds the camera at arm’s length, and his hairy arm is clearly visible in the frame. The style is YouTube Vlog-like. The creature [describe action, e.g., running, sneaking, pressing button]. He then turns the camera toward [describe what’s in the background or other character]. He says in a [voice tone/accent] voice: “[Dialogue]”

Sounds (optional): [ambient + specific sounds, e.g., wind, footsteps, monster growl, alarm].

No subtitles.

Example 1: Bigfoot in the Nevada Desert

A selfie video of Bigfoot, towering, shaggy dark reddish-brown fur, primate-like face, intelligent dark eyes, in the Nevada desert at night, holding a selfie stick at arm's length. His long hairy arm is clearly visible in the frame. The style is YouTube Vlog like. The bigfoot is sprinting with his companion — a large white Yeti with icy blue eyes and thick fur. They both race toward a fortified gate marked “AREA 51.” Bigfoot slaps a huge red button. He turns the camera back to his face and says in a breathless British male voice: “We touched it! We actually touched it! We’re legends now, lads!”

Sounds: desert wind, running footsteps, beeping alarm in background. No subtitles.

Example 2: Bigfoot Inside Area 51

A selfie video of Bigfoot, towering, shaggy dark reddish-brown fur, primate-like face, intelligent dark eyes, running down a sterile sci-fi hallway inside Area 51. He holds the selfie stick at arm’s length, his hairy arm visible. The white Yeti runs behind him, looking terrified. The camera bounces slightly with movement. The style is YouTube Vlog like. Bigfoot glances back, then spins the camera briefly to show two small gray aliens chasing them with glowing stun batons. He says in a panicked British male voice: “They are NOT fans. They are monsters trying to kill us!”

Sounds: alarm sirens, alien shrieks, stomping footsteps, metallic clanging. No subtitles.

Example 3: Bigfoot’s Desert Canyon Escape

A selfie video of Bigfoot, towering, shaggy dark reddish-brown fur, primate-like face, intelligent dark eyes, running breathlessly through a desert canyon at dawn. He holds the camera at arm’s length while glancing behind. His companion Yeti sprints beside him. As he pans the camera, a hovering silver UFO can be seen rising in the distance behind them. The style is YouTube Vlog like. He looks at the camera, wide-eyed, and says in a deadpan British male voice: “So… we *may* have stolen a teleporter.”

Sounds: wind, rapid footsteps, low UFO hum, sand crunching. No subtitles.

Full Video = Bigfoot Alien Adventure.mp4

Section 8.2: ASMR Object Slicing Videos

This section explores how to use VEO 3 to create visually stunning and audibly satisfying ASMR-style videos. These videos are perfect for audiences who love satisfying, meditative visuals combined with high-quality sound design.

  • Tone: Slow, satisfying, peaceful
  • Style: Macro cinematic, silent room with crisp slicing sounds
  • Goal: Create viral sensory content for TikTok, YouTube Shorts, and Instagram Reels
  • Use Case: Relaxing reels, loop-worthy visuals, aesthetic edits

Prompt Template:

A hyper-realistic ASMR video shot in extreme macro perspective. A razor-sharp knife slowly slices through [OBJECT TYPE] — The object rests on a rustic wooden cutting board, bathed in warm, cinematic lighting. Each intentional cut delivers a deeply satisfying ASMR audio experience — [describe sound: crisp, wet, metallic, etc.] — echoing in an otherwise silent, meditative space. Sliced piece drops gently to the cutting board. The overall pacing is slow and soothing, crafting a tranquil ASMR experience. No background music. No talking.

Example Prompt:

A hyper-realistic ASMR video shot in extreme macro perspective. A razor-sharp knife slowly slices through a crystal strawberry made of transparent red glass — subtle inner reflections and smooth edges shimmer under soft light. The strawberry rests on a rustic wooden cutting board, bathed in warm, cinematic lighting. Each intentional cut delivers a deeply satisfying ASMR audio experience — clean glass crackles and sharp slicing. A sliced wedge drops gently onto the board. Pacing is slow and calming. No background music. No talking.

Video: Crystal_Strawberry_ASMR.mp4

Section 8.3: Stormtrooper Vlogs (Viral Sci-Fi Style Comedy)

This section focuses on creating viral first-person vlogs using a consistent character — the classic Stormtrooper. These videos blend YouTube-style handheld vlogging with Star Wars-inspired humor and dialogue, making them ideal for meme pages, comedy shorts, and sci-fi parody accounts

  • Tone: Sarcastic, awkward, chaotic
  • Style: Selfie-style handheld vlog
  • Use Case: YouTube Shorts, TikTok humor channels, meme accounts
  • Key Visual Cues: Stormtrooper armor, slightly scratched gear, handheld camera vibe, expressive body language

Prompt Template:

A selfie-style video of a standard Imperial Stormtrooper in full white armor and helmet, slightly scratched and weathered from field duty. He holds the camera at arm’s length, his white-gloved hand clearly visible. The character is [describe action or situation]. He speaks in a young, energetic American male voice with dry, sarcastic delivery: “[dialogue]”. The background is [describe location: desert, ship interior, etc.], captured with realistic camera shake and ambient sounds like ship hums, sand winds, or distant explosions. No music. No subtitles.

Stormtrooper in the Desert After a Crash:

A selfie-style video of a standard Imperial Stormtrooper in full white armor and helmet, slightly scratched and weathered from field duty. He holds the camera at arm’s length, his white-gloved hand clearly visible. The character is standing in a hot, wind-swept desert with a smoldering escape pod behind him. He wipes his visor dramatically and says in a dry, sarcastic tone: “Well, Greg said he could land it manually… and technically, he did.” Background includes drifting sand, distant ship debris, and occasional electrical sparks. No music. No subtitles.

Stormtrooper Lost Inside a Ship:

A selfie-style video of a standard Imperial Stormtrooper in full white armor and helmet, slightly scratched and weathered from field duty. He holds the camera at arm’s length, his white-gloved hand clearly visible. The character is walking quickly through a dimly lit Imperial ship hallway, glancing side to side at confusing signs. He says in a confused, muttering voice: “Okay, deck 9… hallway 3B… wait, where the hell am I? This place is a maze.” Footsteps echo. Background shows flickering red hallway lights and emergency sirens. No music. No subtitles.

A selfie-style video of a standard Imperial Stormtrooper in full white armor and helmet, slightly scratched and weathered from field duty. He holds the camera at arm’s length, ducking behind a scorched wall. The character peeks out, then quickly ducks as blaster fire whizzes by. He says in a panicked but sarcastic tone: “Okay… I’m pinned down… Greg went to ‘flank’ the rebels and never came back. I think he flanked into a different timeline.” Ambient sound includes blaster shots, distant yelling, and armor clanking. No music. No subtitles.

Section 8.4: Animal Olympics (Cinematic Viral Shorts)

This section showcases viral prompt templates where animals perform Olympic-level sports in ultra-realistic cinematic fashion. These surreal, high-action videos get high engagement due to their humor, creativity, and visual absurdity — perfect for shorts or meme edits.

  • Tone: Epic, dramatic, or comedic
  • Style: Cinematic realism with wide-angle or tracking shots
  • Use Case: Viral reels, parody content, sports-themed edits
  • Key Elements: Stadium settings, exaggerated camera angles, slow motion, unexpected animal behavior

Base Prompt:

A televised cinematic video of [animal/character description] performing an acrobatic dive into a swimming pool at the Olympics, from a 10m high diving board, with slow-motion flips and spins. The shot starts with a close-up of the character’s face for dramatic effect, then switches to a wide-angle aerial shot as the dive begins. Stadium lights shimmer off the water’s surface. The crowd erupts in awe. Sound includes loud applause, water splash, and echoing commentator voice. No music. No subtitles.

Example Prompts:

A televised cinematic video of a muscular gorilla with a determined expression performing an acrobatic dive into a swimming pool at the Olympics, from a 10m high diving board. The shot begins with a dramatic close-up on his fur-covered face, then cuts to a wide aerial shot as he launches off the board, spinning in the air with surprising grace. Stadium lights shimmer on the water. The crowd gasps in awe. Loud applause, echoing arena acoustics, and muffled announcer voice. No music. No subtitles.

A televised cinematic video of a confident emperor penguin performing an Olympic-style dive from a 10m board. The camera starts with a steady zoom as the penguin waddles to the edge, flaps its wings twice, and leaps off. In slow motion, it performs a graceful mid-air twist before plunging into the pool. Spotlights beam down. The audience bursts into cheers. Sound includes splashing, soft crowd echo, and excited announcer commentary. No music. No subtitles.

A televised cinematic video of a fluffy white goat wearing a red sports scarf performing a professional-level dive at the Olympics. The goat stands confidently on the 10m board, bleats once, then springs into a fast front tuck. Slow-motion captures the rotation, fur rippling, and a perfectly straight water entry. Massive crowd noise and camera flashes follow. Ambient crowd reaction and stadium PA system announcing the perfect score. No music. No subtitles.

Other Activities in Animal Olympics

A televised cinematic realistic video of a [describe animal with detail] performing a professional-level [Olympic sport] inside a massive modern Olympic stadium filled with cheering crowds. The scene is captured in [describe camera movement: e.g., dramatic tracking shot, slow motion dive, close-up spin]. The animal shows surprising form and skill. The video includes ambient stadium sounds like roaring applause, loudspeaker commentary, camera flashes, and background national flags waving. No music. No subtitles.

Orangutan Gymnastics

A cinematic realistic video of a focused orangutan with chalk-covered hands and a custom-fit gymnastics uniform performing a professional-level floor routine in a packed Olympic stadium. Captured with dynamic crane and dolly shots as he executes backflips, spins, and a final pose with arms wide. The crowd erupts in cheers, flashes go off from cameras. Stadium lighting reflects off his sweaty brow. Ambient sounds include cheering, camera shutters, and floor mat impacts. No music. No subtitles.

Elephant Weightlifting Finals

A cinematic realistic video of a massive Asian elephant wearing a weightlifting belt and numbered jersey attempting an Olympic deadlift in a giant, echoing weightlifting arena. The camera starts low-angle, slowly pans up as the elephant lifts a barbell stacked with glowing titanium weights. Slow motion as his muscles flex, trunk curls, and tusks gleam under stadium lights. The crowd gasps, then roars. Ambient sound includes heavy breathing, the judge’s whistle, and a loud *clang* as the weights drop. No music. No subtitles.

Cheetah 100m Sprint Finals

A cinematic realistic video of a lean, muscular cheetah with a sleek racing vest crouched in the starting blocks of an Olympic track. Tracking side shot as the gun fires and the cheetah launches forward, its paws pounding the turf at incredible speed. The camera follows at ground level as it overtakes robotic competitors and breaks the finish line ribbon with a roar. The crowd goes wild. Sound includes starting pistol crack, wind rushing, and high-speed paw impacts. No music. No subtitles.

Section 8.5: Street Interview Prompts

This section is all about capturing the humor, awkwardness, and absurdity of man-on-the-street style interviews — but with a twist. You’ll find a structured prompt formula and brand-new examples designed to go viral on TikTok, Reels, and YouTube Shorts.

These videos typically feature a dramatic or comedic background, a reporter with a mic, and a highly expressive subject. Perfect for commentary on current events, parody interviews, or surreal humor.

Prompt Template for Street Interviews:

A street interview style video. Over-the-shoulder shot of a reporter holding a microphone with a news logo. [INTERVIEWED SUBJECT] giving an interview. Background: [BACKGROUND DESCRIPTION]. Reporter says: “[Reporter dialogue]” [INTERVIEWED SUBJECT] says: “[Subject response]” No subtitles.

Example Prompt:

A street interview style video. Over-the-shoulder shot of a female reporter holding a microphone with a news logo. A confused man wearing futuristic armor and a holographic watch is giving an interview.

Background: A modern-day gas station with cars refueling, as people stare at him.

Reporter says: “Sir, are you aware this isn’t the year 3025?”

The man says with a robotic accent: “Wait… this isn’t the fuel depot on Mars 9? Then who filled my quantum tank with… gasoline?”

VideoTime_Traveler_s_Gas_Station_Mishap.mp4

Section 8.6: Cinematic Animal Shorts

This section focuses on creating hyper-realistic, story-rich cinematic videos starring animals in unexpected human-like roles. These are perfect for storytelling, parody, or just visually stunning viral clips.

Each prompt uses the cinematic structure to craft scenes that feel like they belong in a movie trailer — except the main characters are animals.

Cinematic realistic [Camera Motion or Shot Type] [Consistent Character Description] [Background] [Action] [Style] [Ambiance] [Time of Day] [Dialogue] [Sound effects] [Voice-over] [Ambient noise] [Music] No subtitles

Prompt:

Cinematic realistic tracking shot of a silver convertible racing through winding mountain roads. A large, muscular gorilla wearing a red leather jacket grips the wheel tightly, sunglasses reflecting the road ahead.

Background: Sunset-drenched cliffs with pine trees rushing past.

Action: The gorilla shifts gears and glances at the rearview mirror as sirens wail in the distance.

Ambiance: Golden hour, cinematic lighting with lens flares. 

Dialogue: The gorilla growls in a deep, cocky voice: “They’re fast… but I’m banana-fueled.”

Sound effects: Engine roar, tire screeches. No subtitles.

Prompt:

Cinematic realistic dolly shot following a sleek black cat in a tuxedo walking across a luxury hotel lobby.

Character: Green eyes, tiny earpiece, fluffy tail swaying with purpose.

Background: Crystal chandeliers, wealthy guests, bellhops.

Action: The cat leaps smoothly onto a luggage cart, then disappears behind a curtain.

Time of Day: Night.

Dialogue: Cat in smooth British male voice: “Target acquired. Pigeon in the penthouse.”

Sound: Jazz piano in background, low chatter. No subtitles.

10. VEO 3 JSON Prompting

What is JSON Prompting?

JSON prompting isn’t some mystical AI spell — it’s just a more structured way to talk to AI video generators like Google VEO 3. 

Instead of giving a single messy sentence like:

“Make a drink ad that explodes with particles and lands on a rock.”

You give the AI a clear breakdown of exactly what to show, how it moves, what it looks like, and how the scene should unfold. Like this:

{
"product_name": "Blaze Energy",
"description": "A glowing can forms from swirling liquid, lands with impact, and ripples energy
across a mountain cliff.",
"style": "cinematic, slow-mo",
"camera": "orbit shot, macro close-up",
"motion": "assembles mid-air, lands with energy pulse",
...
}

So what does this do?

By using JSON formatting, you're:

  • Giving the AI a step-by-step storyboard
  • Controlling the camera style, lighting, and animation
  • Ensuring your product is the main hero of the scene
  • Creating reusable templates for different products

Why is it so effective?

The truth is, AI struggles with vague or chaotic input. JSON prompting breaks things down into logical components. It mimics how human directors think:

“What is the setting? What’s the lighting? What’s the motion? What should happen first?”

This results in:

  • More consistent animation
  • Higher visual quality
  • Ads that actually look professional

How to Use This Prompt Pack

This section shows you exactly how to take the prompt template from this guide and generate stunning CGI-style video ads using Google VEO 3 (or any compatible platform like Google Flow, OpenArt, etc).

Step 1: Customize the JSON Prompt

Start by copying the JSON template from this guide. Then update the values to match your product.

Don’t forget to describe:

  • Product name
  • Product type
  • Visual style
  • Lighting
  • Motion/animation
  • Ending scene
  • CTA (call to action) effect

Here’s a quick example:

{
"product_name": "Digital Assets Energy",
"product_type": "Drink",
"style": "futuristic cinematic",
"lighting": "glowing pink-blue blend",
"motion": "assembles in air and blasts down with particle trails",}

Step 2: Use ChatGPT to Help Build the Prompt

You don’t even need to write everything yourself.

Use this prompt in ChatGPT:

“Using this JSON template [paste the template], generate a cinematic CGI-style
video prompt for a [type of ad like product reveal or unboxing] of a [product name],
describing the motion, setting, camera style, lighting, and atmosphere in a Google
VEO 3-compatible structure.”

Step 3: Paste Prompt into VEO 3 or Your Preferred Platform

Once you have your JSON prompt:

  • Go to Google Flow or OpenArt
  • Choose Text to Video
  • Set model to VEO 3
  • Select 1080p and desired speed (Fast is cheaper)
  • Paste in your full JSON prompt
  • Hit Generate

That’s it — your cinematic ad starts rendering!

Ad Templates – The Backbone of Viral Ads

Before you start creating your own mind-blowing product ads, you need a solid prompt structure — and that’s where templates come in.

Below are two powerful JSON prompt templates that I personally use for creating Google VEO 3 video ads. These are beginner-friendly, modular, and can work for almost any product.

General Template:

{
"description": "",
"product_name": "",
"product_type": "",
"brand_style": "",
"style": "",
"camera": "",
"lighting": "",
"location": "",
"time_of_day": "",
"atmosphere": "",
"elements": [
"main product animation idea #1",
"supporting visual element #2",
"environment enhancement #3"
],
"motion": "",
"cta_motion": "",
"ending": "",
"text": "",
"keywords": [
"aspect ratio",
"brand",
"visual effects",
"camera",
"mood",
"no people"
]
}

Template 2: No People Product Reveal Ad

Use this for cinematic floating objects, product reveal sequences, or hyper-stylized ads with no humans involved.

{
"product_name": "Your Product Name",
"product_type": "e.g., Energy Drink, Perfume, Smartwatch",
"description": "Short cinematic idea – like a glowing can forming in mid-air and crashing down
with particle FX.",
"brand_style": "Futuristic, Luxurious, Bold, Minimalist, etc.",
"style": "cinematic, slow motion, glossy, abstract, surreal",
"camera": "macro close-ups, zoom-ins, orbiting shots, etc.",
"lighting": "neon glow, ambient shadows, light pulses, rim lighting",
"location": "Digital chamber, cliffside, cityscape – whatever fits the product",
"time_of_day": "Twilight, Night, Undefined",
"atmosphere": "Foggy mist, particles in the air, glowing energy, floral vapor",
"elements": [
"Describe 3 to 4 key moments (e.g., 'product assembles mid-air')",
"Highlight the movement and animation style"
],
"motion": "Main animation sequence (e.g., spins, glows, shoots particles)",
"cta_motion": "How the logo appears (e.g., zooms in with glitch effect)",
"ending": "Final scene — product glowing or resting, background pulsing",
"text": "none",
"keywords": [
"16:9",
"product name",
"cinematic",
"no people",
"CGI animation",
"VEO 3"
]
}

Let me show an example. I have taken this prompt template and pasted in Chatgpt and asked it:

“Using this JSON template , generate a cinematic CGI-style video prompt for a Rolex Watch Reveal, describing the motion, setting, camera style, lighting, and atmosphere in a Google VEO 3-compatible structure”

It gave me this prompt:

{
"product_name": "Rolex Skycraft",
"product_type": "Luxury Watch",
"description": "A high-end Rolex watch emerges from swirling golden mist, assembling piece
by piece mid-air with precision and elegance.",
"brand_style": "Luxurious, Timeless, Bold",
"style": "cinematic, ultra slow motion, glossy, surreal, hyper-realistic",
"camera": "macro close-ups of gears, orbiting shot around dial, smooth dolly zoom",
"lighting": "soft rim lighting, golden glow, high contrast highlights with reflections",
"location": "floating digital pedestal above a sea of clouds",
"time_of_day": "Twilight",
"atmosphere": "gold particles in the air, misty horizon, light rays shining from above",
"elements": [
"The watch gears assemble mid-air with mechanical precision",
"The face of the Rolex rotates into view as the light glints off the glass",
"The band wraps itself around the core with a metallic ripple",
"A final wide shot reveals the full watch levitating in golden light"
],
"motion": "Slow 360° spin with subtle ticking motion, golden particles radiate outward,
reflections move across the watch surface",
"cta_motion": "Rolex logo fades in over the horizon with a soft pulse and shimmer",
"ending": "The watch gently lands on a marble slab, glowing softly as fog curls around its
base",
"text": "none",
"keywords": [
"16:9",
"Rolex Skycraft",
"cinematic",
"no people",
"CGI animation",
"VEO 3"
]
}

And the video generated by this prompt:

Here’s the Video: _product_name_rolex_202508041340_kec55.mp4

Template 3: General Product Ad (People Allowed)

Use this for lifestyle scenes, unboxings, influencer-style ads, and storytelling-based videos.

{
"product_name": "Your Product Name",
"product_type": "e.g., Hairbrush, Coffee Mug, Skincare Serum",
"description": "Overview of the ad – e.g., a woman unboxes the product and tries it with a
smile",
"brand_style": "Clean, Natural, Friendly, Luxurious, Bold",
"style": "Lifestyle ad, influencer reel, cinematic studio, or vlog-style",
"camera": "close-ups on hands, slow pans, over-the-shoulder shots",
"lighting": "natural daylight, studio backlight, warm tones, etc.",
"location": "bathroom counter, rooftop, cozy living room – keep it relevant",
"time_of_day": "morning, evening, undefined",
"atmosphere": "Cozy, luxurious, clean, calm, or energetic",
"elements": [
"List 2–3 visual actions (e.g., 'girl brushing hair in front of mirror')",
"Include any product usage or lifestyle framing"
],
"motion": "Describe flow – e.g., hand gestures, product rotation, character reactions",
"cta_motion": "Logo or slogan appearing (e.g., glowing text, brand flash)",
"ending": "Final shot — product resting in scene, character smiling, etc.",
WSODOWNLOADS.IN
"text": "If you want on-screen tagline, write it here",
"keywords": [
"16:9",
"your product",
"people",
"lifestyle",
"realistic ad",
"influencer style"
]
}

11. VEO3 Viral Product Ads Prompts: Creating High-Impact CGI Ads

Let’s put the templates into action. Below are real JSON prompts I personally used to generate viral CGI ads — all created using Google VEO 3 with zero manual editing.

You can take these as inspiration, or copy-paste them directly into your workflow and tweak them to fit your product.

Demo videos can be viewed here:

ENERGY DRINK — "Blaze Energy"

{
"description": "A hyper-cinematic ad showing a glowing Blaze Energy can forming from
liquid particles mid-air, landing with an impact on a rock ledge as energy waves ripple
out and light up the entire scene.",
"product_name": "Blaze Energy",
"product_type": "Energy Drink",
"brand_style": "Bold, extreme, futuristic",
"style": "cinematic, ultra slow-mo, high contrast",
"camera": "high-speed macro close-up, whip zoom, orbiting shots",
"lighting": "electric neon pulses with glowing mist",
"location": "futuristic mountain cliff",
"time_of_day": "twilight",
"atmosphere": "charged, energetic air with sparks",
"elements": [
"Blaze Energy can forming mid-air from swirling particles",
"electric shockwave lighting up terrain on impact",
"glowing crystal rock surface with crackling energy veins"
],
"motion": "product assembles from particles, lands hard, sends energy pulse across
ground",
"cta_motion": "logo zooms in with glitch burst and freeze-frame glow",
"ending": "wide angle of glowing can on the ledge with terrain still pulsing around it",
"text": "none",
"keywords": [
"16:9",
"Blaze Energy",
"glitch",
"particle FX",
"no people",
"impact shot",
"energy burst"
]
}

PERFUME — "Noir Bloom"

{
"description": "A luxurious Noir Bloom perfume bottle materializes from swirling violet smoke,
spins in the air with glowing runes, and lands on a reflective obsidian pedestal inside a high-end
black glass chamber.",
"product_name": "Noir Bloom",
"product_type": "Perfume",
"brand_style": "Luxury, dark elegance, mystic",
"style": "glossy cinematic, dark moody tones",
"camera": "floating dolly-in, glass reflections, macro glides",
"lighting": "soft neon purple glow, ambient shadows, moving specular highlights",
"location": "dark perfume chamber with reflective black walls",
"time_of_day": "timeless / undefined",
"atmosphere": "dense floral mist, glowing particles in the air",
"elements": [
"perfume bottle rising out of swirling violet smoke",
"hovering in air with animated symbols orbiting it",
"lands on floating obsidian pedestal with soft light bloom"
],
"motion": "slow rotational spin with ribbons of vapor wrapping around",
"cta_motion": "slogan materializes letter by letter in elegant serif over particle background",
"ending": "product glows gently with mist swirling and reflection shimmering",
"text": "none",
"keywords": [
"16:9",
"Noir Bloom",
"perfume",
"particles",
"floating bottle",
"vapor FX",
"no people"
]
}

Watch – "ChronoEdge S7"

{
"description": "A sleek smartwatch, the ChronoEdge S7, emerges from a glowing void,
with each part of the watch materializing in mid-air with futuristic mechanical animations.
The watch faces flicker between various designs, showcasing its versatility.",
"product_name": "ChronoEdge S7",
"product_type": "Smartwatch",
"brand_style": "Futuristic, tech-savvy, precise",
"style": "sleek, high-tech, glossy",
"camera": "360-degree rotation around the watch, macro close-ups, zooming in on
details",
"lighting": "neon blue and silver glow, with sharp reflective surfaces",
"location": "infinite digital space, sleek black surface",
"time_of_day": "timeless",
"atmosphere": "futuristic, digital, pulsating lights",
"elements": [
"watch face materializing from the center",
"geometric light trails tracing the watch’s edges",
"holographic projections of watch faces rotating around"
],
"motion": "watch assembles in mid-air, surfaces shine with intricate light patterns",
"cta_motion": "product zooms out, logo appears from digital particles",
"ending": "watch resting on a glass surface with a glowing holographic logo",
"text": "none",
"keywords": [
"16:9",
"smartwatch",
"tech",
"futuristic",
"no people",
"digital assembly",
"holographic"
]
}

Airpods – "AirPods Pro X"

{
"description": "AirPods Pro X float out of a floating cube, assembling slowly with
glowing circuit-like connections, followed by dynamic, swirling animations of sound
waves, as the product showcases its noise-canceling technology.",
"product_name": "AirPods Pro X",
"product_type": "Earbuds",
"brand_style": "Minimalist, elegant, high-performance",
"style": "dynamic, clean, high-tech",
"camera": "floating close-ups, orbiting around the AirPods",
"lighting": "soft white with glowing highlights on product edges",
"location": "floating in a digital soundscape",
"time_of_day": "timeless",
"atmosphere": "sound waves, ambient tech light reflections",
"elements": [
"AirPods materializing from a floating glowing cube",
"dynamic sound waves visualizing around the product",
"motion trail effects as the AirPods rotate"
],
"motion": "AirPods rotate in the air, sound waves flow around them",
"cta_motion": "logo materializes with a soft pulse around the AirPods",
"ending": "floating AirPods surrounded by swirling sound waves, logo shines",
"text": "none",
"keywords": [
"16:9",
"AirPods",
"sound waves",
"noise canceling",
"tech",
"high-quality",
"no people"
]
}

Noise-Canceling Headphones

{
"description": "The QuietCore 360 headphones float out of an orb of pulsating sound
waves, the ear cups forming from liquid metal. Sound waves clash against the
headphones and dissipate, illustrating the powerful noise-canceling feature.",
"product_name": "QuietCore 360",
"product_type": "Noise-Canceling Headphones",
"brand_style": "premium, immersive, powerful",
"style": "futuristic, clean, energetic",
"camera": "slow, smooth zooms around headphones, macro detail shots",
"lighting": "deep blue and silver, with energetic pulses and reflections",
"location": "dark space with floating sound waves",
"time_of_day": "timeless",
"atmosphere": "sound isolation, dark energy pulses",
"elements": [
"headphones materializing from liquid metal",
"pulsating sound waves crashing against headphones",
"ambient electric light trails"
],
"motion": "headphones form and adjust as sound waves clash, then dissipate",
"cta_motion": "logo emerges from the calm after the sound storm",
"ending": "headphones float in a quiet void with calming lights",
"text": "none",
"keywords": [
"16:9",
"headphones",
"sound waves",
"noise-canceling",
"premium",
"immersive",
"no people"
]
}

Lamborghini – "Aventador Reveal

{
"description": "A Lamborghini Aventador is revealed in a dark futuristic hangar. Metallic
panels rise from the floor and float around, locking together piece by piece to form the
car. Yellow lighting pulses with each connection, and the environment transforms into a
high-end Lamborghini showroom.",
"product_name": "Lamborghini Aventador",
"product_type": "Luxury Sports Car",
"brand_style": "bold, mechanical precision, ultra-luxury",
"style": "futuristic cinematic, assembly animation",
"camera": "slow orbit, macro detail shots on forming parts, wide pullback",
"lighting": "intense spotlighting with yellow underglow and glossy black reflections",
"location": "futuristic vehicle chamber with reflective floor",
"time_of_day": "night",
"atmosphere": "mysterious, ambient tech hum, light fog",
"elements": [
"floating car parts assembling mid-air",
"yellow energy pulses and connector sparks",
"reflective black showroom platform emerging below"
],
"motion": "Lamborghini forms from levitating parts, showroom rises as lighting
activates",
"cta_motion": "logo folds in like a metallic badge over top of the car",
"ending": "finished Lamborghini surrounded by a glowing showroom with subtle
camera rotation",
"text": "none",
"keywords": [
"16:9",
"Lamborghini",
"reveal animation",
"car assembly",
"cinematic",
"no people",
"futuristic showroom"
]
}

Ferrari – "F8 Tributo Reveal"

{
"description": "The Ferrari F8 Tributo is revealed inside a dark red chamber as silky red
cloth flows across the air and wraps around the invisible shape of a car. The fabric
disintegrates mid-air, revealing the polished Ferrari underneath, surrounded by floating
spotlight drones that activate as the logo appears.",
"product_name": "Ferrari F8 Tributo",
"product_type": "Luxury Sports Car",
"brand_style": "sleek, refined, elegant power",
"style": "elegant cinematic reveal",
"camera": "slow dolly-ins, sweeping camera from low angles, dramatic zoom on
curves",
"lighting": "deep red and gold accents, focused spotlights, soft floor reflections",
"location": "dimly lit red reveal chamber with glossy black floor",
"time_of_day": "undefined / ambient",
"atmosphere": "suspenseful, luxurious, silk particles in air",
"elements": [
"floating silk fabric wrapping invisible car silhouette",
"particles dispersing to reveal shiny Ferrari body",
"spotlight drones activating and circling"
],
"motion": "fabric flows and forms over car, vanishes to reveal full Ferrari",
"cta_motion": "Ferrari logo emerges in soft gold shimmer above the hood",
"ending": "F8 Tributo displayed center-stage with moving light accents and full-body
reveal",
"text": "none",
"keywords": [
"16:9",
"Ferrari",
"luxury",
"reveal animation",
"silk effect",
"elegant presentation",
"no people"
]
}

Sneaker – "Volt Runner X"

{
"description": "A large city plaza is shown at dusk. The ground rumbles and buildings
around begin to shift. Pieces of a giant sneaker start rising from underground,
assembling piece by piece in front of a mirrored skyscraper. Once fully formed, lights
streak across its sides, finishing with the brand logo beaming above.",
"product_name": "Volt Runner X",
"product_type": "Sneaker",
"brand_style": "Urban, bold, futuristic",
"style": "cinematic CGI building ad",
"camera": "wide-angle flyover, dynamic orbit, slow push-in on final form",
"lighting": "cool sunset tones with neon reflections",
"location": "urban plaza between high-rise buildings",
"time_of_day": "dusk",
"atmosphere": "urban ambient, light wind, subtle rumble",
"elements": [
"city pavement cracking open",
"giant sneaker parts rising with mechanical sounds",
"skyscraper lights syncing with product shape"
],
"motion": "sneaker parts rise from ground and lock together, building lights pulse in
sync",
"cta_motion": "brand logo beams from the top of the final product like a hologram",
"ending": "fully assembled sneaker towering above the plaza, glowing and reflecting
city lights",
"text": "none",
"keywords": [
"16:9",
"sneaker ad",
"building reveal",
"CGI outdoor animation",
"urban setting",
"no people",
"product formation"
]
}

Rolex Watch – "Rolex Skycraft" (CGI Tower Reveal)

{
"description": "A glass building façade begins to shift at night. Panels fold out, rotate, and pull
back to form a giant Rolex watch face embedded into the building. Gears rotate behind the
windows, and the city skyline reflects off the crystal-clear dial. Golden lights ripple around the
structure like time itself is ticking through the skyline.",
"product_name": "Rolex Skycraft",
"product_type": "Luxury Watch",
"brand_style": "timeless, prestigious, architectural",
"style": "luxury CGI architectural ad",
"camera": "dolly-up from street level, upward sweep to the skyline, dramatic pullback",
"lighting": "golden city lights with cool glass glows",
"location": "downtown cityscape, glass skyscraper",
"time_of_day": "night",
"atmosphere": "quiet prestige, wind ambiance, ticking sounds",
"elements": [
"building façade rotating to reveal giant Rolex watch",
"moving internal gears visible through windows",
"glowing outlines of hour marks forming from lights"
],
"motion": "building morphs into product from glass and steel plates, time pulses across scene",
"cta_motion": "Rolex logo illuminates across the top of the skyscraper like a crown",
"ending": "watch embedded in tower glass, shimmering with skyline reflections",
"text": "none",
"keywords": [
"16:9",
"Rolex",
"architectural CGI",
"watch reveal",
"city skyline",
"no people",
"night luxury"
]
}

SparklePop' soda

{ "description": "CGI close-up of a chilled, dewy drink can (e.g., 'SparklePop' soda) sitting on a
plain tabletop. As a hand reaches in and opens the tab, a vibrant, effervescent burst of liquid
and fruit particles explodes from the can's opening. This colorful burst expands rapidly,
transforming the dull tabletop and background into a lively, refreshing landscape filled with
oversized, stylized fruits (e.g., citrus slices, berries), shimmering ice cubes, and dynamic liquid
splashes. The scene is full of energy and refreshment. No text.", "style": "CGI, vibrant,
refreshing, dynamic, hyper-realistic", "camera": "starts ultra close on the can and opening tab,
pulls back and cranes overhead as the flavor burst expands, then orbits the transformed scene",
"lighting": "starts with soft ambient light, transitions to bright, high-key lighting that emphasizes
the vibrant colors and refreshment", "environment": "plain, mundane tabletop transforming into
an energetic, fruit-filled, icy landscape", "elements": [ "chilled drink can (brand clearly visible,
condensation prominent)", "realistic hand opening the tab", "effervescent liquid spray, foam, and
dynamic bubbles", "stylized, oversized fruit pieces (e.g., lemon, lime, orange, strawberry, mint
leaves) appearing and swirling", "glowing ice cubes forming and clinking", "liquid splashes and
ripples" ], "motion": "tab pop and rapid, explosive burst; fast-motion expansion of liquid and fruit
elements; swirling and settling motions", "ending": "the open can in the foreground, with the
vibrant, refreshing, and dynamic fruit/ice landscape in full glory behind it", "text": "none",
"keywords": ["CGI", "drink", "soda", "refreshment", "flavor", "explosion", "vibrant", "fruit", "ice",
"summer"] }

Luxury Car

{ "description": "Cinematic CGI shot starting with fragmented, glowing components of a luxury
electric car hovering in an abstract, futuristic void. Each component (e.g., individual body
panels, sleek interior elements, wheel assemblies) slowly rotates, then flawlessly glides and
snaps into perfect position, assembling the complete vehicle with an elegant, silent precision. As
the final piece locks, the car powers on, its headlights beaming, and the abstract void
seamlessly transitions into a pristine, winding mountain road at dawn, ready for adventure. No
text.", "style": "CGI, futuristic, luxurious, precise, minimalist", "camera": "starts with fragmented
components, smoothly tracks and orbits as they assemble, then a grand reveal shot of the
finished car on the road", "lighting": "starts with internal component glows, transitions to crisp,
dynamic studio lighting on the car, then natural, golden hour light on the road", "environment":
"abstract, dark void transforming into a breathtaking, realistic mountain road scene", "elements":
[ "glowing, disassembled car components (e.g., chassis, doors, seats, digital dashboard,
wheels)", "sleek, fully assembled luxury electric car (brand visible, e.g., 'AeroLux')", "dynamic
light trails following component assembly", "pristine asphalt road, mountain landscape, sunrise
sky" ], "motion": "slow, deliberate rotation of fragments; rapid, magnetic-like assembly; seamless
environmental transition; car powering on with light effects", "ending": "the car parked
majestically on the mountain road, headlights illuminating the dawn, with a sense of freedom
and readiness", "text": "none", "keywords": ["CGI", "car", "electric vehicle", "luxury", "assembly",
"futuristic", "precision", "seamless", "road trip", "innovation"] }

Headphones

{ "description": "Cinematic CGI shot of a person sitting in a bustling, chaotic urban environment.
They put on a pair of sleek, noise-canceling headphones (e.g., 'AuraFlow'). As the headphones
settle, the surrounding city noise doesn't just fade, but visually 'crystallizes' and shatters into
countless tiny, silent shards that drift away. Simultaneously, from the headphones themselves, a
vibrant, ethereal 3D soundscape (e.g., a serene forest, a cosmic nebula, a grand concert hall)
organically grows and envelops the person, replacing the urban chaos with a perfect,
personalized audio world. No text.", "style": "CGI, surreal, ethereal, immersive, dynamic",
"camera": "starts wide on the chaotic scene, zooms in as headphones are worn, then smoothly
pulls back and orbits to reveal the person enveloped in the soundscape", "lighting": "starts harsh
urban light, transitions to soft, ambient, and dynamic light emanating from the created
soundscape", "environment": "chaotic city street transforming into a pristine, abstract, and
immersive visual representation of perfect audio", "elements": [ "person (neutral expression
initially, then serene)", "sleek, high-end headphones (brand visible, subtle glow)", "stylized 'noise
shards' or visual distortion of city chaos", "vibrant, evolving 3D soundscape elements (e.g.,
glowing trees, swirling nebulae, orchestral architecture)", "subtle visual vibrations synchronized
with the generated 'music'" ], "motion": "shattering effect of chaos, organic growth and
expansion of soundscape, fluid camera movement through the new environment", "ending": "the
person perfectly serene within their custom soundscape, with the headphones subtly
shimmering, as if creating the entire world around them", "text": "none", "keywords": ["CGI",
"headphones", "soundscape", "noise canceling", "immersion", "audio quality", "surreal",
"transformation", "escape"] }

Tesla Unboxing Ad

{ "description": "Cinematic shot of a minimalist Tesla-branded crate magically opening to
reveal a fully formed Tesla vehicle and an instantly assembled, sleek Tesla-themed
showroom around it. No text.", "style": "cinematic", "camera": "fixed wide angle, with
subtle zooms on key transformations", "lighting": "controlled, high-tech, transitioning
from dim to bright and clean", "room": "empty futuristic space transforming into a
minimalist Tesla showroom", "elements": [ "Tesla-branded crate (glowing seams)",
"Tesla vehicle (e.g., Model 3/Y/Cybertruck)", "charging station", "minimalist display
panels", "sleek showroom furniture", "ambient lighting elements" ], "motion": "crate
panels retract smoothly and silently, car revealed, showroom elements rise/unfold
precisely and rapidly", "ending": "pristine, inviting Tesla showroom with car as
centerpiece", "text": "none", "keywords": [ "16:9", "Tesla", "magic assembly",
"showroom", "innovation", "futuristic", "no text", "clean design", "reveal" ] }

Final Thoughts & Next Steps

Congratulations — you now hold in your hands one of the most practical, creative, and actionable VEO 3 prompt resources available. 

Whether you're a solo creator, a studio experimenting with AI, or just someone curious about cinematic storytelling — you've learned how to:

  • Understand the structure of effective VEO 3 prompts
  • Maintain consistent characters and voices across videos
  • Apply professional camera movements and angles to elevate realism
  • Build engaging sequences using text-to-video formulas
  • Recreate viral formats like ASMR, vlog-style storytelling, and cinematic scenes
  • Develop multi-character interactions and dynamic camera-driven storytelling

What Makes a Prompt Go Viral?

It's not just about cool visuals — it's about emotion, timing, and surprise.

The best-performing prompts often:

  • Tell a story in 8 seconds or less
  • Feature relatable or absurd characters
  • Use dynamic motion to hook the viewer
  • End with an unexpected line or moment
  • Feel like a “real” video someone could’ve filmed

Your Next Creative Move

This guide is just the beginning. Now it's your turn to:

  • Remix the prompts
  • Experiment with new camera angles
  • Turn your ideas into visual stories
  • Share your VEO 3 creations with your audience
  • Package and sell your own prompt packs or animations

Whether you’re making memes, storytelling shorts, or professional-grade cinematics — the tools are in your hands.

Be part of the future With Me.

Need any projects plan? Please contact us soon!

Habibur Rahman

I'm Habibur Rahman

I am Habibur Rahman, a passionate SEO professional. I have been working in the SEO field for more than 8+ years. I love doing SEO because it is a challenging and interesting job. I always try to learn new things about SEO so that I can improve my skills.

0 responses to "The Ultimate VEO 3 Prompt Playbook: High-Impact Prompt Formulas for Viral Videos!"


Leave a Reply

Your email address will not be published. Required fields are marked *