Skip to main content
All CollectionsEditing Features
How to write effective text prompts to generate AI videos?
How to write effective text prompts to generate AI videos?
Michael avatar
Written by Michael
Updated this week

A well-crafted prompt is key to dictating the content of the video produced by an AI model, whether you're generating AI videos from text or images. In this article, we present a formula for writing AI video generation prompts that can help you achieve optimal video output.

Structures of AI Video Generation Prompts

For Text to Video:

Prompt = Subject + Action + Scene + (Camera Language + Lighting + Style)

Subject: What or who is the focus of the video? It can be people, animals, plants, objects, etc. To clarify, the subject should be described in detail, covering elements like:

  • Appearance (e.g., athletic performance, hairstyle, clothing, accessories)

  • Facial features, expressions, and emotions

  • Body postures...

Action: What is the subject doing? This is the core of your prompt, as it drives the video’s storyline. Ensure that the action is clear and concise, as the prompt is designed for a 5-10 second video.

Scene: Where is the action taking place? This includes the foreground, background, and any other elements that set the scene.

Camera Language: Refers to the type of camera shot, angle, and movement that adds to the narrative and visual appeal. Use camera techniques like:

  • Close-up, wide shot

  • Low-angle and high-angle shots

  • Aerial views, depth of field

Lighting: The lighting in the video can significantly impact its mood and depth. Descriptions of lighting should enhance the atmosphere and emotion of the video, such as warm light, morning light, spotlight on the subject, and backlighting.

Style: Setting the tone and style of the video. This can include visual style, emotional tone, and overall mood.

For Image to Video:

Prompt = Subject + Action + Background + Background Movement

Subject: Just like in Text-to-Video, the subject represents the main focus, and its appearance should be described in detail.

Action: Describes the motion of the subject within the scene. Since it's an image-to-video prompt, you might describe the subtle movement that turns a still image into a short, dynamic video.

Background: Describes the surrounding environment, which can help create a more immersive scene.

Background Movement: Refers to the dynamic elements or subtle shifts in the environment that help bring the scene to life.

High-Quality Video Examples

Text to Video

Prompt 1: "A futuristic flying car soaring above a modern city skyline during sunset. The car has a sleek, aerodynamic design with glowing headlights and is surrounded by tall skyscrapers. The sky is filled with warm golden light."

Prompt 2: "Medium close-up shot, a fluffy Pomeranian stands on a park bench, wearing a checkered bow tie and a checkered shirt, curiously looking around. The warm, golden sunlight filters through the trees, casting a soft glow on the dog's fur."

Prompt 3: "A stylish brown leather messenger bag with a flap and buckle details, placed on a glass surface. The bag gleams under diffused lighting. The cityscape is visible behind the glass. The camera moves from top to bottom to show details."

Image to Video

Prompt 1: "A cute, fluffy kitten wearing a navy blue captain's hat, steering a wooden boat on sparkling blue ocean waters."

Prompt 2: "A motorcyclist in black gear rides a sleek orange and white motorcycle along a winding road through a picturesque autumn landscape."

Prompt 3: "The glass perfume bottle floats on the water, surrounded by blooming daisies, gently swaying with the waves."

Prompt 4: "A skier with orange ski suit gliding down a snowy mountain slope, with snow spraying around."

Prompt 5: "Two horses are drinking water and eating grass beside a clear lake. Beside the lake are lush green grass and majestic snow-capped mountains. Clouds are rolling and gradually covering the snow-capped mountains"

Tips for Effective Prompts

  1. Use simple words and sentence structures. Avoid overly complex or abstract language. Simple and concise prompts tend to yield the most accurate results. Break down your prompt into smaller chunks to help AI better understand the task.

  2. Keep the visual content simple. Simple scenes or actions are easier for the AI to interpret and generate.

  3. Movement should follow physical principles. It's best to describe movements that are likely to occur in the scene.

  4. Avoid specifying exact numbers in your prompts. AI models may struggle with numerical consistency.

  5. Use cultural keywords for specific styles. Incorporate cultural terms like "Oriental mood," "Chinese," or "Mediterranean" if you're aiming for a particular aesthetic or cultural theme.

  6. Use split-screen scenarios effectively. For split-screen videos, be specific in describing the scenes in each section.

  7. Professional mode will help you get better results. If your video involves characters, priority to choose professional mode.

By mastering the art of writing effective video prompts, you can significantly improve the quality and relevance of the AI-generated content. Whether you are working with text-to-video or image-to-video prompts, following these guidelines and examples will help you get the results you're looking for.

Did this answer your question?