Prompt
602 wordsStyle: 8K cinematic texture. Real photography—not 3D rendering, not a game engine, no game cinematic feel. Photography: Master-level naturalistic photography. Lighting: Natural light only—backlit rim light, camera on the shadow side, thin fog in the air. Key light only from skylight and window light. Color: 60:30:10—Primary/Secondary/Accent. Lens: Physical cinematic lens. 180-degree shutter motion blur. Skin: Pore-level realism—fine hairs, asymmetric moles, capillary redness, pore shadows matching the scene lighting. Performance: Top-tier cinematic—micro-pauses before reactions, precise gaze, moist and bright eyes with catchlights, visible breathing and chest rise. Physics: Obey gravity and inertia—mass has real weight, contact shadows are correct, no floating props. Composition: Rule of thirds + golden ratio. Everyone is moving from the first frame. Consistency: Character, props, and environment completely consistent across every shot, no identity drift. Technical: 24fps smooth motion. 8K detail. No jitter. Audio: Ambient sound effects only. No music. No subtitles. SUBJECT: The delivery person is the individual from the uploaded first frame: high bun hairstyle, yellow and black color-blocked jacket with a light khaki vest and gray sweatpants, holding a black insulated delivery box in the right hand. The video continues the action directly from this first frame, with appearance, clothing, hairstyle, and box consistent throughout. WB 5200K. MULTISHOT—greeting at the door, entering the house, putting down the delivery, and leaving with a shy, playful backward glance. Chinese dialogue, mouth-sync enabled. LOCATION: Modern apartment elevator hallway with beige marble and warm lights, transitioning inward to a tidy entryway. The video naturally extends from the hallway space in the first frame into the interior. ACTION: POV first-person: The camera is 'you' (the customer) opening the door and watching her enter. The video begins moving from the first frame (her standing in the hallway looking at the camera). SHOT 1 (0:00–0:05): Continues from the first frame: The delivery person looks up directly at the camera with a warm slight smile, box in right hand. She says: 'Hello, your delivery is here.' Off-screen male voice (customer/camera) responds: 'Okay, help me bring it in.' Hard cut. SHOT 2 (0:05–0:10): Camera pulls back slightly as she enters, crossing the threshold into the entryway. She bends over to place the box by the cabinet, takes out the meal containers and arranges them neatly, then stands up and straightens the jacket hem. Off-screen voice: 'Thank you.' She bows her head slightly, cheeks blushing, a shy pause, then says: 'You're welcome, enjoy your meal.' Hard cut. SHOT 3 (0:10–0:15): She turns to leave, then stops, looks back over her shoulder at the camera, eyes crinkling, fingers lightly plucking the backpack strap. A playful micro-pause, then she says: 'Would you like me to stay and eat with you?' Freezes on her over-the-shoulder smile, softened by warm light. CAMERA: SHOT 1: POV eye-level, locked position at the door, 35mm feel, slight handheld breathing; Motive—initial encounter. SHOT 2: POV chest-high slow pull-back following her inside, 28mm, fine handheld shake; Motive—making space for her entry. SHOT 3: POV push-in to 50mm mid-close-up on her face as she turns; Motive—playful hook. STYLE: Primary warm beige marble/hallway tungsten light 60% / Secondary cold gray pants and black box 30% / Accent yellow jacket 10%. WB 5200K. Main light from hallway ceiling and off-screen interior lights, light fog. CONSTRAINTS: 9:16 vertical screen. No slow motion, natural 24fps throughout. Video strictly follows the uploaded first frame without altering character appearance or composition. POV remains first-person; camera does not show customer's body (implied hands at most). Delivery person's identity, clothes, bun, and box are identical across three shots. Natural and appropriate framing. Three Chinese lines with clean mouth-sync. No glowing eyes, no burnt-in subtitles.
About this prompt
Cinematic POV Delivery Scene is a Short Drama Seedance 2.0 prompt with 602 words, structured around shot timing, camera direction, motion cues, and remixable scene details.
Use the preview clip when available to understand pacing, then treat the prompt text as a production brief you can adapt for your own subject, brand, or story.
Video prompt structure
Use it for Short Drama videos
Best for Seedance 2.0 generations that need Short Drama, Dialogue, Image Reference. Replace the subject, setting, product, dialogue, music, and timing before sending it to the generator.
Customize before generating
- Swap the subject. Keep the camera and timing structure, but change the character, product, or environment.
- Tighten the timeline. Shorten or expand shot blocks so they match the duration you want to render.
- Add reference media. Use first frames, last frames, images, video, or audio when the prompt depends on a specific look or rhythm.
More Seedance paths to explore
Browse adjacent styles and techniques from the same Seedance 2.0 prompt library.
How to use it
- Step 1
Copy or start from this prompt
Use the full prompt as a structured draft rather than rewriting the whole video brief.
- Step 2
Open Seedance 2.0
Send the text into the Seedance generator and choose text, image, or reference mode.
- Step 3
Adapt and render
Tune duration, aspect ratio, reference media, camera notes, and dialogue before generating.
Make it a Seedance video
Start from this prompt in the Seedance 2.0 generator, then adjust the scene, duration, aspect ratio, and reference media.


