|
| 1 | +# Prompting |
| 2 | + |
| 3 | +_Note_: Special thanks to contrinsan for providing this writeup! |
| 4 | + |
| 5 | +## Overview |
| 6 | + |
| 7 | +The key to prompting is to be specific without going over the token limit - include precise details about the scene, subject, style, and camera movements to ensure the AI accurately interprets your vision. Use the following guidelines to help compose a clear and coherent prompt that will get the results you want. |
| 8 | + |
| 9 | +- Subject: Clearly define the main character or object. |
| 10 | +- Scene/Environment: Describe the setting, including location, time of day, or weather. |
| 11 | +- Action: Specify what the subject is doing, using dynamic verbs for motion. |
| 12 | +- Style: Indicate the visual style (e.g., cinematic, anime, photorealistic). |
| 13 | +- Atmosphere/Mood: Convey the emotional tone (e.g., serene, dramatic, eerie). |
| 14 | +- Camera Movements: Include specific camera instructions (e.g., zoom in, pan left, tracking shot). |
| 15 | +- Lighting: Describe lighting conditions (e.g., soft sunlight, neon glow). |
| 16 | +- Shot Size: Specify framing (e.g., wide shot, close-up, medium shot). |
| 17 | +- Verbs: Employ dynamic verbs like "running," "zooming," or "tilting" to guide the model in creating motion. |
| 18 | +- Incorporate Metadata: Add tags like "hdr," "360-degree," or "fisheye" for specific image types. |
| 19 | +- Model-Specific Considerations: Tailor prompts to the strengths of the model. For Hunyuan, use highly descriptive prompts with cinematic terminology (e.g., "wide-angle view," "lens flare") and detailed environmental cues like weather or time of day. |
| 20 | +- Prompt Length: Aim for 60–100 words to provide sufficient context without overwhelming the model. |
| 21 | +- Logical Sequence: Organize the prompt to paint a clear picture, starting with the subject and scene, followed by actions, camera work, and stylistic details. |
| 22 | +- Avoid Overloading: Balance creativity with clarity to prevent confusing the AI. |
| 23 | + |
| 24 | +## Example |
| 25 | + |
| 26 | +### Prompt |
| 27 | + |
| 28 | +``` |
| 29 | +A lone samurai in traditional armor stands on a misty cliffside overlooking a lush valley at dawn. |
| 30 | +He unsheathes his katana, performing a slow, deliberate sword dance. |
| 31 | +The scene is cinematic, with a serene yet intense atmosphere. |
| 32 | +The camera starts with a wide-angle shot, slowly zooming in to a medium shot of the samurai’s focused expression. |
| 33 | +Soft golden sunlight filters through the mist, casting gentle shadows. |
| 34 | +The video is high-definition, emphasizing realistic textures and fluid motion. |
| 35 | +``` |
| 36 | + |
| 37 | +### Explanation |
| 38 | + |
| 39 | +- Subject: "A lone samurai in traditional armor" clearly defines the main character. |
| 40 | +- Scene/Environment: "Stands on a misty cliffside overlooking a lush valley at dawn" sets a vivid location and time of day. |
| 41 | +- Action: "Unsheathes his katana, performing a slow, deliberate sword dance" uses dynamic verbs to describe the subject’s movement. |
| 42 | +- Style: "Cinematic" specifies a professional, film-like visual style. |
| 43 | +- Atmosphere/Mood: "Serene yet intense" conveys the emotional tone. |
| 44 | +- Camera Movements: "The camera starts with a wide-angle shot, slowly zooming in to a medium shot" provides specific camera instructions. |
| 45 | +- Lighting: "Soft golden sunlight filters through the mist, casting gentle shadows" details the lighting conditions. |
| 46 | +- Shot Size: "Wide-angle shot" and "medium shot" clarify the framing. |
| 47 | +- Metadata: "High-definition, emphasizing realistic textures and fluid motion" adds precision for quality output. |
| 48 | +- Word Count: The prompt is 73 words, fitting the recommended 60–100 word range for sufficient detail without overloading. |
0 commit comments