feat: add prompting guide

ayan4m1 · colinurbs · commit b7af510eb4d5 · 2025-06-29T15:09:04.000-04:00
diff --git a/docs/prompting.md b/docs/prompting.md
@@ -0,0 +1,48 @@
+# Prompting
+
+_Note_: Special thanks to contrinsan for providing this writeup!
+
+## Overview
+
+The key to prompting is to be specific without going over the token limit - include precise details about the scene, subject, style, and camera movements to ensure the AI accurately interprets your vision. Use the following guidelines to help compose a clear and coherent prompt that will get the results you want.
+
+- Subject: Clearly define the main character or object.
+- Scene/Environment: Describe the setting, including location, time of day, or weather.
+- Action: Specify what the subject is doing, using dynamic verbs for motion.
+- Style: Indicate the visual style (e.g., cinematic, anime, photorealistic).
+- Atmosphere/Mood: Convey the emotional tone (e.g., serene, dramatic, eerie).
+- Camera Movements: Include specific camera instructions (e.g., zoom in, pan left, tracking shot).
+- Lighting: Describe lighting conditions (e.g., soft sunlight, neon glow).
+- Shot Size: Specify framing (e.g., wide shot, close-up, medium shot).
+- Verbs: Employ dynamic verbs like "running," "zooming," or "tilting" to guide the model in creating motion.
+- Incorporate Metadata: Add tags like "hdr," "360-degree," or "fisheye" for specific image types.
+- Model-Specific Considerations: Tailor prompts to the strengths of the model. For Hunyuan, use highly descriptive prompts with cinematic terminology (e.g., "wide-angle view," "lens flare") and detailed environmental cues like weather or time of day.
+- Prompt Length: Aim for 60–100 words to provide sufficient context without overwhelming the model.
+- Logical Sequence: Organize the prompt to paint a clear picture, starting with the subject and scene, followed by actions, camera work, and stylistic details.
+- Avoid Overloading: Balance creativity with clarity to prevent confusing the AI.
+
+## Example
+
+### Prompt
+
+```
+A lone samurai in traditional armor stands on a misty cliffside overlooking a lush valley at dawn.
+He unsheathes his katana, performing a slow, deliberate sword dance.
+The scene is cinematic, with a serene yet intense atmosphere.
+The camera starts with a wide-angle shot, slowly zooming in to a medium shot of the samurai’s focused expression.
+Soft golden sunlight filters through the mist, casting gentle shadows.
+The video is high-definition, emphasizing realistic textures and fluid motion.
+```
+
+### Explanation
+
+- Subject: "A lone samurai in traditional armor" clearly defines the main character.
+- Scene/Environment: "Stands on a misty cliffside overlooking a lush valley at dawn" sets a vivid location and time of day.
+- Action: "Unsheathes his katana, performing a slow, deliberate sword dance" uses dynamic verbs to describe the subject’s movement.
+- Style: "Cinematic" specifies a professional, film-like visual style.
+- Atmosphere/Mood: "Serene yet intense" conveys the emotional tone.
+- Camera Movements: "The camera starts with a wide-angle shot, slowly zooming in to a medium shot" provides specific camera instructions.
+- Lighting: "Soft golden sunlight filters through the mist, casting gentle shadows" details the lighting conditions.
+- Shot Size: "Wide-angle shot" and "medium shot" clarify the framing.
+- Metadata: "High-definition, emphasizing realistic textures and fluid motion" adds precision for quality output.
+- Word Count: The prompt is 73 words, fitting the recommended 60–100 word range for sufficient detail without overloading.