04
WAN Image
Generate Character Images
For each character, generate images repeatedly until you get a look you are happy with. This is an iterative process.
- Use the character descriptions from Step 2 as your image prompts
- Generate multiple variations of each character — try 3–5 generations per character
- Tweak the prompt each time: adjust lighting, expression, angle, or style keywords
- Save the best image for each character — this becomes your character reference
Don't settle on the first result. Small prompt changes (adding "cinematic lighting", "portrait style", "full body shot") can dramatically change the output.
05
WAN Image
Generate Scene Images
Now generate the background or full scene image for each scene in your story.
- Use the scene descriptions from Step 3 as your prompts
- Generate multiple versions per scene until the composition feels right
- Pay attention to how the scene image will frame the characters
- Save the best scene image for each story beat
Scene images set the mood. If a scene is meant to feel tense, try adding keywords like "dramatic shadows", "low key lighting", "stormy sky".
06
WAN Video
Generate Scene Videos
Combine everything: use each scene image as the starting frame, add the dialogue for that scene, and attach the character reference image so WAN knows who is speaking.
- For each scene, provide WAN Video with: the scene image (from Step 5), the dialogue/narration text to be spoken, and the character reference image (from Step 4)
- Generate each scene video one at a time
- Review the output — check lip-sync, character consistency, and timing
- Assemble the scene videos in story order to create your final piece
If a character doesn't look consistent across scenes, go back to Step 4 and refine the character reference image. A strong reference image is the key to visual continuity.