Wan

Wan

Wan is Alibaba's multimodal AI generation suite. In video mode, it provides complete T2V, I2V, and V2V workflows with strong prompt adherence, controllable quality, and production-ready outputs.

Text to Video
Image to Video
Video to Video
v2.7 Text to Video
v2.7 Image to Video
v2.7 Video Edit
v2.7 R2V

Configuration

Required: 3–5000 characters (Chinese or English). Use negative prompt to suppress unwanted artifacts.
Click to upload first clip (MP4, MOV)
At least one of first frame, last frame, first clip, or driving audio is required.
Click to upload driving audio (WAV, MP3)
5s

No video generated yet

Fill in the form and click "Generate Video" to start.