Wan 2.7 Image
Model: Fixed to wan-2-7-image.
Prompt: Chinese/English supported, up to 5000 characters.
Input: Optional input_urls (up to 9 images). If input images are provided, aspect ratio is hidden and not sent.
Aspect ratio (no input image): 1:1, 16:9, 4:3, 21:9, 3:4, 9:16, 8:1, 1:8.
Sequential mode: enable_sequential false by default. When false, n range is 1–4 (default 4). When true, n range is 1–12 (default 12).
Resolution: 1K or 2K.
Thinking mode: Available only when enable_sequential=false and no input_urls.
Advanced controls: optional color_palette (3–10 colors, only when non-sequential), optional bbox_list for interactive edit regions, watermark toggle, seed 0–2147483647.
Use cases: poster generation, product visual variants, style-consistent batch assets, guided local edits.
Wan 2.7 Image Pro
Model: Fixed to wan-2-7-image-pro.
Same controls as Wan 2.7 Image, with extended resolution support: 1K / 2K / 4K.
4K constraint: valid only when no input image and sequential mode is disabled.
Use cases: high-resolution campaign key visuals, premium product renders, print-friendly hero assets.
Text to Video
Prompt: Natural language in Chinese or English, 1–5000 characters.
Duration: 5 seconds (quick loops), 10 seconds (standard clips), or 15 seconds (extended scenes).
Resolution: 720p (fast, web-optimized) or 1080p (Full HD, higher detail).
Shot composition: Single continuous shot for simplicity, or multi-shot with transitions for narrative flow.
Use cases: Concept visualization, ad creatives, social media content, storyboarding.
Image to Video
Input: Image URL(s) required; minimum 256×256px; formats JPEG, PNG, WebP; max 10MB per image.
Prompt: Describe the animation, motion, or scene evolution (1–5000 characters).
Duration: 5, 10, or 15 seconds.
Resolution: 720p or 1080p.
Shot composition: Same multi-shot controls as text-to-video.
Use cases: Animating illustrations, product demos, character movement, bringing concept art to life.
Video to Video
Input: Video URL required; formats MP4, MOV, MKV; max 10MB.
Prompt: Describe the transformation—style transfer, content change, mood shift, etc. (1–5000 characters).
Duration: 5 or 10 seconds (output length may differ from input).
Resolution: 720p or 1080p.
Shot composition: Single or multi-shot, with ability to reinterpret original footage.
Use cases: Restyling existing content, adapting videos for different platforms, creative remixing, consistent brand video generation.