Choose the right model for text-to-video, image-to-video, and video editing.
Text-to-video
Generate videos with audio from text prompts. We recommend happyhorse-1.1-t2v. It supports 1080P resolution and clips up to 15 seconds long.
Custom audio file input
If you need to provide a custom audio file such as narration or background music, use wan2.7-t2v-2026-04-25.
Image-to-video
Create dynamic videos from static images. For first-frame image-to-video, use happyhorse-1.1-i2v. For first/last-frame stitching, use wan2.7-i2v-2026-04-25.
First-frame image-to-video
Generate video from a single image. We recommend happyhorse-1.1-i2v, which supports audio, 1080P, and 3 to 15 seconds. If you need to provide a custom audio file, use wan2.7-i2v-2026-04-25.
First/last-frame stitching for long videos
Use a first/last-frame model such as wan2.7-i2v-2026-04-25 to stitch multiple clips together. Setting the last frame of one clip as the first frame of the next creates seamless transitions, ideal for narratives, product demos, or tutorials.
Reference image-to-video
Maintain character consistency across scenes using reference images. We recommend happyhorse-1.1-r2v. If you need to provide a custom audio file to define the voice, or use video as a reference subject, use wan2.7-r2v.
Video editing
Edit existing videos using text instructions for style transfer, element replacement, and other operations. We recommend happyhorse-1.0-video-edit. For effect replication or camera movement replication, use wan2.7-videoedit.
Character animation
Motion-driven character animation
To transfer motion from a reference video to a character in a static image, use wan2.2-animate-move. The background remains unchanged. The pro mode (wan-pro) produces results closer to live-action footage, while the standard mode (wan-std) is faster and more cost-effective.
Character replacement in video
To replace a character in a video with one from a source image, use wan2.2-animate-mix. It also supports pro and standard modes.
Recommended models
|
Model ID |
Use case |
Max resolution |
Max duration |
|
|
Text-to-video |
720P, 1080P |
3–15s |
|
|
Text-to-video, custom audio file |
720P, 1080P |
2–15s |
|
|
First-frame image-to-video |
720P, 1080P |
3–15s |
|
|
First-frame, first/last-frame, video continuation |
720P, 1080P |
2–15s |
|
|
Reference image-to-video |
720P, 1080P |
3–15s |
|
|
Reference image and video-to-video |
720P, 1080P |
2–10s |
|
|
Video editing |
720P, 1080P |
3–15s |
|
|
Video editing, effect replication, camera movement replication |
720P, 1080P |
2–10s |
|
|
Transfer motion to a static character |
720P |
2–30s |
|
|
Replace a character in a video |
720P |
2–30s |
All models
HappyHorse 1.1
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specs |
|
|
Text-to-video |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
|
|
First-frame image-to-video |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
|
|
Reference image-to-video |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
HappyHorse 1.0
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specs |
|
|
Text-to-video |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
|
|
First-frame image-to-video |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
|
|
Reference image-to-video |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
|
|
Video editing |
Audio |
720P, 1080P. 3–15s. 24 fps, MP4 |
Wan 2.7
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specifications |
|
|
text-to-video |
Audio sync, multi-shot narrative |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
text-to-video |
Audio sync, multi-shot narrative |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
image-to-video |
First frame, first/last frame, video continuation, audio-driven |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
image-to-video |
First frame, first/last frame, video continuation, audio-driven |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
video reference |
Multi-character, ImageN/VideoN reference format |
720P, 1080P. 2–10s. 30 fps, MP4 |
|
|
video editing |
Instruction-based editing, style transfer |
720P, 1080P. 2–10s. 30 fps, MP4 |
Wan 2.6
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specifications |
|
|
text-to-video |
Audio sync, multi-shot narrative |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
image-to-video |
Audio sync, multi-shot narrative |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
image-to-video |
Audio, multi-shot, fast generation |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
video reference |
Audio sync, multi-character, narrative |
720P, 1080P. 2–10s. 30 fps, MP4 |
|
|
video reference |
Multi-character, fast generation |
720P, 1080P. 2–10s. 30 fps, MP4 |
|
|
text-to-video |
Audio sync, multi-shot narrative; for US deployment scope |
720P, 1080P. 2–15s. 30 fps, MP4 |
|
|
image-to-video |
Audio sync, multi-shot narrative; for US deployment scope |
720P, 1080P. 2–15s. 30 fps, MP4 |
Wan 2.5
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specifications |
|
|
text-to-video |
Audio sync |
480P, 720P, 1080P. 5s, 10s. 30 fps, MP4 |
|
|
image-to-video |
Audio sync |
480P, 720P, 1080P. 5s, 10s. 30 fps, MP4 |
Wan 2.2
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specifications |
|
|
text-to-video |
No audio |
480P, 1080P. 5s. 30 fps, MP4 |
|
|
image-to-video |
No audio |
480P, 1080P. 5s. 30 fps, MP4 |
|
|
image-to-video |
No audio, 50% faster than 2.1 |
480P, 720P, 1080P. 5s. 30 fps, MP4 |
|
|
first/last frame |
No audio |
480P, 720P, 1080P. 5s. 30 fps, MP4 |
|
|
character animation |
wan-std / wan-pro modes |
720P. 2–30s. 15/25 fps. MP4 |
|
|
character replacement |
wan-std / wan-pro modes |
720P. 2–30s. 15/25 fps. MP4 |
Wan 2.1 (Wan 2.7 is recommended)
The following models are available in Chinese mainland and international deployment scopes.
|
Model ID |
Type |
Features |
Output specifications |
|
|
text-to-video |
No audio |
720P. 5s. 30 fps, MP4 |
|
|
text-to-video |
No audio |
480P, 720P. 5s. 30 fps, MP4 |
|
|
image-to-video |
No audio |
720P. 5s. 30 fps, MP4 |
|
|
image-to-video |
No audio |
480P, 720P. 3–5s. 30 fps, MP4 |
|
|
first/last frame |
No audio |
720P. 5s. 30 fps, MP4 |
|
|
video editing |
No audio |
720P. Max 5s. 30 fps, MP4 |