Learn about the billing rules and pricing for batch video production features, including text generation, script-to-video, image-text matching, and highlight extraction.
To use these features, you must first purchase an IMS subscription. For details, see Subscription.
Billing
- Rules:
  - Charges are calculated based on the total duration of input and output videos. Durations are rounded up to the nearest minute. Any duration less than 1 minute is billed as 1 minute. No charges are incurred for failed production tasks.
  - Intelligent text generation is billed based on the number of tokens consumed. Token counts are rounded up to the nearest thousand. Any usage under 1,000 tokens is billed as 1,000 tokens. No charges are incurred for failed generation tasks.
- Billing cycle: Bills are generated hourly. Alibaba Cloud measures your service usage from the previous billing cycle and issues a bill in the next one. The exact billing time is subject to system processing.
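The rounding rules above can be sketched in a few lines of Python. This is an illustrative sketch only, not an official billing implementation: the function names `billable_minutes` and `billable_token_units` are hypothetical, and the code assumes that rounding is applied to the usage passed in (the page does not state whether rounding happens per task or per billing cycle) and that only successful tasks with positive usage are passed to it.

```python
import math


def billable_minutes(total_seconds: float) -> int:
    """Round a positive duration in seconds up to whole minutes.

    Any positive duration under 60 seconds becomes 1 minute.
    """
    if total_seconds <= 0:
        raise ValueError("duration must be positive")
    return math.ceil(total_seconds / 60)


def billable_token_units(tokens: int) -> int:
    """Round a positive token count up to whole units of 1,000 tokens.

    Any positive count under 1,000 becomes 1 unit.
    """
    if tokens <= 0:
        raise ValueError("token count must be positive")
    # Integer ceiling division avoids floating-point rounding.
    return -(-tokens // 1000)
```

For example, `billable_minutes(59)` and `billable_minutes(60)` both give 1, `billable_minutes(61)` gives 2, and `billable_token_units(1001)` gives 2.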
| Feature | Billing method | Unit price | Unit | Documentation |
| --- | --- | --- | --- | --- |
| Intelligent Text Generation | Billed by the number of tokens | 0.12 | CNY per 1,000 tokens | |
| Script-to-Video | | Same as video editing | Same as video editing | |
| Image-Text Matching (Common Scenarios) | | 0.3 | CNY per minute | |
| Image-Text Matching (Movie Collections) | | 1 | CNY per minute | |
| Sports highlights | | 1 | CNY per minute | |
| Highlight Mashup | | 2 | CNY per minute | |
| Highlight extraction | Billed by the input video duration. | 2 | CNY per minute | |
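As a rough illustration of how the per-minute prices combine with the billing basis, the following Python sketch looks up a unit price and computes a charge. The names `PRICE_PER_MINUTE`, `INPUT_ONLY`, and `video_cost` are hypothetical. The sketch assumes that features whose billing method cell is blank follow the general rule (input plus output duration), and it omits Script-to-Video because its price is defined by video editing pricing, which this page does not list.

```python
import math
from decimal import Decimal

# Unit prices in CNY per minute, copied from the pricing table.
PRICE_PER_MINUTE = {
    "Image-Text Matching (Common Scenarios)": Decimal("0.3"),
    "Image-Text Matching (Movie Collections)": Decimal("1"),
    "Sports highlights": Decimal("1"),
    "Highlight Mashup": Decimal("2"),
    "Highlight extraction": Decimal("2"),
}

# Features the table lists as billed by input video duration only.
INPUT_ONLY = {"Highlight extraction"}


def video_cost(feature: str, input_seconds: float, output_seconds: float = 0) -> Decimal:
    """Return the charge in CNY for one successful task (assumed granularity)."""
    if feature in INPUT_ONLY:
        billable_seconds = input_seconds
    else:
        # Assumption: blank billing-method cells mean input + output duration.
        billable_seconds = input_seconds + output_seconds
    minutes = math.ceil(billable_seconds / 60)  # round up to whole minutes
    return PRICE_PER_MINUTE[feature] * minutes
```

Using `Decimal` for prices keeps amounts such as 0.3 exact, which plain floats would not.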
Billing examples
Assume that between 8:00 and 9:00, you use the Image-Text Matching (Common Scenarios) feature in the Chinese mainland region. You provide a 90-second input video and produce a 23-second output video. You also use the Intelligent Text Generation feature, consuming 900 tokens.
The total cost is calculated as follows:
- Video production cost:
  - Total billable duration = Input duration + Output duration = 90 s + 23 s = 113 s ≈ 1.88 minutes.
  - Rounded up to the nearest minute, the total duration is 2 minutes.
  - Cost = 0.3 CNY per minute × 2 minutes = 0.6 CNY.
- Text generation cost:
  - Total tokens consumed = 900 tokens.
  - Rounded up to the nearest thousand, the total is 1,000 tokens.
  - Cost = 0.12 CNY per 1,000 tokens × 1 = 0.12 CNY.
- Total cost:
  - The total cost for batch video production between 8:00 and 9:00 is 0.6 + 0.12 = 0.72 CNY.
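The arithmetic in this example can be checked with a short Python script. It is a sketch that hard-codes the figures from the example; the variable names are illustrative and not part of any IMS API.

```python
from decimal import Decimal

# Usage figures from the example.
input_seconds, output_seconds, tokens = 90, 23, 900

# Ceiling division: 113 s rounds up to 2 minutes, 900 tokens to 1 unit of 1,000.
minutes = -(-(input_seconds + output_seconds) // 60)
token_units = -(-tokens // 1000)

video_cost = Decimal("0.3") * minutes        # Image-Text Matching (Common Scenarios)
text_cost = Decimal("0.12") * token_units    # Intelligent Text Generation
total = video_cost + text_cost

print(total)  # 0.72
```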