Tiered pricing
Some Model Studio models use tiered pricing. The unit price is determined by the total number of input tokens in a single request. All tokens in that request are billed at the corresponding tier's unit price.
For example, a model might have two pricing tiers: 0 < tokens ≤ 32K and 32K < tokens ≤ 128K. If a request contains 100K input tokens, it falls into the second tier (32K < 100K ≤ 128K), and all tokens are billed at that tier's unit price.
Text generation: Qwen
Qwen-Max
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Mode | Input tokens | Input price (per million tokens) | Output price (per million tokens) Chain of thought and answer | Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3.7-max Alias for qwen3.7-max-2026-05-20 50% batch inference discount context caching discount | Chinese mainland | non-thinking and thinking modes | 0<token≤1M | CNY 12 | CNY 36 | 1 million tokens |
qwen3.7-max-2026-06-08 | Chinese mainland | non-thinking and thinking modes | 0<token≤1M | CNY 12 | CNY 36 | 1 million tokens |
qwen3.7-max-2026-05-20 | Chinese mainland | non-thinking and thinking modes | 0<token≤1M | CNY 12 | CNY 36 | 1 million tokens |
qwen3.7-max-preview Alias for qwen3.7-max-2026-05-17 | Chinese mainland | thinking mode only | 0<token≤1M | CNY 12 | CNY 36 | 1 million tokens |
qwen3.7-max-2026-05-17 | Chinese mainland | thinking mode only | 0<token≤1M | CNY 12 | CNY 36 | 1 million tokens |
qwen3.6-max-preview context caching discount | Chinese mainland | non-thinking and thinking modes | 0<token≤128K | CNY 9 | CNY 54 | 1 million tokens |
128K<token≤256K | CNY 15 | CNY 90 | ||||
qwen3-max Alias for qwen3-max-2026-01-23 50% batch inference discount context caching discount | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 2.5 | CNY 10 | 1 million tokens |
32K<token≤128K | CNY 4 | CNY 16 | ||||
128K<token≤256K | CNY 7 | CNY 28 | ||||
qwen3-max-2026-01-23 | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 2.5 | CNY 10 | 1 million tokens |
32K<token≤128K | CNY 4 | CNY 16 | ||||
128K<token≤256K | CNY 7 | CNY 28 | ||||
qwen3-max-2025-09-23 | Chinese mainland | non-thinking mode only | 0<token≤32K | CNY 6 | CNY 24 | 1 million tokens |
32K<token≤128K | CNY 10 | CNY 40 | ||||
128K<token≤256K | CNY 15 | CNY 60 | ||||
qwen3-max-preview context caching discount | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 6 | CNY 24 | 1 million tokens |
32K<token≤128K | CNY 10 | CNY 40 | ||||
128K<token≤256K | CNY 15 | CNY 60 |
More models
Model ID | Deployment scope | Mode | Input tokens | Input price (per million tokens) | Output price (per million tokens) | Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-max 50% batch inference discount | Chinese mainland | non-thinking mode only | No tiered pricing | CNY 2.4 | CNY 9.6 | 1 million tokens |
US (Virginia)
Model id | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
qwen3.7-max Equivalent to qwen3.7-max-2026-05-20 Eligible for context caching discounts | Global | non-thinking and thinking modes | 0<token≤1M | CNY 12 | CNY 36 |
qwen3.7-max-2026-06-08 | Global | non-thinking and thinking modes | 0<token≤1M | CNY 12 | CNY 36 |
qwen3.7-max-2026-05-20 | Global | non-thinking and thinking modes | 0<token≤1M | CNY 12 | CNY 36 |
qwen3-max Equivalent to qwen3-max-2026-01-23 Eligible for context caching discounts | Global | non-thinking mode only | 0<token≤32K | CNY 2.5 | CNY 10 |
32K<token≤128K | CNY 4 | CNY 16 | |||
128K<token≤256K | CNY 7 | CNY 28 | |||
qwen3-max-2025-09-23 | Global | non-thinking mode only | 0<token≤32K | CNY 6 | CNY 24 |
32K<token≤128K | CNY 10 | CNY 40 | |||
128K<token≤256K | CNY 15 | CNY 60 | |||
qwen3-max-preview Eligible for context caching discounts | Global | non-thinking and thinking modes | 0<token≤32K | CNY 6 | CNY 24 |
32K<token≤128K | CNY 10 | CNY 40 | |||
128K<token≤256K | CNY 15 | CNY 60 |
Singapore
Model id | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
qwen3.7-max Currently maps to qwen3.7-max-2026-05-20 Discount for context caching | International | non-thinking and thinking modes | 0 < tokens ≤ 1M | CNY 18.736 | CNY 56.207 |
qwen3.7-max-2026-06-08 | International | non-thinking and thinking modes | 0 < tokens ≤ 1M | CNY 18.736 | CNY 56.207 |
qwen3.7-max-2026-05-20 | International | non-thinking and thinking modes | 0 < tokens ≤ 1M | CNY 18.736 | CNY 56.207 |
qwen3.6-max-preview Discount for context caching | International | non-thinking and thinking modes | 0 < tokens ≤ 128K | CNY 9.742 | CNY 58.455 |
128K < tokens ≤ 256K | CNY 14.988 | CNY 89.93 | |||
qwen3-max Currently maps to qwen3-max-2026-01-23 Discount for context caching | International | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 8.807 | CNY 44.035 |
32K < tokens ≤ 128K | CNY 17.614 | CNY 88.071 | |||
128K < tokens ≤ 256K | CNY 22.018 | CNY 110.089 | |||
qwen3-max-2026-01-23 | International | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 8.807 | CNY 44.035 |
32K < tokens ≤ 128K | CNY 17.614 | CNY 88.071 | |||
128K < tokens ≤ 256K | CNY 22.018 | CNY 110.089 | |||
qwen3-max-2025-09-23 | International | non-thinking mode only | 0 < tokens ≤ 32K | CNY 8.807 | CNY 44.035 |
32K < tokens ≤ 128K | CNY 17.614 | CNY 88.071 | |||
128K < tokens ≤ 256K | CNY 22.018 | CNY 110.089 | |||
qwen3-max-preview Discount for context caching | International | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 8.807 | CNY 44.035 |
32K < tokens ≤ 128K | CNY 17.614 | CNY 88.071 | |||
128K < tokens ≤ 256K | CNY 22.018 | CNY 110.089 |
More models
Model id | Deployment scope | Mode | Input tokens | Input price | Output price |
qwen-max 50% discount for batch inference | International | non-thinking mode only | no tiered pricing | CNY 11.743 | CNY 46.971 |
Germany (Frankfurt)
Model id | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
qwen3.7-max Equivalent to qwen3.7-max-2026-05-20 context caching qualifies for a discount | global | Non-thinking and thinking modes | 0<tokens≤1M | CNY 12 | CNY 36 |
qwen3.7-max-2026-06-08 | global | Non-thinking and thinking modes | 0<tokens≤1M | CNY 12 | CNY 36 |
qwen3.7-max-2026-05-20 | global | Non-thinking and thinking modes | 0<tokens≤1M | CNY 12 | CNY 36 |
qwen3-max Equivalent to qwen3-max-2026-01-23 context caching qualifies for a discount | global | Non-thinking mode only | 0<tokens≤32K | CNY 2.5 | CNY 10 |
32K<tokens≤128K | CNY 4 | CNY 16 | |||
128K<tokens≤256K | CNY 7 | CNY 28 | |||
qwen3-max Equivalent to qwen3-max-2026-01-23 | EU | Non-thinking and thinking modes | 0<tokens≤32K | CNY 8.993 | CNY 44.965 |
32K<tokens≤128K | CNY 17.986 | CNY 89.93 | |||
128K<tokens≤256K | CNY 22.483 | CNY 112.413 | |||
qwen3-max-2026-01-23 | EU | Non-thinking and thinking modes | 0<tokens≤32K | CNY 8.993 | CNY 44.965 |
32K<tokens≤128K | CNY 17.986 | CNY 89.93 | |||
128K<tokens≤256K | CNY 22.483 | CNY 112.413 | |||
qwen3-max-2025-09-23 | global | Non-thinking mode only | 0<tokens≤32K | CNY 6 | CNY 24 |
32K<tokens≤128K | CNY 10 | CNY 40 | |||
128K<tokens≤256K | CNY 15 | CNY 60 | |||
qwen3-max-preview context caching qualifies for a discount | global | Non-thinking and thinking modes | 0<tokens≤32K | CNY 6 | CNY 24 |
32K<tokens≤128K | CNY 10 | CNY 40 | |||
128K<tokens≤256K | CNY 15 | CNY 60 |
Qwen-Plus
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Input tokens | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio | |
Non-thinking mode | Thinking mode | |||||
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 50% batch inference discount context caching discount | Chinese mainland | 0<token≤256K | CNY 2 | CNY 8 | CNY 8 | 1 million tokens |
256K<token≤1M | CNY 6 | CNY 24 | CNY 24 | |||
qwen3.7-plus-2026-05-26 context caching discount | Chinese mainland | 0<token≤256K | CNY 2 | CNY 8 | CNY 8 | 1 million tokens |
256K<token≤1M | CNY 6 | CNY 24 | CNY 24 | |||
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 | Chinese mainland | 0<token≤256K | CNY 2 | CNY 12 | CNY 12 | 1 million tokens |
256K<token≤1M | CNY 8 | CNY 48 | CNY 48 | |||
qwen3.6-plus-2026-04-02 | Chinese mainland | 0<token≤256K | CNY 2 | CNY 12 | CNY 12 | 1 million tokens |
256K<token≤1M | CNY 8 | CNY 48 | CNY 48 | |||
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15 | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 | 1 million tokens |
128K<token≤256K | CNY 2 | CNY 12 | CNY 12 | |||
256K<token≤1M | CNY 4 | CNY 24 | CNY 24 | |||
qwen3.5-plus-2026-04-20 | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 | 1 million tokens |
128K<token≤256K | CNY 2 | CNY 12 | CNY 12 | |||
256K<token≤1M | CNY 4 | CNY 24 | CNY 24 | |||
qwen3.5-plus-2026-02-15 | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 | 1 million tokens |
128K<token≤256K | CNY 2 | CNY 12 | CNY 12 | |||
256K<token≤1M | CNY 4 | CNY 24 | CNY 24 | |||
qwen-plus Currently equivalent to qwen-plus-2025-12-01 50% batch inference discount | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
128K<token≤256K | CNY 2.4 | CNY 20 | CNY 24 | |||
256K<token≤1M | CNY 4.8 | CNY 48 | CNY 64 | |||
qwen-plus-latest 50% batch inference discount | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
128K<token≤256K | CNY 2.4 | CNY 20 | CNY 24 | |||
256K<token≤1M | CNY 4.8 | CNY 48 | CNY 64 | |||
qwen-plus-2025-12-01 | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
128K<token≤256K | CNY 2.4 | CNY 20 | CNY 24 | |||
256K<token≤1M | CNY 4.8 | CNY 48 | CNY 64 | |||
qwen-plus-2025-09-11 | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
128K<token≤256K | CNY 2.4 | CNY 20 | CNY 24 | |||
256K<token≤1M | CNY 4.8 | CNY 48 | CNY 64 | |||
qwen-plus-2025-07-28 | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
128K<token≤256K | CNY 2.4 | CNY 20 | CNY 24 | |||
256K<token≤1M | CNY 4.8 | CNY 48 | CNY 64 | |||
qwen-plus-2025-07-14 | Chinese mainland | No tiered pricing | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
qwen-plus-2025-04-28 | Chinese mainland | No tiered pricing | CNY 0.8 | CNY 2 | CNY 8 | 1 million tokens |
More models
Model id | Deployment scope | Input tokens | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-plus-2025-01-25 | Chinese mainland | No tiered pricing | CNY 0.8 | CNY 2 | 1 million tokens |
qwen-plus-2025-01-12 | Chinese mainland | No tiered pricing | CNY 0.8 | CNY 2 | 1 million tokens |
qwen-plus-2024-12-20 | Chinese mainland | No tiered pricing | CNY 0.8 | CNY 2 | 1 million tokens |
US (Virginia)
Model ID | Availability | Input token range | Input price | Output price | |
Non-thinking mode | Thinking mode | ||||
qwen3.7-plus Equivalent to qwen3.7-plus-2026-05-26 context caching is eligible for discounts. | Global | 0<tokens≤256K | CNY 2 | CNY 8 | CNY 8 |
256K<tokens≤1M | CNY 6 | CNY 24 | CNY 24 | ||
qwen3.7-plus-2026-05-26 context caching is eligible for discounts. | Global | 0<tokens≤256K | CNY 2 | CNY 8 | CNY 8 |
256K<tokens≤1M | CNY 6 | CNY 24 | CNY 24 | ||
qwen3.6-plus Equivalent to qwen3.6-plus-2026-04-02 | Global | 0<tokens≤256K | CNY 2 | CNY 12 | CNY 12 |
256K<tokens≤1M | CNY 8 | CNY 48 | CNY 48 | ||
qwen3.6-plus-2026-04-02 | Global | 0<tokens≤256K | CNY 2 | CNY 12 | CNY 12 |
256K<tokens≤1M | CNY 8 | CNY 48 | CNY 48 | ||
qwen3.5-plus Equivalent to qwen3.5-plus-2026-02-15 | Global | 0<tokens≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 |
128K<tokens≤256K | CNY 2 | CNY 12 | CNY 12 | ||
256K<tokens≤1M | CNY 4 | CNY 24 | CNY 24 | ||
qwen3.5-plus-2026-02-15 | Global | 0<tokens≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 |
128K<tokens≤256K | CNY 2 | CNY 12 | CNY 12 | ||
256K<tokens≤1M | CNY 4 | CNY 24 | CNY 24 | ||
qwen-plus Equivalent to qwen-plus-2025-12-01 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
qwen-plus-us context caching is eligible for discounts. | US | 0<tokens≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<tokens≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-2025-12-01 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
qwen-plus-2025-12-01-us | US | 0<tokens≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<tokens≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-2025-09-11 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
qwen-plus-2025-07-28 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
Singapore
Model id | Deployment scope | Input token range | Input price (per 1M tokens) | Output price (per 1M tokens) | |
Non-thinking mode | Thinking mode | ||||
qwen3.7-plus Alias for qwen3.7-plus-2026-05-26 context caching discount applies | International | 0<Token≤256K | CNY 2.998 | CNY 11.991 | CNY 11.991 |
256K<Token≤1M | CNY 8.993 | CNY 35.972 | CNY 35.972 | ||
qwen3.7-plus-2026-05-26 context caching discount applies | International | 0<Token≤256K | CNY 2.998 | CNY 11.991 | CNY 11.991 |
256K<Token≤1M | CNY 8.993 | CNY 35.972 | CNY 35.972 | ||
qwen3.6-plus Alias for qwen3.6-plus-2026-04-02 | International | 0<Token≤256K | CNY 3.7471 | CNY 22.4826 | CNY 22.4826 |
256K<Token≤1M | CNY 14.9884 | CNY 44.965 | CNY 44.965 | ||
qwen3.6-plus-2026-04-02 | International | 0<Token≤256K | CNY 3.7471 | CNY 22.4826 | CNY 22.4826 |
256K<Token≤1M | CNY 14.9884 | CNY 44.965 | CNY 44.965 | ||
qwen3.5-plus Alias for qwen3.5-plus-2026-02-15 | International | 0<Token≤256K | CNY 2.936 | CNY 17.614 | CNY 17.614 |
256K<Token≤1M | CNY 3.67 | CNY 22.018 | CNY 22.018 | ||
qwen3.5-plus-2026-04-20 | International | 0<Token≤256K | CNY 2.936 | CNY 17.614 | CNY 17.614 |
256K<Token≤1M | CNY 3.67 | CNY 22.018 | CNY 22.018 | ||
qwen3.5-plus-2026-02-15 | International | 0<Token≤256K | CNY 2.936 | CNY 17.614 | CNY 17.614 |
256K<Token≤1M | CNY 3.67 | CNY 22.018 | CNY 22.018 | ||
qwen-plus Alias for qwen-plus-2025-12-01 | International | 0<Token≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<Token≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-latest | International | 0<Token≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<Token≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-2025-12-01 | International | 0<Token≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<Token≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-2025-09-11 | International | 0<Token≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<Token≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-2025-07-28 | International | 0<Token≤256K | CNY 2.936 | CNY 8.807 | CNY 29.357 |
256K<Token≤1M | CNY 8.807 | CNY 26.421 | CNY 88.071 | ||
qwen-plus-2025-07-14 | International | No tiered pricing | CNY 2.936 | CNY 8.807 | CNY 29.357 |
qwen-plus-2025-04-28 | International | No tiered pricing | CNY 2.936 | CNY 8.807 | CNY 29.357 |
More models
Model id | Deployment scope | Input token range | Input price (per 1M tokens) | Output price (per 1M tokens) |
qwen-plus-2025-01-25 | International | No tiered pricing | CNY 2.936 | CNY 8.807 |
Germany (Frankfurt)
Model ID | Deployment scope | Input token range | Input price | Output price | |
Non-thinking mode | Thinking mode | ||||
qwen3.7-plus Currently equivalent to qwen3.7-plus-2026-05-26 context caching discounts apply | Global | 0<tokens≤256K | CNY 2 | CNY 8 | CNY 8 |
256K<tokens≤1M | CNY 6 | CNY 24 | CNY 24 | ||
qwen3.7-plus-2026-05-26 context caching discounts apply | Global | 0<tokens≤256K | CNY 2 | CNY 8 | CNY 8 |
256K<tokens≤1M | CNY 6 | CNY 24 | CNY 24 | ||
qwen3.6-plus Currently equivalent to qwen3.6-plus-2026-04-02 | Global | 0<tokens≤256K | CNY 2 | CNY 12 | CNY 12 |
256K<tokens≤1M | CNY 8 | CNY 48 | CNY 48 | ||
qwen3.6-plus-2026-04-02 | Global | 0<tokens≤256K | CNY 2 | CNY 12 | CNY 12 |
256K<tokens≤1M | CNY 8 | CNY 48 | CNY 48 | ||
qwen3.5-plus Currently equivalent to qwen3.5-plus-2026-02-15 | Global | 0<tokens≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 |
128K<tokens≤256K | CNY 2 | CNY 12 | CNY 12 | ||
256K<tokens≤1M | CNY 4 | CNY 24 | CNY 24 | ||
qwen3.5-plus-2026-02-15 | Global | 0<tokens≤128K | CNY 0.8 | CNY 4.8 | CNY 4.8 |
128K<tokens≤256K | CNY 2 | CNY 12 | CNY 12 | ||
256K<tokens≤1M | CNY 4 | CNY 24 | CNY 24 | ||
qwen-plus Currently equivalent to qwen-plus-2025-12-01 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
qwen-plus Currently equivalent to qwen-plus-2025-12-01 | EU | 0<tokens≤256K | CNY 2.998 | CNY 8.993 | CNY 29.977 |
256K<tokens≤1M | CNY 8.993 | CNY 26.979 | CNY 89.93 | ||
qwen-plus-2025-12-01 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
qwen-plus-2025-12-01 | EU | 0<tokens≤256K | CNY 2.998 | CNY 8.993 | CNY 29.977 |
256K<tokens≤1M | CNY 8.993 | CNY 26.979 | CNY 89.93 | ||
qwen-plus-2025-09-11 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
qwen-plus-2025-07-28 | Global | 0<tokens≤128K | CNY 0.8 | CNY 2 | CNY 8 |
128K<tokens≤256K | CNY 2.4 | CNY 20 | CNY 24 | ||
256K<tokens≤1M | CNY 4.8 | CNY 48 | CNY 64 | ||
Qwen-Flash
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Region | Mode | Input token range | Input price | Output price Chain of thought and answer | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3.6-flash Currently equivalent to qwen3.6-flash-2026-04-16 50% discount for batch inference Discounts apply to context caching | Chinese mainland | non-thinking and thinking modes | 0<Token≤256K | CNY 1.2 | CNY 7.2 | 1 million tokens |
256K<Token≤1M | CNY 4.8 | CNY 28.8 | ||||
qwen3.6-flash-2026-04-16 | Chinese mainland | non-thinking and thinking modes | 0<Token≤256K | CNY 1.2 | CNY 7.2 | 1 million tokens |
256K<Token≤1M | CNY 4.8 | CNY 28.8 | ||||
qwen3.5-flash Currently equivalent to qwen3.5-flash-2026-02-23 50% discount for batch inference Discounts apply to context caching | Chinese mainland | non-thinking and thinking modes | 0<Token≤128K | CNY 0.2 | CNY 2 | 1 million tokens |
128K<Token≤256K | CNY 0.8 | CNY 8 | ||||
256K<Token≤1M | CNY 1.2 | CNY 12 | ||||
qwen3.5-flash-2026-02-23 | Chinese mainland | non-thinking and thinking modes | 0<Token≤128K | CNY 0.2 | CNY 2 | 1 million tokens |
128K<Token≤256K | CNY 0.8 | CNY 8 | ||||
256K<Token≤1M | CNY 1.2 | CNY 12 | ||||
qwen-flash Currently equivalent to qwen-flash-2025-07-28 50% discount for batch inference Discounts apply to context caching | Chinese mainland | non-thinking and thinking modes | 0<Token≤128K | CNY 0.15 | CNY 1.5 | 1 million tokens |
128K<Token≤256K | CNY 0.6 | CNY 6 | ||||
256K<Token≤1M | CNY 1.2 | CNY 12 | ||||
qwen-flash-2025-07-28 | Chinese mainland | non-thinking and thinking modes | 0<Token≤128K | CNY 0.15 | CNY 1.5 | 1 million tokens |
128K<Token≤256K | CNY 0.6 | CNY 6 | ||||
256K<Token≤1M | CNY 1.2 | CNY 12 |
US (Virginia)
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought + answer |
qwen3.6-flash Currently points to qwen3.6-flash-2026-04-16 Discount for context caching | global | non-thinking and thinking modes | 0<token≤256K | CNY 1.2 | CNY 7.2 |
256K<token≤1M | CNY 4.8 | CNY 28.8 | |||
qwen3.6-flash-2026-04-16 | global | non-thinking and thinking modes | 0<token≤256K | CNY 1.2 | CNY 7.2 |
256K<token≤1M | CNY 4.8 | CNY 28.8 | |||
qwen3.5-flash Currently points to qwen3.5-flash-2026-02-23 Discount for context caching | global | non-thinking and thinking modes | 0<token≤128K | CNY 0.2 | CNY 2 |
128K<token≤256K | CNY 0.8 | CNY 8 | |||
256K<token≤1M | CNY 1.2 | CNY 12 | |||
qwen3.5-flash-2026-02-23 | global | non-thinking and thinking modes | 0<token≤128K | CNY 0.2 | CNY 2 |
128K<token≤256K | CNY 0.8 | CNY 8 | |||
256K<token≤1M | CNY 1.2 | CNY 12 | |||
qwen-flash Currently points to qwen-flash-2025-07-28 50% discount for batch inference Discount for context caching | global | non-thinking and thinking modes | 0<token≤128K | CNY 0.15 | CNY 1.5 |
128K<token≤256K | CNY 0.6 | CNY 6 | |||
256K<token≤1M | CNY 1.2 | CNY 12 | |||
qwen-flash-us Discount for context caching | US | 0<token≤256K | CNY 0.367 | CNY 2.936 | |
256K<token≤1M | CNY 1.835 | CNY 14.678 | |||
qwen-flash-2025-07-28 | global | non-thinking and thinking modes | 0<token≤128K | CNY 0.15 | CNY 1.5 |
128K<token≤256K | CNY 0.6 | CNY 6 | |||
256K<token≤1M | CNY 1.2 | CNY 12 | |||
qwen-flash-2025-07-28-us | US | 0<token≤256K | CNY 0.367 | CNY 2.936 | |
256K<token≤1M | CNY 1.835 | CNY 14.678 |
Singapore
Model ID | Deployment scope | Mode | Input token range | Input price | Output price Chain of thought and answer |
qwen3.6-flash Equivalent to qwen3.6-flash-2026-04-16 context caching discounts apply | international | non-thinking and thinking modes | 0<tokens≤256K | CNY 1.87355 | CNY 11.2413 |
256K<tokens≤1M | CNY 7.4942 | CNY 29.9758 | |||
qwen3.6-flash-2026-04-16 | international | non-thinking and thinking modes | 0<tokens≤256K | CNY 1.87355 | CNY 11.2413 |
256K<tokens≤1M | CNY 7.4942 | CNY 29.9758 | |||
qwen3.5-flash Equivalent to qwen3.5-flash-2026-02-23 batch inference 50% discount context caching discounts apply | international | non-thinking and thinking modes | 0<tokens≤1M | CNY 0.734 | CNY 2.936 |
qwen3.5-flash-2026-02-23 | international | non-thinking and thinking modes | 0<tokens≤1M | CNY 0.734 | CNY 2.936 |
qwen-flash Equivalent to qwen-flash-2025-07-28 batch inference 50% discount context caching discounts apply | international | non-thinking and thinking modes | 0<tokens≤256K | CNY 0.367 | CNY 2.936 |
256K<tokens≤1M | CNY 1.835 | CNY 14.678 | |||
qwen-flash-2025-07-28 | international | non-thinking and thinking modes | 0<tokens≤256K | CNY 0.367 | CNY 2.936 |
256K<tokens≤1M | CNY 1.835 | CNY 14.678 |
Germany (Frankfurt)
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
qwen3.6-flash Alias for qwen3.6-flash-2026-04-16 Discounts for context caching | Global | non-thinking and thinking modes | 0<Token≤256K | CNY 1.2 | CNY 7.2 |
256K<Token≤1M | CNY 4.8 | CNY 28.8 | |||
qwen3.6-flash-2026-04-16 | Global | non-thinking and thinking modes | 0<Token≤256K | CNY 1.2 | CNY 7.2 |
256K<Token≤1M | CNY 4.8 | CNY 28.8 | |||
qwen3.5-flash Alias for qwen3.5-flash-2026-02-23 Discounts for context caching | Global | non-thinking and thinking modes | 0<Token≤128K | CNY 0.2 | CNY 2 |
128K<Token≤256K | CNY 0.8 | CNY 8 | |||
256K<Token≤1M | CNY 1.2 | CNY 12 | |||
qwen3.5-flash Alias for qwen3.5-flash-2026-02-23 | EU | non-thinking and thinking modes | CNY 0.749 | CNY 2.998 | |
qwen3.5-flash-2026-02-23 | Global | non-thinking and thinking modes | 0<Token≤128K | CNY 0.2 | CNY 2 |
128K<Token≤256K | CNY 0.8 | CNY 8 | |||
256K<Token≤1M | CNY 1.2 | CNY 12 | |||
qwen3.5-flash-2026-02-23 | EU | non-thinking and thinking modes | CNY 0.749 | CNY 2.998 | |
qwen-flash Alias for qwen-flash-2025-07-28 50% discount for batch inference Discounts for context caching | Global | non-thinking and thinking modes | 0<Token≤128K | CNY 0.15 | CNY 1.5 |
128K<Token≤256K | CNY 0.6 | CNY 6 | |||
256K<Token≤1M | CNY 1.2 | CNY 12 | |||
qwen-flash-2025-07-28 | Global | non-thinking and thinking modes | 0<Token≤128K | CNY 0.15 | CNY 1.5 |
128K<Token≤256K | CNY 0.6 | CNY 6 | |||
256K<Token≤1M | CNY 1.2 | CNY 12 |
Qwen-Turbo
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Mode | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio | |
Non-thinking mode | Thinking mode (chain of thought + answer) | |||||
qwen-turbo 50% discount for batch inference | Chinese mainland | Non-thinking and thinking | CNY 0.3 | CNY 0.6 | CNY 3 | 1 million tokens |
Singapore
Model ID | Deployment scope | Mode | Input price (per 1 million tokens) | Output price (per 1 million tokens) | |
Non-thinking mode | Thinking mode (chain of thought + answer) | ||||
qwen-turbo 50% discount for batch inference | International | Non-thinking and thinking | CNY 0.367 | CNY 1.468 | CNY 3.67 |
QwQ
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Mode | Input price | Output price | Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwq-plus 50% discount for batch inference | Chinese mainland | Thinking mode only | CNY 1.6 | CNY 4 | 1 million tokens |
Singapore
Model ID | Deployment scope | Mode | Input price | Output price |
qwq-plus | International | Thinking mode only | CNY 5.871 | CNY 17.614 |
Qwen-Long
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
China (Beijing)
Model ID | Region | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-long 50% discount for batch inference | Chinese mainland | CNY 0.5 | CNY 2 | 1 million tokens |
qwen-long-latest | Chinese mainland | CNY 0.5 | CNY 2 | 1 million tokens |
qwen-long-2025-01-25 | Chinese mainland | CNY 0.5 | CNY 2 | 1 million tokens |
Qwen-Omni
Billing rules: You are billed for input and output tokens. For details on how tokens are calculated for different modalities, see Billing and rate limits.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio | ||
Text/image/video | Audio | Text Multimodal input | Text and audio Only audio is billed | |||
qwen3.5-omni-plus Currently equivalent to qwen3.5-omni-plus-2026-03-15 | chinese mainland | CNY 7 | CNY 53 | CNY 40 | CNY 213 | 1 million tokens |
qwen3.5-omni-plus-2026-03-15 | chinese mainland | CNY 7 | CNY 53 | CNY 40 | CNY 213 | 1 million tokens |
qwen3.5-omni-flash Currently equivalent to qwen3.5-omni-flash-2026-03-15 | chinese mainland | CNY 2.2 | CNY 18 | CNY 13.3 | CNY 72 | 1 million tokens |
qwen3.5-omni-flash-2026-03-15 | chinese mainland | CNY 2.2 | CNY 18 | CNY 13.3 | CNY 72 | 1 million tokens |
More models
Singapore
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | ||
Text/image/video | Audio | Text Multimodal input | Text and audio Only audio is billed | ||
qwen3.5-omni-plus Currently equivalent to qwen3.5-omni-plus-2026-03-15 | international | CNY 10.49 | CNY 82.44 | CNY 62.2 | CNY 329.74 |
qwen3.5-omni-plus-2026-03-15 | international | CNY 10.49 | CNY 82.44 | CNY 62.2 | CNY 329.74 |
qwen3.5-omni-flash Currently equivalent to qwen3.5-omni-flash-2026-03-15 | international | CNY 3 | CNY 22.48 | CNY 16.49 | CNY 89.18 |
qwen3.5-omni-flash-2026-03-15 | international | CNY 3 | CNY 22.48 | CNY 16.49 | CNY 89.18 |
More models
Qwen-Omni-Realtime
Billing rules: Billing is based on input and output tokens. For token calculation rules for different modalities, see billing and rate limits.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio | ||
Text and image | Audio | Text multimodal input | Text and audio billed for audio only | |||
qwen3.5-omni-plus-realtime | Chinese mainland | CNY 10 | CNY 80 | CNY 60 | CNY 300 | 1 million tokens |
qwen3.5-omni-plus-realtime-2026-03-15 | Chinese mainland | CNY 10 | CNY 80 | CNY 60 | CNY 300 | 1 million tokens |
qwen3.5-omni-flash-realtime | Chinese mainland | CNY 3.3 | CNY 27 | CNY 20 | CNY 107 | 1 million tokens |
qwen3.5-omni-flash-realtime-2026-03-15 | Chinese mainland | CNY 3.3 | CNY 27 | CNY 20 | CNY 107 | 1 million tokens |
More models
Singapore
Model ID | Deployment scope | Input price | Output price | ||
Text and image | Audio | Text multimodal input | Text and audio billed for audio only | ||
qwen3.5-omni-plus-realtime | International | CNY 15.74 | CNY 123.65 | CNY 92.93 | CNY 464.64 |
qwen3.5-omni-plus-realtime-2026-03-15 | International | CNY 15.74 | CNY 123.65 | CNY 92.93 | CNY 464.64 |
qwen3.5-omni-flash-realtime | International | CNY 4.12 | CNY 33.72 | CNY 24.73 | CNY 132.65 |
qwen3.5-omni-flash-realtime-2026-03-15 | International | CNY 4.12 | CNY 33.72 | CNY 24.73 | CNY 132.65 |
More models
QVQ
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qvq-max | Chinese mainland | CNY 8 | CNY 32 | 1 million tokens |
qvq-plus | Chinese mainland | CNY 2 | CNY 5 | 1 million tokens |
Singapore
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) |
qvq-max | International | CNY 8.807 | CNY 35.228 |
Qwen-VL
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Mode | Input tokens per request | Input price (per 1M tokens) | Output price (per 1M tokens) CoT and answer | Free quota(Note) Valid for 90 days after activating Model Studio |
qwen3-vl-plus Equivalent to qwen3-vl-plus-2025-12-19 50% discount on batch inference Discounts apply to context caching | Chinese mainland | non-thinking & thinking modes | 0<tokens≤32K | CNY 1 | CNY 10 | 1M tokens |
32K<tokens≤128K | CNY 1.5 | CNY 15 | ||||
128K<tokens≤256K | CNY 3 | CNY 30 | ||||
qwen3-vl-plus-2025-12-19 | Chinese mainland | non-thinking & thinking modes | 0<tokens≤32K | CNY 1 | CNY 10 | 1M tokens |
32K<tokens≤128K | CNY 1.5 | CNY 15 | ||||
128K<tokens≤256K | CNY 3 | CNY 30 | ||||
qwen3-vl-plus-2025-09-23 | Chinese mainland | non-thinking & thinking modes | 0<tokens≤32K | CNY 1 | CNY 10 | 1M tokens |
32K<tokens≤128K | CNY 1.5 | CNY 15 | ||||
128K<tokens≤256K | CNY 3 | CNY 30 | ||||
qwen3-vl-flash Equivalent to qwen3-vl-flash-2026-01-22 50% discount on batch inference Discounts apply to context caching | Chinese mainland | non-thinking & thinking modes | 0<tokens≤32K | CNY 0.15 | CNY 1.5 | 1M tokens |
32K<tokens≤128K | CNY 0.3 | CNY 3 | ||||
128K<tokens≤256K | CNY 0.6 | CNY 6 | ||||
qwen3-vl-flash-2026-01-22 | Chinese mainland | non-thinking & thinking modes | 0<tokens≤32K | CNY 0.15 | CNY 1.5 | 1M tokens |
32K<tokens≤128K | CNY 0.3 | CNY 3 | ||||
128K<tokens≤256K | CNY 0.6 | CNY 6 | ||||
qwen3-vl-flash-2025-10-15 | Chinese mainland | non-thinking & thinking modes | 0<tokens≤32K | CNY 0.15 | CNY 1.5 | 1M tokens |
32K<tokens≤128K | CNY 0.3 | CNY 3 | ||||
128K<tokens≤256K | CNY 0.6 | CNY 6 |
More models
Model ID | Deployment scope | Input tokens per request | Input price (per 1M tokens) | Output price (per 1M tokens) | Free quota(Note) Valid for 90 days after activating Model Studio |
qwen-vl-max 50% discount on batch inference Discounts apply to context caching | Chinese mainland | no tiered pricing | CNY 1.6 | CNY 4 | 1M tokens |
qwen-vl-plus 50% discount on batch inference Discounts apply to context caching | Chinese mainland | no tiered pricing | CNY 0.8 | CNY 2 | 1M tokens |
US (Virginia)
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2025-10-15 Context caching discount | Global | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 0.15 | CNY 1.5 |
32K < tokens ≤ 128K | CNY 0.3 | CNY 3 | |||
128K < tokens ≤ 256K | CNY 0.6 | CNY 6 | |||
qwen3-vl-flash-us Context caching discount | US | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 0.367 | CNY 2.936 |
32K < tokens ≤ 128K | CNY 0.55 | CNY 4.404 | |||
128K < tokens ≤ 256K | CNY 0.881 | CNY 7.046 | |||
qwen3-vl-flash-2026-01-22-us | US | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 0.367 | CNY 2.936 |
32K < tokens ≤ 128K | CNY 0.55 | CNY 4.404 | |||
128K < tokens ≤ 256K | CNY 0.881 | CNY 7.046 | |||
qwen3-vl-flash-2025-10-15 | Global | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 0.15 | CNY 1.5 |
32K < tokens ≤ 128K | CNY 0.3 | CNY 3 | |||
128K < tokens ≤ 256K | CNY 0.6 | CNY 6 | |||
qwen3-vl-flash-2025-10-15-us | US | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 0.367 | CNY 2.936 |
32K < tokens ≤ 128K | CNY 0.55 | CNY 4.404 | |||
128K < tokens ≤ 256K | CNY 0.881 | CNY 7.046 | |||
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-09-23 Context caching discount | Global | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 1 | CNY 10 |
32K < tokens ≤ 128K | CNY 1.5 | CNY 15 | |||
128K < tokens ≤ 256K | CNY 3 | CNY 30 | |||
qwen3-vl-plus-2025-09-23 | Global | Non-Thinking and Thinking modes | 0 < tokens ≤ 32K | CNY 1 | CNY 10 |
32K < tokens ≤ 128K | CNY 1.5 | CNY 15 | |||
128K < tokens ≤ 256K | CNY 3 | CNY 30 |
Singapore
Model ID | Deployment scope | Mode | Input tokens | Input price (per million tokens) | Output price (per million tokens) |
qwen3-vl-plus Currently equivalent to qwen3-vl-plus-2025-12-19 Context caching is eligible for discounts | International | Non-Thinking and Thinking modes | 0<Token≤32K | CNY 1.468 | CNY 11.743 |
32K<Token≤128K | CNY 2.202 | CNY 17.614 | |||
128K<Token≤256K | CNY 4.404 | CNY 35.228 | |||
qwen3-vl-plus-2025-12-19 | International | Non-Thinking and Thinking modes | 0<Token≤32K | CNY 1.468 | CNY 11.743 |
32K<Token≤128K | CNY 2.202 | CNY 17.614 | |||
128K<Token≤256K | CNY 4.404 | CNY 35.228 | |||
qwen3-vl-plus-2025-09-23 | International | Non-Thinking and Thinking modes | 0<Token≤32K | CNY 1.468 | CNY 11.743 |
32K<Token≤128K | CNY 2.202 | CNY 17.614 | |||
128K<Token≤256K | CNY 4.404 | CNY 35.228 | |||
qwen3-vl-flash Currently equivalent to qwen3-vl-flash-2026-01-22 Context caching discounts apply | International | Non-Thinking and Thinking modes | 0<Token≤32K | CNY 0.367 | CNY 2.936 |
32K<Token≤128K | CNY 0.55 | CNY 4.404 | |||
128K<Token≤256K | CNY 0.881 | CNY 7.046 | |||
qwen3-vl-flash-2026-01-22 | International | Non-Thinking and Thinking modes | 0<Token≤32K | CNY 0.367 | CNY 2.936 |
32K<Token≤128K | CNY 0.55 | CNY 4.404 | |||
128K<Token≤256K | CNY 0.881 | CNY 7.046 | |||
qwen3-vl-flash-2025-10-15 | International | Non-Thinking and Thinking modes | 0<Token≤32K | CNY 0.367 | CNY 2.936 |
32K<Token≤128K | CNY 0.55 | CNY 4.404 | |||
128K<Token≤256K | CNY 0.881 | CNY 7.046 |
More models
Model ID | Deployment scope | Input tokens | Input price (per million tokens) | Output price (per million tokens) |
qwen-vl-max Context caching discounts apply | International | No tiered pricing | CNY 5.871 | CNY 23.486 |
qwen-vl-plus Context caching discounts apply | International | No tiered pricing | CNY 1.541 | CNY 4.624 |
Germany (Frankfurt)
Model ID | Service region | Mode | Input tokens | Input price (per 1 million tokens) | Output price (per 1 million tokens) Chain of thought and answer |
qwen3-vl-flash Equivalent to qwen3-vl-flash-2025-10-15 context cache (Discount available) | Global | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 0.15 | CNY 1.5 |
32K < tokens ≤ 128K | CNY 0.3 | CNY 3 | |||
128K < tokens ≤ 256K | CNY 0.6 | CNY 6 | |||
qwen3-vl-flash Equivalent to qwen3-vl-flash-2026-01-22 | EU | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 0.375 | CNY 2.998 |
32K < tokens ≤ 128K | CNY 0.562 | CNY 4.497 | |||
128K < tokens ≤ 256K | CNY 0.899 | CNY 7.194 | |||
qwen3-vl-flash-2026-01-22 | EU | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 0.375 | CNY 2.998 |
32K < tokens ≤ 128K | CNY 0.562 | CNY 4.497 | |||
128K < tokens ≤ 256K | CNY 0.899 | CNY 7.194 | |||
qwen3-vl-flash-2025-10-15 | Global | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 0.15 | CNY 1.5 |
32K < tokens ≤ 128K | CNY 0.3 | CNY 3 | |||
128K < tokens ≤ 256K | CNY 0.6 | CNY 6 | |||
qwen3-vl-flash-2025-10-15 | EU | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 0.375 | CNY 2.998 |
32K < tokens ≤ 128K | CNY 0.562 | CNY 4.497 | |||
128K < tokens ≤ 256K | CNY 0.899 | CNY 7.194 | |||
qwen3-vl-plus Equivalent to qwen3-vl-plus-2025-12-19 context cache (Discount available) | Global | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 1 | CNY 10 |
32K < tokens ≤ 128K | CNY 1.5 | CNY 15 | |||
128K < tokens ≤ 256K | CNY 3 | CNY 30 | |||
qwen3-vl-plus | EU | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 1.499 | CNY 11.991 |
32K < tokens ≤ 128K | CNY 2.248 | CNY 17.986 | |||
128K < tokens ≤ 256K | CNY 4.497 | CNY 35.972 | |||
qwen3-vl-plus-2025-09-23 | Global | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 1 | CNY 10 |
32K < tokens ≤ 128K | CNY 1.5 | CNY 15 | |||
128K < tokens ≤ 256K | CNY 3 | CNY 30 | |||
qwen3-vl-plus-2025-09-23 | EU | non-thinking and thinking modes | 0 < tokens ≤ 32K | CNY 1.499 | CNY 11.991 |
32K < tokens ≤ 128K | CNY 2.248 | CNY 17.986 | |||
128K < tokens ≤ 256K | CNY 4.497 | CNY 35.972 |
Qwen-OCR
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Input price | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-vl-ocr 50% discount for batch inference | Chinese mainland | CNY 0.3 | CNY 0.5 | 1 million tokens |
qwen-vl-ocr-latest 50% discount for batch inference | Chinese mainland | CNY 0.3 | CNY 0.5 | 1 million tokens |
qwen-vl-ocr-2025-11-20 | Chinese mainland | CNY 0.3 | CNY 0.5 | 1 million tokens |
qwen-vl-ocr-2025-08-28 | Chinese mainland | CNY 5 | CNY 5 | 1 million tokens |
qwen-vl-ocr-2025-04-13 | Chinese mainland | CNY 5 | CNY 5 | 1 million tokens |
qwen-vl-ocr-2024-10-28 | Chinese mainland | CNY 5 | CNY 5 | 1 million tokens |
US (Virginia)
Model id | Deployment scope | Input price | Output price |
qwen-vl-ocr | global | CNY 0.3 | CNY 0.5 |
qwen-vl-ocr-2025-11-20 | global | CNY 0.3 | CNY 0.5 |
Singapore
Model id | Deployment scope | Input price | Output price |
qwen-vl-ocr | international | CNY 0.514 | CNY 1.174 |
qwen-vl-ocr-2025-11-20 | international | CNY 0.514 | CNY 1.174 |
Germany (Frankfurt)
Model id | Deployment scope | Input price | Output price |
qwen-vl-ocr | global | CNY 0.3 | CNY 0.5 |
qwen-vl-ocr-2025-11-20 | global | CNY 0.3 | CNY 0.5 |
Qwen-Audio
You are charged for input tokens and output tokens.
One second of audio is calculated as 25 tokens. Audio clips shorter than one second are also billed as 25 tokens.
China (Beijing)
Model ID | Deployment region | Input price (per 1M tokens) | Output price (per 1M tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-audio-turbo | Chinese mainland | Currently available for free trial only. After the free quota is exhausted, you can no longer call the model. We recommend using Omni (Qwen-Omni) as an alternative. | 100,000 tokens | |
qwen-audio-turbo-latest | Chinese mainland | |||
Qwen Math
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quotaGuidelines Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-math-plus | Chinese mainland | CNY 4 | CNY 12 | 1 million tokens |
qwen-math-turbo | Chinese mainland | CNY 2 | CNY 6 |
Qwen-Coder
You are charged for input tokens and output tokens.
If the model supports context cache, only input tokens receive a discount.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input tokens | Input price | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3-coder-plus Currently equivalent to qwen3-coder-plus-2025-09-23 Discount for context caching | Chinese mainland | 0<tokens≤32K | CNY 4 | CNY 16 | 1 million tokens |
32K<tokens≤128K | CNY 6 | CNY 24 | |||
128K<tokens≤256K | CNY 10 | CNY 40 | |||
256K<tokens≤1M | CNY 20 | CNY 200 | |||
qwen3-coder-plus-2025-09-23 | Chinese mainland | 0<tokens≤32K | CNY 4 | CNY 16 | 1 million tokens |
32K<tokens≤128K | CNY 6 | CNY 24 | |||
128K<tokens≤256K | CNY 10 | CNY 40 | |||
256K<tokens≤1M | CNY 20 | CNY 200 | |||
qwen3-coder-plus-2025-07-22 | Chinese mainland | 0<tokens≤32K | CNY 4 | CNY 16 | 1 million tokens |
32K<tokens≤128K | CNY 6 | CNY 24 | |||
128K<tokens≤256K | CNY 10 | CNY 40 | |||
256K<tokens≤1M | CNY 20 | CNY 200 | |||
qwen3-coder-flash Currently equivalent to qwen3-coder-flash-2025-07-28 | Chinese mainland | 0<tokens≤32K | CNY 1 | CNY 4 | 1 million tokens |
32K<tokens≤128K | CNY 1.5 | CNY 6 | |||
128K<tokens≤256K | CNY 2.5 | CNY 10 | |||
256K<tokens≤1M | CNY 5 | CNY 25 | |||
qwen3-coder-flash-2025-07-28 | Chinese mainland | 0<tokens≤32K | CNY 1 | CNY 4 | 1 million tokens |
32K<tokens≤128K | CNY 1.5 | CNY 6 | |||
128K<tokens≤256K | CNY 2.5 | CNY 10 | |||
256K<tokens≤1M | CNY 5 | CNY 25 |
More models
Model ID | Deployment scope | Input tokens | Input price | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-coder-plus | Chinese mainland | No tiered pricing | CNY 3.5 | CNY 7 | 1 million tokens |
qwen-coder-turbo | Chinese mainland | No tiered pricing | CNY 2 | CNY 6 | 1 million tokens |
US (Virginia)
Model ID | Deployment scope | Input tokens | Input price (per million tokens) | Output price (per million tokens) |
qwen3-coder-plus Alias for qwen3-coder-plus-2025-09-23 | global | 0<token≤32K | CNY 4 | CNY 16 |
32K<token≤128K | CNY 6 | CNY 24 | ||
128K<token≤256K | CNY 10 | CNY 40 | ||
256K<token≤1M | CNY 20 | CNY 200 | ||
qwen3-coder-plus-2025-09-23 | global | 0<token≤32K | CNY 4 | CNY 16 |
32K<token≤128K | CNY 6 | CNY 24 | ||
128K<token≤256K | CNY 10 | CNY 40 | ||
256K<token≤1M | CNY 20 | CNY 200 | ||
qwen3-coder-plus-2025-07-22 | global | 0<token≤32K | CNY 4 | CNY 16 |
32K<token≤128K | CNY 6 | CNY 24 | ||
128K<token≤256K | CNY 10 | CNY 40 | ||
256K<token≤1M | CNY 20 | CNY 200 | ||
qwen3-coder-flash Alias for qwen3-coder-flash-2025-07-28 | global | 0<token≤32K | CNY 1 | CNY 4 |
32K<token≤128K | CNY 1.5 | CNY 6 | ||
128K<token≤256K | CNY 2.5 | CNY 10 | ||
256K<token≤1M | CNY 5 | CNY 25 | ||
qwen3-coder-flash-2025-07-28 | global | 0<token≤32K | CNY 1 | CNY 4 |
32K<token≤128K | CNY 1.5 | CNY 6 | ||
128K<token≤256K | CNY 2.5 | CNY 10 | ||
256K<token≤1M | CNY 5 | CNY 25 |
Singapore
Model ID | Deployment scope | Request tokens | Input price (per 1 million tokens) | Output price (per 1 million tokens) |
qwen3-coder-plus This is an alias for qwen3-coder-plus-2025-09-23 | International | 0<token≤32K | CNY 7.339 | CNY 36.696 |
32K<token≤128K | CNY 13.211 | CNY 66.053 | ||
128K<token≤256K | CNY 22.018 | CNY 110.089 | ||
256K<token≤1M | CNY 44.035 | CNY 440.354 | ||
qwen3-coder-plus-2025-09-23 | International | 0<token≤32K | CNY 7.339 | CNY 36.696 |
32K<token≤128K | CNY 13.211 | CNY 66.053 | ||
128K<token≤256K | CNY 22.018 | CNY 110.089 | ||
256K<token≤1M | CNY 44.035 | CNY 440.354 | ||
qwen3-coder-plus-2025-07-22 | International | 0<token≤32K | CNY 7.339 | CNY 36.696 |
32K<token≤128K | CNY 13.211 | CNY 66.053 | ||
128K<token≤256K | CNY 22.018 | CNY 110.089 | ||
256K<token≤1M | CNY 44.035 | CNY 440.354 | ||
qwen3-coder-flash This is an alias for qwen3-coder-flash-2025-07-28 | International | 0<token≤32K | CNY 2.202 | CNY 11.009 |
32K<token≤128K | CNY 3.67 | CNY 18.348 | ||
128K<token≤256K | CNY 5.871 | CNY 29.357 | ||
256K<token≤1M | CNY 11.743 | CNY 70.457 | ||
qwen3-coder-flash-2025-07-28 | International | 0<token≤32K | CNY 2.202 | CNY 11.009 |
32K<token≤128K | CNY 3.67 | CNY 18.348 | ||
128K<token≤256K | CNY 5.871 | CNY 29.357 | ||
256K<token≤1M | CNY 11.743 | CNY 70.457 |
Germany (Frankfurt)
Model ID | Deployment scope | Input tokens | Input price (per 1M tokens) | Output price (per 1M tokens) |
qwen3-coder-plus Currently an alias for qwen3-coder-plus-2025-09-23 | global | 0 < tokens ≤ 32K | CNY 4 | CNY 16 |
32K<Token≤128K | CNY 6 | CNY 24 | ||
128K < tokens ≤ 256K | CNY 10 | CNY 40 | ||
256K < tokens ≤ 1M | CNY 20 | CNY 200 | ||
qwen3-coder-plus-2025-09-23 | global | 0 < tokens ≤ 32K | CNY 4 | CNY 16 |
32K < tokens ≤ 128K | CNY 6 | CNY 24 | ||
128K < tokens ≤ 256K | CNY 10 | CNY 40 | ||
256K < tokens ≤ 1M | CNY 20 | CNY 200 | ||
qwen3-coder-plus-2025-07-22 | global | 0 < tokens ≤ 32K | CNY 4 | CNY 16 |
32K < tokens ≤ 128K | CNY 6 | CNY 24 | ||
128K < tokens ≤ 256K | CNY 10 | CNY 40 | ||
256K < tokens ≤ 1M | CNY 20 | CNY 200 | ||
qwen3-coder-flash Currently an alias for qwen3-coder-flash-2025-07-28 | global | 0 < tokens ≤ 32K | CNY 1 | CNY 4 |
32K < tokens ≤ 128K | CNY 1.5 | CNY 6 | ||
128K < tokens ≤ 256K | CNY 2.5 | CNY 10 | ||
256K < tokens ≤ 1M | CNY 5 | CNY 25 | ||
qwen3-coder-flash-2025-07-28 | global | 0 < tokens ≤ 32K | CNY 1 | CNY 4 |
32K < tokens ≤ 128K | CNY 1.5 | CNY 6 | ||
128K < tokens ≤ 256K | CNY 2.5 | CNY 10 | ||
256K < tokens ≤ 1M | CNY 5 | CNY 25 |
Qwen translation models
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-mt-plus | Chinese mainland | CNY 1.8 | CNY 5.4 | 1 million tokens |
qwen-mt-flash | Chinese mainland | CNY 0.7 | CNY 1.95 | 1 million tokens |
qwen-mt-lite | Chinese mainland | CNY 0.6 | CNY 1.6 | 1 million tokens |
qwen-mt-turbo | Chinese mainland | CNY 0.7 | CNY 1.95 | 1 million tokens |
US (Virginia)
Model ID | Deployment scope | Input price | Output price |
qwen-mt-flash | Global | CNY 0.7 | CNY 1.95 |
qwen-mt-lite | Global | CNY 0.6 | CNY 1.6 |
qwen-mt-lite-us | US | CNY 0.881 | CNY 2.642 |
qwen-mt-plus | Global | CNY 1.8 | CNY 5.4 |
Singapore
Model ID | Deployment scope | Input price | Output price |
qwen-mt-plus | International | CNY 18.055 | CNY 54.09 |
qwen-mt-flash | International | CNY 1.174 | CNY 3.596 |
qwen-mt-lite | International | CNY 0.881 | CNY 2.642 |
qwen-mt-turbo | International | CNY 1.174 | CNY 3.596 |
Germany (Frankfurt)
Model ID | Deployment scope | Input price | Output price |
qwen-mt-plus | Global | CNY 1.8 | CNY 5.4 |
qwen-mt-flash | Global | CNY 0.7 | CNY 1.95 |
qwen-mt-lite | Global | CNY 0.6 | CNY 1.6 |
Qwen data mining
You are charged for input tokens and output tokens.
China (Beijing)
Model id | Region | Input price | Output price | Free quota (Note) |
qwen-doc-turbo | Chinese mainland | CNY 0.6 | CNY 1 | No free quota |
Qwen Deep Research
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Service region | Input price | Output price | Free quota(Note) |
qwen-deep-research | Chinese mainland | CNY 54 | CNY 163 | No free quota |
qwen-deep-research-2025-12-15 | Chinese mainland | CNY 79 | CNY 236 | No free quota |
Tongyi Xiaomi conversation analysis
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Input price (per 1M tokens) | Output price (per 1M tokens) | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio. |
tongyi-xiaomi-analysis-flash | Chinese mainland | CNY 0.2 | CNY 0.4 | 1 million tokens |
tongyi-xiaomi-analysis-pro | Chinese mainland | CNY 1.0 | CNY 2.7 | 1 million tokens |
Text generation - Qwen (open-source)
Qwen3.6
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input token range | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio | |
Non-thinking mode | Thinking mode (chain of thought + answer) | |||||
qwen3.6-35b-a3b | Chinese mainland | 0<token≤256K | CNY 1.8 | CNY 10.8 | CNY 10.8 | 1 million tokens |
qwen3.6-27b | Chinese mainland | 0<token≤256K | CNY 3 | CNY 18 | CNY 18 | 1 million tokens |
US (Virginia)
Model ID | Deployment scope | Input token range | Input price (per 1 million tokens) | Output price (per 1 million tokens) | |
Non-thinking mode | Thinking mode (chain of thought + answer) | ||||
qwen3.6-35b-a3b | Global | 0<token≤256K | CNY 1.8 | CNY 10.8 | CNY 10.8 |
Singapore
Model ID | Deployment scope | Input token range | Input price (per 1 million tokens) | Output price (per 1 million tokens) | |
Non-thinking mode | Thinking mode (chain of thought + answer) | ||||
qwen3.6-35b-a3b | International | 0<token≤256K | CNY 2.810325 | CNY 16.86195 | CNY 16.86195 |
qwen3.6-27b | International | 0<token≤256K | CNY 4.49652 | CNY 26.97912 | CNY 26.97912 |
Germany (Frankfurt)
Model ID | Deployment scope | Input token range | Input price (per 1 million tokens) | Output price (per 1 million tokens) | |
Non-thinking mode | Thinking mode (chain of thought + answer) | ||||
qwen3.6-35b-a3b | Global | 0<token≤256K | CNY 1.8 | CNY 10.8 | CNY 10.8 |
Qwen3.5
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input tokens | Input price | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio | |
Non-thinking mode | Thinking mode | |||||
qwen3.5-397b-a17b | Chinese mainland | 0<token≤128K | CNY 1.2 | CNY 7.2 | CNY 7.2 | 1 million tokens |
128K<token≤256K | CNY 3 | CNY 18 | CNY 18 | |||
qwen3.5-122b-a10b | Chinese mainland | 0<token≤128K | CNY 0.8 | CNY 6.4 | CNY 6.4 | 1 million tokens |
128K<token≤256K | CNY 2 | CNY 16 | CNY 16 | |||
qwen3.5-27b | Chinese mainland | 0<token≤128K | CNY 0.6 | CNY 4.8 | CNY 4.8 | 1 million tokens |
128K<token≤256K | CNY 1.8 | CNY 14.4 | CNY 14.4 | |||
qwen3.5-35b-a3b | Chinese mainland | 0<token≤128K | CNY 0.4 | CNY 3.2 | CNY 3.2 | 1 million tokens |
128K<token≤256K | CNY 1.6 | CNY 12.8 | CNY 12.8 | |||
US (Virginia)
Model ID | Deployment scope | Input tokens | Input price | Output price | |
Non-thinking mode | Thinking mode | ||||
qwen3.5-397b-a17b | Global | 0<token≤128K | CNY 1.2 | CNY 7.2 | CNY 7.2 |
128K<token≤256K | CNY 3 | CNY 18 | CNY 18 | ||
qwen3.5-122b-a10b | Global | 0<token≤128K | CNY 0.8 | CNY 6.4 | CNY 6.4 |
128K<token≤256K | CNY 2 | CNY 16 | CNY 16 | ||
qwen3.5-27b | Global | 0<token≤128K | CNY 0.6 | CNY 4.8 | CNY 4.8 |
128K<token≤256K | CNY 1.8 | CNY 14.4 | CNY 14.4 | ||
qwen3.5-35b-a3b | Global | 0<token≤128K | CNY 0.4 | CNY 3.2 | CNY 3.2 |
128K<token≤256K | CNY 1.6 | CNY 12.8 | CNY 12.8 | ||
Singapore
Model ID | Deployment scope | Input tokens | Input price | Output price | |
Non-thinking mode | Thinking mode | ||||
qwen3.5-397b-a17b | International | 0<token≤256K | CNY 4.404 | CNY 26.421 | CNY 26.421 |
qwen3.5-122b-a10b | International | 0<token≤256K | CNY 2.936 | CNY 23.486 | CNY 23.486 |
qwen3.5-27b | International | 0<token≤256K | CNY 2.202 | CNY 17.614 | CNY 17.614 |
qwen3.5-35b-a3b | International | 0<token≤256K | CNY 1.835 | CNY 14.678 | CNY 14.678 |
Germany (Frankfurt)
Model ID | Deployment scope | Input tokens | Input price | Output price | |
Non-thinking mode | Thinking mode | ||||
qwen3.5-397b-a17b | Global | 0<token≤128K | CNY 1.2 | CNY 7.2 | CNY 7.2 |
128K<token≤256K | CNY 3 | CNY 18 | CNY 18 | ||
qwen3.5-122b-a10b | Global | 0<token≤128K | CNY 0.8 | CNY 6.4 | CNY 6.4 |
128K<token≤256K | CNY 2 | CNY 16 | CNY 16 | ||
qwen3.5-27b | Global | 0<token≤128K | CNY 0.6 | CNY 4.8 | CNY 4.8 |
128K<token≤256K | CNY 1.8 | CNY 14.4 | CNY 14.4 | ||
qwen3.5-35b-a3b | Global | 0<token≤128K | CNY 0.4 | CNY 3.2 | CNY 3.2 |
128K<token≤256K | CNY 1.6 | CNY 12.8 | CNY 12.8 | ||
Qwen3
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Mode | Input price (per 1M tokens) | Output price (per 1M tokens) | Free quota (Note) Valid for 90 days after activating Model Studio | |
Non-thinking mode | Thinking mode (CoT + answer) | |||||
qwen3-next-80b-a3b-thinking | chinese mainland | Thinking mode only | CNY 1 | - | CNY 10 | 1 million tokens |
qwen3-next-80b-a3b-instruct | chinese mainland | Non-Thinking mode only | CNY 1 | CNY 4 | - | 1 million tokens |
qwen3-235b-a22b-thinking-2507 | chinese mainland | Thinking mode only | CNY 2 | - | CNY 20 | 1 million tokens |
qwen3-235b-a22b-instruct-2507 | chinese mainland | Non-Thinking mode only | CNY 2 | CNY 8 | - | 1 million tokens |
qwen3-30b-a3b-thinking-2507 | chinese mainland | Thinking mode only | CNY 0.75 | - | CNY 7.5 | 1 million tokens |
qwen3-30b-a3b-instruct-2507 | chinese mainland | Non-Thinking mode only | CNY 0.75 | CNY 3 | - | 1 million tokens |
qwen3-235b-a22b | chinese mainland | Non-Thinking and Thinking modes | CNY 2 | CNY 8 | CNY 20 | 1 million tokens |
qwen3-32b | chinese mainland | Non-Thinking and Thinking modes | CNY 2 | CNY 8 | CNY 20 | 1 million tokens |
qwen3-30b-a3b | chinese mainland | Non-Thinking and Thinking modes | CNY 0.75 | CNY 3 | CNY 7.5 | 1 million tokens |
qwen3-14b | chinese mainland | Non-Thinking and Thinking modes | CNY 1 | CNY 4 | CNY 10 | 1 million tokens |
qwen3-8b | chinese mainland | Non-Thinking and Thinking modes | CNY 0.5 | CNY 2 | CNY 5 | 1 million tokens |
US (Virginia)
Model ID | Deployment scope | Mode | Input price (per 1M tokens) | Output price (per 1M tokens) | |
Non-thinking mode | Thinking mode (CoT + answer) | ||||
qwen3-next-80b-a3b-thinking | global | Thinking mode only | CNY 1 | - | CNY 10 |
qwen3-next-80b-a3b-instruct | global | Non-Thinking mode only | CNY 1 | CNY 4 | - |
qwen3-235b-a22b-thinking-2507 | global | Thinking mode only | CNY 1.688 | - | CNY 16.88 |
qwen3-235b-a22b-instruct-2507 | global | Non-Thinking mode only | CNY 1.688 | CNY 6.752 | - |
qwen3-30b-a3b-thinking-2507 | global | Thinking mode only | CNY 0.75 | - | CNY 7.5 |
qwen3-30b-a3b-instruct-2507 | global | Non-Thinking mode only | CNY 0.75 | CNY 3 | - |
qwen3-235b-a22b | global | Non-Thinking and Thinking modes | CNY 2 | CNY 8 | CNY 20 |
qwen3-32b | global | Non-Thinking and Thinking modes | CNY 1.174 | CNY 4.697 | CNY 4.697 |
qwen3-30b-a3b | global | Non-Thinking and Thinking modes | CNY 0.75 | CNY 3 | CNY 7.5 |
qwen3-14b | global | Non-Thinking and Thinking modes | CNY 1 | CNY 4 | CNY 10 |
qwen3-8b | global | Non-Thinking and Thinking modes | CNY 0.5 | CNY 2 | CNY 5 |
Singapore
Model ID | Deployment scope | Mode | Input price (per 1M tokens) | Output price (per 1M tokens) | Free quota (Note) Valid for 90 days after activating Model Studio | |
Non-thinking mode | Thinking mode (CoT + answer) | |||||
qwen3-next-80b-a3b-thinking | international | Thinking mode only | CNY 1.101 | - | CNY 8.807 | No free quota |
qwen3-next-80b-a3b-instruct | international | Non-Thinking mode only | CNY 1.101 | CNY 8.807 | - | No free quota |
qwen3-235b-a22b-thinking-2507 | international | Thinking mode only | CNY 1.688 | - | CNY 16.88 | No free quota |
qwen3-235b-a22b-instruct-2507 | international | Non-Thinking mode only | CNY 1.688 | CNY 6.752 | - | No free quota |
qwen3-30b-a3b-thinking-2507 | international | Thinking mode only | CNY 1.468 | - | CNY 17.614 | No free quota |
qwen3-30b-a3b-instruct-2507 | international | Non-Thinking mode only | CNY 1.468 | CNY 5.871 | - | No free quota |
qwen3-235b-a22b | international | Non-Thinking and Thinking modes | CNY 5.137 | CNY 20.55 | CNY 61.65 | No free quota |
qwen3-32b | international | Non-Thinking and Thinking modes | CNY 1.174 | CNY 4.697 | CNY 4.697 | No free quota |
qwen3-30b-a3b | international | Non-Thinking and Thinking modes | CNY 1.468 | CNY 5.871 | CNY 17.614 | No free quota |
qwen3-14b | international | Non-Thinking and Thinking modes | CNY 2.569 | CNY 10.275 | CNY 30.825 | No free quota |
qwen3-8b | international | Non-Thinking and Thinking modes | CNY 1.321 | CNY 5.137 | CNY 15.412 | No free quota |
Germany (Frankfurt)
Model ID | Deployment scope | Mode | Input price (per 1M tokens) | Output price (per 1M tokens) | |
Non-thinking mode | Thinking mode (CoT + answer) | ||||
qwen3-next-80b-a3b-thinking | global | Thinking mode only | CNY 1 | - | CNY 10 |
qwen3-next-80b-a3b-instruct | global | Non-Thinking mode only | CNY 1 | CNY 4 | - |
qwen3-235b-a22b-thinking-2507 | global | Thinking mode only | CNY 1.688 | - | CNY 16.88 |
qwen3-235b-a22b-instruct-2507 | global | Non-Thinking mode only | CNY 1.688 | CNY 6.752 | - |
qwen3-30b-a3b-thinking-2507 | global | Thinking mode only | CNY 0.75 | - | CNY 7.5 |
qwen3-30b-a3b-instruct-2507 | global | Non-Thinking mode only | CNY 0.75 | CNY 3 | - |
qwen3-235b-a22b | global | Non-Thinking and Thinking modes | CNY 2 | CNY 8 | CNY 20 |
qwen3-32b | global | Non-Thinking and Thinking modes | CNY 1.174 | CNY 4.697 | CNY 4.697 |
qwen3-30b-a3b | global | Non-Thinking and Thinking modes | CNY 0.75 | CNY 3 | CNY 7.5 |
qwen3-14b | global | Non-Thinking and Thinking modes | CNY 1 | CNY 4 | CNY 10 |
qwen3-8b | global | Non-Thinking and Thinking modes | CNY 0.5 | CNY 2 | CNY 5 |
Qwen-Omni
You are billed for input and output tokens. For details on how tokens are calculated for different modalities, see billing and rate limits.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quotaGuidelines Valid for 90 days after you activate Alibaba Cloud Model Studio | ||||
Text | Audio | Image/video | Text Text-only input | Text Multimodal input | Text + audio Billed for audio only | |||
qwen2.5-omni-7b | Chinese mainland | CNY 0.6 | CNY 38 | CNY 2 | CNY 2.4 | CNY 6 | CNY 76 | 1 million tokens (any modality) |
Singapore
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | ||||
Text | Audio | Image/video | Text Text-only input | Text Multimodal input | Text + audio Billed for audio only | ||
qwen2.5-omni-7b | international | CNY 0.734 | CNY 49.613 | CNY 2.055 | CNY 2.936 | CNY 6.165 | CNY 99.153 |
Qwen3-Omni-Captioner
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-omni-30b-a3b-captioner | Chinese mainland | CNY 15.8 | CNY 12.7 | 1 million tokens |
Singapore
Model ID | Deployment scope | Input price | Output price |
qwen3-omni-30b-a3b-captioner | international | CNY 27.962 | CNY 22.458 |
Qwen-VL
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Mode | Input price (per 1 million tokens) | Output price (per 1 million tokens) chain of thought + answer | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3-vl-235b-a22b-thinking | Chinese mainland | Thinking mode | CNY 2 | CNY 20 | 1 million tokens |
qwen3-vl-235b-a22b-instruct | Chinese mainland | Non-thinking mode | CNY 2 | CNY 8 | 1 million tokens |
qwen3-vl-32b-thinking | Chinese mainland | Thinking mode | CNY 2 | CNY 20 | 1 million tokens |
qwen3-vl-32b-instruct | Chinese mainland | Non-thinking mode | CNY 2 | CNY 8 | 1 million tokens |
qwen3-vl-30b-a3b-thinking | Chinese mainland | Thinking mode | CNY 0.75 | CNY 7.5 | 1 million tokens |
qwen3-vl-30b-a3b-instruct | Chinese mainland | Non-thinking mode | CNY 0.75 | CNY 3 | 1 million tokens |
qwen3-vl-8b-thinking | Chinese mainland | Thinking mode | CNY 0.5 | CNY 5 | 1 million tokens |
qwen3-vl-8b-instruct | Chinese mainland | Non-thinking mode | CNY 0.5 | CNY 2 | 1 million tokens |
US (Virginia)
Model id | Deployment scope | Mode | Input price (per 1 million tokens) | Output price (per 1 million tokens) chain of thought + answer |
qwen3-vl-235b-a22b-thinking | Global | Thinking mode | CNY 2 | CNY 20 |
qwen3-vl-235b-a22b-instruct | Global | Non-thinking mode | CNY 2 | CNY 8 |
qwen3-vl-32b-thinking | Global | Thinking mode | CNY 1.174 | CNY 4.697 |
qwen3-vl-32b-instruct | Global | Non-thinking mode | CNY 1.174 | CNY 4.697 |
qwen3-vl-30b-a3b-thinking | Global | Thinking mode | CNY 0.75 | CNY 7.5 |
qwen3-vl-30b-a3b-instruct | Global | Non-thinking mode | CNY 0.75 | CNY 3 |
qwen3-vl-8b-thinking | Global | Thinking mode | CNY 0.5 | CNY 5 |
qwen3-vl-8b-instruct | Global | Non-thinking mode | CNY 0.5 | CNY 2 |
Singapore
Model id | Deployment scope | Mode | Input price (per 1 million tokens) | Output price (per 1 million tokens) chain of thought + answer |
qwen3-vl-235b-a22b-thinking | International | Thinking mode | CNY 2.936 | CNY 29.357 |
qwen3-vl-235b-a22b-instruct | International | Non-thinking mode | CNY 2.936 | CNY 11.743 |
qwen3-vl-32b-thinking | International | Thinking mode | CNY 1.174 | CNY 4.697 |
qwen3-vl-32b-instruct | International | Non-thinking mode | CNY 1.174 | CNY 4.697 |
qwen3-vl-30b-a3b-thinking | International | Thinking mode | CNY 1.468 | CNY 17.614 |
qwen3-vl-30b-a3b-instruct | International | Non-thinking mode | CNY 1.468 | CNY 5.871 |
qwen3-vl-8b-thinking | International | Thinking mode | CNY 1.321 | CNY 15.412 |
qwen3-vl-8b-instruct | International | Non-thinking mode | CNY 1.321 | CNY 5.137 |
Germany (Frankfurt)
Model id | Deployment scope | Mode | Input price (per 1 million tokens) | Output price (per 1 million tokens) chain of thought + answer |
qwen3-vl-235b-a22b-thinking | Global | Thinking mode | CNY 2 | CNY 20 |
qwen3-vl-235b-a22b-instruct | Global | Non-thinking mode | CNY 2 | CNY 8 |
qwen3-vl-32b-thinking | Global | Thinking mode | CNY 1.174 | CNY 4.697 |
qwen3-vl-32b-instruct | Global | Non-thinking mode | CNY 1.174 | CNY 4.697 |
qwen3-vl-30b-a3b-thinking | Global | Thinking mode | CNY 0.75 | CNY 7.5 |
qwen3-vl-30b-a3b-instruct | Global | Non-thinking mode | CNY 0.75 | CNY 3 |
qwen3-vl-8b-thinking | Global | Thinking mode | CNY 0.5 | CNY 5 |
qwen3-vl-8b-instruct | Global | Non-thinking mode | CNY 0.5 | CNY 2 |
Qwen-Audio
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment region | Input price | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen2-audio-instruct | Chinese mainland | Currently available for a free trial only. Once the free quota is exhausted, you can no longer call the model. We recommend Omni (Qwen-Omni) as an alternative. | 100,000 tokens | |
qwen-audio-chat | Chinese mainland | |||
Qwen-Coder
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input tokens | Input price | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3-coder-next | Chinese mainland | 0<Token≤32K | CNY 1 | CNY 4 | 1 million tokens |
32K<Token≤128K | CNY 1.5 | CNY 6 | |||
128K<Token≤256K | CNY 2.5 | CNY 10 | |||
qwen3-coder-480b-a35b-instruct | Chinese mainland | 0<Token≤32K | CNY 6 | CNY 24 | 1 million tokens |
32K<Token≤128K | CNY 9 | CNY 36 | |||
128K<Token≤200K | CNY 15 | CNY 60 | |||
qwen3-coder-30b-a3b-instruct | Chinese mainland | 0<Token≤32K | CNY 1.5 | CNY 6 | 1 million tokens |
32K<Token≤128K | CNY 2.25 | CNY 9 | |||
128K<Token≤200K | CNY 3.75 | CNY 15 |
US (Virginia)
Model ID | Deployment scope | Input tokens | Input price | Output price |
qwen3-coder-480b-a35b-instruct | global | 0<Token≤32K | CNY 6 | CNY 24 |
32K<Token≤128K | CNY 9 | CNY 36 | ||
128K<Token≤200K | CNY 15 | CNY 60 | ||
qwen3-coder-30b-a3b-instruct | global | 0<Token≤32K | CNY 1.5 | CNY 6 |
32K<Token≤128K | CNY 2.25 | CNY 9 | ||
128K<Token≤200K | CNY 3.75 | CNY 15 |
Singapore
Model ID | Deployment scope | Input tokens | Input price | Output price |
qwen3-coder-next | international | 0<Token≤32K | CNY 2.202 | CNY 11.009 |
32K<Token≤128K | CNY 3.67 | CNY 18.348 | ||
128K<Token≤256K | CNY 5.871 | CNY 29.357 | ||
qwen3-coder-480b-a35b-instruct | international | 0<Token≤32K | CNY 11.009 | CNY 55.044 |
32K<Token≤128K | CNY 19.816 | CNY 99.08 | ||
128K<Token≤200K | CNY 33.027 | CNY 165.133 | ||
qwen3-coder-30b-a3b-instruct | international | 0<Token≤32K | CNY 3.303 | CNY 16.513 |
32K<Token≤128K | CNY 5.504 | CNY 27.522 | ||
128K<Token≤200K | CNY 8.807 | CNY 44.035 |
Germany (Frankfurt)
Model ID | Deployment scope | Input tokens | Input price | Output price |
qwen3-coder-30b-a3b-instruct | global | 0<Token≤32K | CNY 1.5 | CNY 6 |
32K<Token≤128K | CNY 2.25 | CNY 9 | ||
128K<Token≤200K | CNY 3.75 | CNY 15 | ||
qwen3-coder-480b-a35b-instruct | global | 0<Token≤32K | CNY 6 | CNY 24 |
32K<Token≤128K | CNY 9 | CNY 36 | ||
128K<Token≤200K | CNY 15 | CNY 60 | ||
qwen3-coder-next | EU | 0<Token≤32K | CNY 2.248 | CNY 11.241 |
32K<Token≤128K | CNY 3.747 | CNY 18.736 | ||
128K<Token≤256K | CNY 5.995 | CNY 29.977 |
Text generation - third-party models
DeepSeek
You are charged for input tokens and output tokens.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Output price Chain of thought + answer | Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
deepseek-v4-pro context caching discount | Chinese mainland | CNY 12 | CNY 24 | 1 million tokens |
deepseek-v4-flash context caching discount | Chinese mainland | CNY 1 | CNY 2 | 1 million tokens |
deepseek-v3.2 context caching discount | Chinese mainland | CNY 2 | CNY 3 | 1 million tokens |
deepseek-v3.2-exp | Chinese mainland | CNY 2 | CNY 3 | 1 million tokens |
deepseek-v3.1 | Chinese mainland | CNY 4 | CNY 12 | 1 million tokens |
deepseek-r1 50% discount for batch inference | Chinese mainland | CNY 4 | CNY 16 | 1 million tokens |
deepseek-r1-0528 | Chinese mainland | CNY 4 | CNY 16 | 1 million tokens |
deepseek-v3 50% discount for batch inference | Chinese mainland | CNY 2 | CNY 8 | 1 million tokens |
deepseek-r1-distill-qwen-1.5b | Chinese mainland | Limited-time free | ||
deepseek-r1-distill-qwen-7b | Chinese mainland | CNY 0.5 | CNY 1 | 1 million tokens |
deepseek-r1-distill-qwen-14b | Chinese mainland | CNY 1 | CNY 3 | 1 million tokens |
deepseek-r1-distill-qwen-32b | Chinese mainland | CNY 2 | CNY 6 | 1 million tokens |
deepseek-r1-distill-llama-8b | Chinese mainland | Limited-time free | ||
deepseek-r1-distill-llama-70b | Chinese mainland | Currently available for free trial only. You cannot call the model after the free quota is exhausted. Alternative models include Deep thinking, DeepSeek - Alibaba Cloud, and Kimi - Alibaba Cloud. | 1 million tokens | |
US (Virginia)
Model ID | Deployment scope | Input price | Output price Chain of thought + answer |
deepseek-v4-pro context caching discount | Global | CNY 12 | CNY 24 |
deepseek-v4-flash context caching discount | Global | CNY 1 | CNY 2 |
Singapore
Model ID | Deployment scope | Input price | Output price Chain of thought + answer |
deepseek-v4-pro context caching discount | International | CNY 17.986 | CNY 35.972 |
deepseek-v4-flash context caching discount | International | CNY 1.499 | CNY 2.998 |
deepseek-v3.2 context caching discount | International | CNY 4.272 | CNY 12.815 |
Germany (Frankfurt)
Model ID | Deployment scope | Input price | Output price Chain of thought + answer |
deepseek-v4-pro context caching discount | Global | CNY 12 | CNY 24 |
deepseek-v4-flash context caching discount | Global | CNY 1 | CNY 2 |
DeepSeek-SiliconFlow
China (Beijing)
Model ID | Service region | Input price | Output price Chain of thought and answer | Free quota |
siliconflow/deepseek-v3.2 | Chinese mainland | CNY 2 | CNY 3 | None |
siliconflow/deepseek-v3.1-terminus | Chinese mainland | CNY 4 | CNY 12 | |
siliconflow/deepseek-r1-0528 | Chinese mainland | CNY 4 | CNY 16 | |
siliconflow/deepseek-v3-0324 | Chinese mainland | CNY 2 | CNY 8 |
DeepSeek-Kuaishou Wanqing
China (Beijing)
Model ID | Deployment scope | Input price | Output price Chain of thought + answer | Free quota |
vanchin/deepseek-v3.2-think Discount for context caching | Chinese mainland | CNY 2 | CNY 3 | None |
vanchin/deepseek-v3.1-terminus Discount for context caching | Chinese mainland | CNY 4 | CNY 12 | |
vanchin/deepseek-r1 Discount for context caching | Chinese mainland | CNY 4 | CNY 16 | |
vanchin/deepseek-v3 Discount for context caching | Chinese mainland | CNY 2 | CNY 8 | |
vanchin/deepseek-ocr | Chinese mainland | CNY 0.216 | CNY 0.216 |
Kimi
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Mode | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
kimi-k2.7-code | Chinese mainland | Thinking mode only | CNY 6.5 | CNY 27 | 1 million tokens |
kimi-k2.6 | Chinese mainland | Thinking and Non-Thinking modes | CNY 6.5 | CNY 27 | 1 million tokens |
kimi-k2.5 | Chinese mainland | Thinking and Non-Thinking modes | CNY 4 | CNY 21 | 1 million tokens |
kimi-k2-thinking | Chinese mainland | Thinking mode only | CNY 4 | CNY 16 | 1 million tokens |
Moonshot-Kimi-K2-Instruct | Chinese mainland | Non-Thinking mode only | CNY 4 | CNY 16 | 1 million tokens |
US (Virginia)
Model id | Deployment scope | Mode | Input price | Output price |
kimi-k2.7-code | global | Thinking mode only | CNY 6.5 | CNY 27 |
kimi-k2.5 | global | Thinking and Non-Thinking modes | CNY 4 | CNY 21 |
Germany (Frankfurt)
Model id | Deployment scope | Mode | Input price | Output price |
kimi-k2.7-code | global | Thinking mode only | CNY 6.5 | CNY 27 |
kimi-k2.5 | global | Thinking and Non-Thinking modes | CNY 4 | CNY 21 |
Kimi-Moonshot AI
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Input price | Output price Chain of thought | Free quota (Note) |
kimi/kimi-k2.7-code Discount for context caching | Chinese mainland | CNY 6.5 | CNY 27 | None |
kimi/kimi-k2.6 Discount for context caching | Chinese mainland | CNY 6.5 | CNY 27 | |
kimi/kimi-k2.5 Discount for context caching | Chinese mainland | CNY 4 | CNY 21 |
GLM
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
glm-5.1 | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 6 | CNY 24 | 1 million tokens |
32K<token≤200K | CNY 8 | CNY 28 | ||||
glm-5 | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 4 | CNY 18 | 1 million tokens |
32K<token≤198K | CNY 6 | CNY 22 | ||||
glm-4.7 | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 3 | CNY 14 | 1 million tokens |
32K<token≤166K | CNY 4 | CNY 16 | ||||
glm-4.6 | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 3 | CNY 14 | 1 million tokens |
32K<token≤166K | CNY 4 | CNY 16 | ||||
glm-4.5 | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 3 | CNY 14 | 1 million tokens |
32K<token≤96K | CNY 4 | CNY 16 | ||||
glm-4.5-air | Chinese mainland | non-thinking and thinking modes | 0<token≤32K | CNY 0.8 | CNY 6 | 1 million tokens |
32K<token≤96K | CNY 1.2 | CNY 8 |
US (Virginia)
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
glm-5.1 | Global | non-thinking and thinking modes | 0<token≤32K | CNY 6 | CNY 24 |
32K<token≤200K | CNY 8 | CNY 28 |
Singapore
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
glm-5.1 | International | non-thinking and thinking modes | 0<token≤200K | CNY 10.492 | CNY 32.974 | Not available |
Germany (Frankfurt)
Model ID | Deployment scope | Mode | Input tokens | Input price | Output price Chain of thought and answer |
glm-5.1 | Global | non-thinking and thinking modes | 0<token≤32K | CNY 6 | CNY 24 |
32K<token≤200K | CNY 8 | CNY 28 |
GLM-Zhipu AI
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Mode | Input price | Output price Chain of thought and answer | Free quota (Note) |
ZHIPU/GLM-5.1 | Chinese mainland | Non-thinking and Thinking modes | CNY 8 | CNY 28 | None |
ZHIPU/GLM-5 | Chinese mainland | Non-thinking and Thinking modes | CNY 6 | CNY 22 | None |
MiniMax
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment region | Mode | Input price | Output price Chain of thought | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
MiniMax-M2.5 | Chinese mainland | Chain of thought mode only | CNY 2.1 | CNY 8.4 | 1 million tokens |
MiniMax-M2.1 | Chinese mainland | Chain of thought mode only | CNY 2.1 | CNY 8.4 |
MiniMax
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Region | Mode | Input price | Output price Chain of thought | Free quota (note) |
MiniMax/MiniMax-M3 Discount on context caching | Chinese mainland | Thinking and non-thinking modes | CNY 4.2 | CNY 16.8 | None |
MiniMax/MiniMax-M2.7 Discount on context caching | Chinese mainland | Thinking mode only | CNY 2.1 | CNY 8.4 | |
MiniMax/MiniMax-M2.5 Discount on context caching | Chinese mainland | Thinking mode only | CNY 2.1 | CNY 8.4 | |
MiniMax/MiniMax-M2.1 Discount on context caching | Chinese mainland | Thinking mode only | CNY 2.1 | CNY 8.4 |
MiMo-Xiaomi
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Region | Input tokens | Input price | Output price Chain of thought | Free quota (Note) |
xiaomi/mimo-v2.5-pro | Chinese mainland | 0 < tokens ≤ 256K | CNY 7 | CNY 21 | None |
256K < tokens ≤ 1M | CNY 14 | CNY 42 |
Stepfun-StepFun
China (Beijing)
Model ID | Deployment scope | Input price | Output price Chain of thought and answer | Free quota (Note) |
stepfun/step-3.7-flash | Chinese mainland | CNY 1.35 | CNY 8.1 | None |
Image generation
You are not charged for input. You are charged for output based on the number of successfully generated images.
Formula: Cost = Image unit price × Number of images generated.
Notes:
-
Cost does not depend on image resolution or aspect ratio.
-
Failed requests incur no cost and do not consume your free quota.
Qwen text-to-image
You are billed for output only. For billing rules, see image generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-image-2.0-pro | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-2.0-pro-2026-04-22 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-2.0-pro-2026-03-03 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-2.0 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-2.0-2026-03-03 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-max Currently an alias for qwen-image-max-2025-12-30 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-max-2025-12-30 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-plus Currently an alias for qwen-image | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-plus-2026-01-09 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image | Chinese mainland | CNY 0.25 per image | 100 images |
Singapore
Model ID | Deployment scope | Output price |
qwen-image-2.0-pro | International | CNY 0.550443 per image |
qwen-image-2.0-pro-2026-04-22 | International | CNY 0.550443 per image |
qwen-image-2.0-pro-2026-03-03 | International | CNY 0.550443 per image |
qwen-image-2.0 | International | CNY 0.256873 per image |
qwen-image-2.0-2026-03-03 | International | CNY 0.256873 per image |
qwen-image-max Currently an alias for qwen-image-max-2025-12-30 | International | CNY 0.550443 per image |
qwen-image-max-2025-12-30 | International | CNY 0.550443 per image |
qwen-image-plus Currently an alias for qwen-image | International | CNY 0.220177 per image |
qwen-image-plus-2026-01-09 | International | CNY 0.220177 per image |
qwen-image | International | CNY 0.256873 per image |
Qwen image editing
You are billed for output only. For billing rules, see image generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Region | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-image-2.0-pro | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-2.0-pro-2026-04-22 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-2.0-pro-2026-03-03 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-2.0 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-2.0-2026-03-03 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-edit-max Currently equivalent to qwen-image-edit-max-2026-01-16 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-edit-max-2026-01-16 | Chinese mainland | CNY 0.5 per image | 100 images |
qwen-image-edit-plus Currently equivalent to qwen-image-edit-plus-2025-10-30 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-edit-plus-2025-12-15 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-edit-plus-2025-10-30 | Chinese mainland | CNY 0.2 per image | 100 images |
qwen-image-edit | Chinese mainland | CNY 0.3 per image | 100 images |
Singapore
Model ID | Region | Output price |
qwen-image-2.0-pro | International | CNY 0.550443 per image |
qwen-image-2.0-pro-2026-04-22 | International | CNY 0.550443 per image |
qwen-image-2.0-pro-2026-03-03 | International | CNY 0.550443 per image |
qwen-image-2.0 | International | CNY 0.256873 per image |
qwen-image-2.0-2026-03-03 | International | CNY 0.256873 per image |
qwen-image-edit-max Currently equivalent to qwen-image-edit-max-2026-01-16 | International | CNY 0.550443 per image |
qwen-image-edit-max-2026-01-16 | International | CNY 0.550443 per image |
qwen-image-edit-plus Currently equivalent to qwen-image-edit-plus-2025-10-30 | International | CNY 0.220177 per image |
qwen-image-edit-plus-2025-12-15 | International | CNY 0.220177 per image |
qwen-image-edit-plus-2025-10-30 | International | CNY 0.220177 per image |
qwen-image-edit | International | CNY 0.330266 per image |
Qwen Image Translation
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Price | Free quota(Note) Expires 90 days after you activate Alibaba Cloud Model Studio |
qwen-mt-image | Chinese mainland | CNY 0.003 per image | 100 images |
Z-Image
You are billed for output only. For billing rules, see image generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
z-image-turbo | Chinese mainland | Prompt rewriting disabled ( Prompt rewriting enabled ( | 100 images |
Singapore
Model id | Deployment scope | Output price |
z-image-turbo | international | Prompt rewriting disabled ( Prompt rewriting enabled ( |
Wanx text-to-image
You are billed for output only. For billing rules, see image generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
wan2.6-t2i | Chinese mainland | CNY 0.20 per image | 50 images |
wan2.5-t2i-preview | Chinese mainland | CNY 0.20 per image | 50 images |
wan2.2-t2i-plus | Chinese mainland | CNY 0.20 per image | 100 images |
wan2.2-t2i-flash | Chinese mainland | CNY 0.14 per image | 100 images |
wanx2.1-t2i-plus | Chinese mainland | CNY 0.20 per image | 500 images |
wanx2.1-t2i-turbo | Chinese mainland | CNY 0.14 per image | 500 images |
wanx2.0-t2i-turbo | Chinese mainland | CNY 0.04 per image | 500 images |
wanx-v1 | Chinese mainland | CNY 0.16 per image | 500 images |
US (Virginia)
Model ID | Deployment scope | Output price |
wan2.6-t2i | global | CNY 0.20 per image |
Singapore
Model ID | Deployment scope | Output price |
wan2.6-t2i | international | CNY 0.220177 per image |
wan2.5-t2i-preview | international | CNY 0.220177 per image |
wan2.2-t2i-plus | international | CNY 0.366962 per image |
wan2.2-t2i-flash | international | CNY 0.183481 per image |
wanx2.1-t2i-plus | international | CNY 0.366962 per image |
wanx2.1-t2i-turbo | international | CNY 0.183481 per image |
Germany (Frankfurt)
Model ID | Deployment scope | Output price |
wan2.6-t2i | global | CNY 0.20 per image |
Wanx image generation and editing
You are billed for output only. For billing rules, see image generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
wan2.7-image-pro | chinese mainland | CNY 0.50 per image | 50 images |
wan2.7-image | chinese mainland | CNY 0.20 per image | 50 images |
wan2.6-image | chinese mainland | CNY 0.20 per image | 50 images |
US (Virginia)
Model ID | Deployment scope | Output price |
wan2.6-image | global | CNY 0.20 per image |
Singapore
Model ID | Deployment scope | Output price |
wan2.7-image-pro | international | CNY 0.562065 per image |
wan2.7-image | international | CNY 0.220177 per image |
wan2.6-image | international | CNY 0.220177 per image |
Germany (Frankfurt)
Model ID | Deployment scope | Output price |
wan2.6-image | global | CNY 0.20 per image |
Wan general-purpose image editing
You are billed for output only. For billing rules, see image generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
wan2.5-i2i-preview | Chinese mainland | CNY 0.20 per image | 50 images |
wanx2.1-imageedit | Chinese mainland | CNY 0.14 per image | 500 images |
Singapore
Model ID | Deployment scope | Output price |
wan2.5-i2i-preview | International | CNY 0.220177 per image |
Wanx Sketch-to-Image
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Available regions | Price per image | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wanx-sketch-to-image-lite | Chinese mainland | CNY 0.06 per image | 500 images |
Wanx Image Inpainting
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wanx-x-painting | Chinese mainland | Currently available for a free trial only. After you exhaust the free quota, you can no longer call this model. For alternatives, see Image editing - Qwen or Image editing - Wan 2.1. | 500 images |
Portrait style repaint
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Service region | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio. |
wanx-style-repaint-v1 | Chinese mainland | CNY 0.12/image | 500 images |
Image background generation
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Valid for 90 days after activation of Alibaba Cloud Model Studio |
wanx-background-generation-v2 | Chinese mainland | CNY 0.08 per image | 500 images |
Image outpainting
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model id | Deployment scope | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
image-out-painting | Chinese mainland | CNY 0.18 per image | 500 images |
Human instance segmentation
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Availability | Price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio. |
image-instance-segmentation | Chinese mainland | Currently available as a free trial only. You cannot call the model once you exhaust the free quota. | 500 images |
Image erasing and inpainting
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio. |
image-erase-completion | Chinese mainland | Free trial only. Once the free quota is exhausted, you will be unable to call the model. Consider using Image editing - Qwen or Image editing - Wan2.1 as alternatives. | 500 images |
Virtual model
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Service region | Output price | Free Tier (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wanx-virtualmodel | Chinese mainland | Currently available for a free trial only. Once your free quota is exhausted, you cannot call the model. Consider using image editing - Qwen or image editing - Wan2.1 as alternatives. | 500 images each |
virtualmodel-v2 | Chinese mainland |
Footwear model
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Expires 90 days after you activate Alibaba Cloud Model Studio |
shoemodel-v1 | Chinese mainland | Available for free trial only. Once you use up the free quota, you can no longer call the model. | 500 images |
Creative poster generation
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wanx-poster-generation-v1 | Chinese mainland | Currently available as a free trial only. Once the free quota is used up, this model becomes unavailable. Consider using Image editing - Qwen or Image editing - Wan2.1 as alternatives. | 500 images |
Portrait generation - FaceChain
facechain-facedetect: Free for a limited time.facechain-finetune: Billed per training run. You are not charged for failed requests.facechain-generation: You are not billed for input. You are billed for each successfully generated image. For billing rules, see image generation.
China (Beijing)
Model ID | Service region | Unit price | Free quota(note) |
facechain-facedetect | Chinese mainland | Free for a limited time | Free for a limited time |
facechain-finetune | Chinese mainland | CNY 2.5 per training run | 50 training runs Valid for 90 days after application approval |
facechain-generation | Chinese mainland | CNY 0.18 per image | 500 images Valid for 90 days after application approval |
Creative text generation - WordArt
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model id | Region | Price | Free quota (Note) Expires 90 days after you activate Alibaba Cloud Model Studio. |
wordart-texture | Chinese mainland | CNY 0.08 per image | 500 images |
wordart-semantic | Chinese mainland | CNY 0.24 per image |
AI Virtual Try-on - OutfitAnyone
aitryon: You are billed for output only. For billing details, see image generation.
aitryon-plus: You are billed for output only. For billing details, see image generation.
aitryon-parsing-v1: You are billed per input image, and output is free. You are not charged for failed requests.
aitryon-refiner: You are billed for output only. For billing details, see image generation.
China (Beijing)
Model id | Deployment scope | Free quotaGuidelines Valid for 90 days after you activate Alibaba Cloud Model Studio |
aitryon | Chinese mainland | 400 images |
aitryon-plus | Chinese mainland | 400 images |
aitryon-parsing-v1 | Chinese mainland | 400 images |
aitryon-refiner | Chinese mainland | 100 images |
China (Beijing)
Model id | Deployment scope | Unit price | Discount | Pricing tier |
aitryon | Chinese mainland | CNY 0.20 per image | None | None |
aitryon-plus | Chinese mainland | CNY 0.50 per image | None | None |
aitryon-parsing-v1 | Chinese mainland | CNY 0.004 per image | None | None |
aitryon-refiner | Chinese mainland | CNY 0.30 per image | None | Generated quantity ≤ 25 images |
CNY 0.275 per image | 8% | 25 images < generated quantity ≤ 125 images | ||
CNY 0.25 per image | 16% | 125 images < generated quantity ≤ 250 images | ||
CNY 0.225 per image | 25% | 250 images < generated quantity ≤ 1,250 images | ||
CNY 0.20 per image | 33% | 1,250 images < generated quantity ≤ 2,500 images | ||
CNY 0.175 per image | 42% | 2,500 images < generated quantity ≤ 25,000 images | ||
CNY 0.15 per image | 50% | Generated quantity > 25,000 images |
Image generation - third-party models
Kling-Image-Generation
You are billed for output only. For billing rules, see image generation.
China (Beijing)
Model ID | Deployment scope | Output resolution | Price | Free quota (Note) |
kling/kling-v3-image-generation | Chinese mainland | 1K | CNY 0.2 per image | No free quota. |
2K | CNY 0.2 per image | |||
kling/kling-v3-omni-image-generation | Chinese mainland | 1K | CNY 0.2 per image | |
2K | CNY 0.2 per image | |||
4K | CNY 0.4 per image |
Music generation
Billing: Charged per second of audio output. Input is free of charge.
China (Beijing)
Model ID | Region | Price (per second) | Free quotaNote Valid for 90 days after you activate Alibaba Cloud Model Studio |
fun-music-preview | Chinese mainland | CNY 0.005 | 1,000 seconds |
fun-music-v1 | Chinese mainland | CNY 0.002 |
Text-to-speech
Qwen-TTS
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Qwen3-TTS-Instruct-Flash
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-instruct-flash | Chinese mainland | CNY 0.8 | Free | 10,000 characters |
qwen3-tts-instruct-flash-2026-01-26 | Chinese mainland | CNY 0.8 | Free | 10,000 characters |
Qwen3-TTS-VD
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-vd-2026-01-26 | Chinese mainland | CNY 0.8 | Free | 10,000 characters |
Qwen3-TTS-VC
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-vc-2026-01-22 | Chinese mainland | CNY 0.8 | Free | 10,000 characters |
Qwen3-TTS-Flash
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-flash Currently an alias for qwen3-tts-flash-2025-11-27 | Chinese mainland | CNY 0.8 | Free | 10,000 characters |
qwen3-tts-flash-2025-11-27 | Chinese mainland | CNY 0.8 | Free | 10,000 characters |
qwen3-tts-flash-2025-09-18 | Chinese mainland | CNY 0.8 | Free | For accounts activating Alibaba Cloud Model Studio after 00:00 (UTC+8) on November 13, 2025: 10,000 characters |
Qwen-TTS
Pricing is based on both input and output tokens.
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-tts-flash | Chinese mainland | CNY 1.6 | CNY 10 | 1 million tokens |
qwen-tts-latest | Chinese mainland | CNY 1.6 | CNY 10 | 1 million tokens |
qwen-tts-2025-05-22 | Chinese mainland | CNY 1.6 | CNY 10 | 1 million tokens |
qwen-tts-2025-04-10 | Chinese mainland | CNY 1.6 | CNY 10 | 1 million tokens |
Singapore
Qwen3-TTS-Instruct-Flash
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-instruct-flash | International | CNY 0.8 |
qwen3-tts-instruct-flash-2026-01-26 | International | CNY 0.8 |
Qwen3-TTS-VD
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-vd-2026-01-26 | International | CNY 0.8 |
Qwen3-TTS-VC
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-vc-2026-01-22 | International | CNY 0.8 |
Qwen3-TTS-Flash
Pricing is based on the number of characters in the input text. Output is free of charge.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-flash Currently an alias for qwen3-tts-flash-2025-11-27 | International | CNY 0.733924 |
qwen3-tts-flash-2025-11-27 | International | CNY 0.733924 |
qwen3-tts-flash-2025-09-18 | International | CNY 0.733924 |
Qwen-TTS-Realtime
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Qwen3-TTS-Instruct-Flash-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-instruct-flash-realtime | Chinese mainland | CNY 1 | Not charged | 10,000 characters |
qwen3-tts-instruct-flash-realtime-2026-01-22 | Chinese mainland | CNY 1 | Not charged | 10,000 characters |
Qwen3-TTS-VD-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-vd-realtime-2026-01-15 | Chinese mainland | CNY 1 | Not charged | 10,000 characters |
qwen3-tts-vd-realtime-2025-12-16 | Chinese mainland | CNY 1 | Not charged | 10,000 characters |
Qwen3-TTS-VC-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-vc-realtime-2026-01-15 | Chinese mainland | CNY 1 | Not charged | 10,000 characters |
qwen3-tts-vc-realtime-2025-11-27 | Chinese mainland | 10,000 characters |
Qwen3-TTS-Flash-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen3-tts-flash-realtime | Chinese mainland | CNY 1 | Not charged | 10,000 characters for users activating Alibaba Cloud Model Studio after 00:00 (UTC+8) on November 13, 2025. |
qwen3-tts-flash-realtime-2025-11-27 | Chinese mainland | CNY 1 | Not charged | 10,000 characters |
qwen3-tts-flash-realtime-2025-09-18 | Chinese mainland | CNY 1 | Not charged | 10,000 characters for users activating Alibaba Cloud Model Studio after 00:00 (UTC+8) on November 13, 2025. |
Qwen-TTS-Realtime
Pricing rule: Billed based on the number of input and output tokens.
Model ID | Deployment scope | Input price (per 1 million tokens) | Output price (per 1 million tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-tts-realtime | Chinese mainland | CNY 2.4 | CNY 12 | 1 million tokens |
qwen-tts-realtime-latest | Chinese mainland | CNY 2.4 | CNY 12 | 1 million tokens |
qwen-tts-realtime-2025-07-15 | Chinese mainland | CNY 2.4 | CNY 12 | 1 million tokens |
Singapore
Qwen3-TTS-Instruct-Flash-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-instruct-flash-realtime | International | CNY 1 |
qwen3-tts-instruct-flash-realtime-2026-01-22 | International | CNY 1 |
Qwen3-TTS-VD-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-vd-realtime-2026-01-15 | International | CNY 0.954101 |
qwen3-tts-vd-realtime-2025-12-16 | International | CNY 0.954101 |
Qwen3-TTS-VC-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-vc-realtime-2026-01-15 | International | CNY 0.954101 |
qwen3-tts-vc-realtime-2025-11-27 | International |
Qwen3-TTS-Flash-Realtime
Pricing rule: Billed based on the number of input characters. Output is not charged.
Model ID | Deployment scope | Input price (per 10,000 characters) |
qwen3-tts-flash-realtime | International | CNY 0.954101 |
qwen3-tts-flash-realtime-2025-11-27 | International | CNY 0.954101 |
qwen3-tts-flash-realtime-2025-09-18 | International | CNY 0.954101 |
Qwen-TTS voice cloning
You are billed for each new voice clone you create.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Price (per clone) | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen-voice-enrollment | Chinese mainland | CNY 0.01 | 1,000 voice clones/account |
Singapore
Model ID | Deployment scope | Price (per clone) |
qwen-voice-enrollment | International | CNY 0.01 |
Qwen-TTS voice design
Billing rules: You are billed for creating each new voice clone.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Price (per voice clone) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-voice-design | chinese mainland | CNY 0.2 | 10 voice clones/account |
Singapore
Model ID | Deployment scope | Price (per voice clone) |
qwen-voice-design | international | CNY 0.2 |
CosyVoice
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Billing rule: Billing is based on the number of input characters; output is not charged.
Model ID | Deployment scope | Input price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
cosyvoice-v3.5-plus | Chinese mainland | CNY 1.5 | 10,000 characters |
cosyvoice-v3.5-flash | Chinese mainland | CNY 0.8 | 10,000 characters |
cosyvoice-v3-plus | Chinese mainland | CNY 2 | 10,000 characters |
cosyvoice-v3-flash | Chinese mainland | CNY 1 | 10,000 characters |
cosyvoice-v2 | Chinese mainland | CNY 2 | 10,000 characters |
cosyvoice-v1 | Chinese mainland | CNY 2 | 10,000 characters |
Singapore
Billing rule: Billing is based on the number of input characters; output is not charged.
Model ID | Deployment scope | Input price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
cosyvoice-v3-plus | International | CNY 1.9082 | N/A |
cosyvoice-v3-flash | International | CNY 0.9541 | N/A |
Sambert
Billing is based on the number of input characters. Output is free.
China (Beijing)
Model ID | Deployment scope | Input price (per 10,000 characters) | Free quota(Note) |
See the Java SDK | Chinese mainland | CNY 1 | Each Alibaba Cloud account receives a free monthly quota of 30,000 characters per model. |
MiniMax
Billing is based on the number of characters in the input text, and the output is free.
Voice cloning incurs a one-time fee. This fee is billed along with the speech synthesis fee when the cloned voice is first used.
Model name | Deployment scope | Price (per 10,000 characters) | Free quota (Note) | |
MiniMax/speech-2.8-hd | Chinese mainland | CNY 3.5 | CNY 9.9 (Charged when first used for speech synthesis) | None |
MiniMax/speech-02-hd | Chinese mainland | CNY 3.5 | ||
MiniMax/speech-2.8-turbo | Chinese mainland | CNY 2 | ||
MiniMax/speech-02-turbo | Chinese mainland | CNY 2 |
Speech recognition and translation
Qwen-LiveTranslate-Flash-Realtime
Billing is based on the number of input and output tokens. For details about how tokens are calculated for different modalities, see Billing.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price (per 1,000,000 tokens) | Output price (per 1,000,000 tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio. | ||
Input: audio | Input: image | Output: text | Output: audio | |||
qwen3.5-livetranslate-flash-realtime | Chinese mainland | CNY 40 | CNY 3.3 | CNY 100 | CNY 160 | 1 million tokens |
qwen3.5-livetranslate-flash-realtime-2026-05-19 | Chinese mainland | CNY 40 | CNY 3.3 | CNY 100 | CNY 160 | 1 million tokens |
qwen3-livetranslate-flash-realtime | Chinese mainland | CNY 64 | CNY 8 | CNY 64 | CNY 240 | 1 million tokens |
qwen3-livetranslate-flash-realtime-2025-09-22 | Chinese mainland | CNY 64 | CNY 8 | CNY 64 | CNY 240 | 1 million tokens |
Singapore
Model ID | Deployment scope | Input price (per 1,000,000 tokens) | Output price (per 1,000,000 tokens) | ||
Input: audio | Input: image | Output: text | Output: audio | ||
qwen3.5-livetranslate-flash-realtime | International | CNY 56.207 | CNY 4.122 | CNY 149.884 | CNY 224.826 |
qwen3.5-livetranslate-flash-realtime-2026-05-19 | International | CNY 56.207 | CNY 4.122 | CNY 149.884 | CNY 224.826 |
qwen3-livetranslate-flash-realtime | International | CNY 73.392 | CNY 9.541 | CNY 73.392 | CNY 278.891 |
qwen3-livetranslate-flash-realtime-2025-09-22 | International | CNY 73.392 | CNY 9.541 | CNY 73.392 | CNY 278.891 |
Qwen-LiveTranslate-Flash
Billing is based on input and output tokens. For details on the token calculation rules for different modalities, see Billing Details.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price (per 1M tokens) | Output price (per 1M tokens) | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio | ||
Input: audio | Input: image | Output: text | Output: audio | |||
qwen3-livetranslate-flash | chinese mainland | CNY 10 | CNY 4 | CNY 10 | CNY 40 | 1 million tokens |
qwen3-livetranslate-flash-2025-12-01 | chinese mainland | CNY 10 | CNY 4 | CNY 10 | CNY 40 | 1 million tokens |
Singapore
Model ID | Deployment scope | Input price (per 1M tokens) | Output price (per 1M tokens) | ||
Input: audio | Input: image | Output: text | Output: audio | ||
qwen3-livetranslate-flash | international | CNY 11.573 | CNY 4.629 | CNY 11.573 | CNY 46.292 |
qwen3-livetranslate-flash-2025-12-01 | international | CNY 11.573 | CNY 4.629 | CNY 11.573 | CNY 46.292 |
Qwen-ASR
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Pricing rule: You are billed per second of input audio. Output is free of charge.
Model ID | Deployment scope | Input price | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3-asr-flash-filetrans | Chinese mainland | CNY 0.00022 per second | Not charged | 36,000 seconds (10 hours) |
qwen3-asr-flash-filetrans-2025-11-17 | Chinese mainland | 36,000 seconds (10 hours) | ||
qwen3-asr-flash Currently equivalent to qwen3-asr-flash-2025-09-08 | Chinese mainland | 36,000 seconds (10 hours) | ||
qwen3-asr-flash-2026-02-10 | Chinese mainland | 36,000 seconds (10 hours) | ||
qwen3-asr-flash-2025-09-08 | Chinese mainland | 36,000 seconds (10 hours) |
US (Virginia)
Pricing rule: You are billed per second of input audio. Output is free of charge.
Model ID | Deployment scope | Input price | Output price |
qwen3-asr-flash-us | US | CNY 0.000035 per second | Not charged |
qwen3-asr-flash-2025-09-08-us | US | CNY 0.000035 per second |
Singapore
Pricing rule: You are billed per second of input audio. Output is free of charge.
Model ID | Deployment scope | Input price | Output price |
qwen3-asr-flash-filetrans | international | CNY 0.00026 per second | Not charged |
qwen3-asr-flash-filetrans-2025-11-17 | international | CNY 0.00026 per second | |
qwen3-asr-flash Currently equivalent to qwen3-asr-flash-2025-09-08 | international | CNY 0.00026 per second | |
qwen3-asr-flash-2026-02-10 | international | CNY 0.00026 per second | |
qwen3-asr-flash-2025-09-08 | international | CNY 0.00026 per second |
Qwen-ASR-Realtime
Charges for input audio are calculated per second. Output is not charged.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3-asr-flash-realtime | chinese mainland | CNY 0.00033 per second | 36,000 seconds (10 hours) |
qwen3-asr-flash-realtime-2026-02-10 | chinese mainland | 36,000 seconds (10 hours) | |
qwen3-asr-flash-realtime-2025-10-27 | chinese mainland | 36,000 seconds (10 hours) |
Singapore
Model ID | Deployment scope | Input price |
qwen3-asr-flash-realtime | international | CNY 0.00066 per second |
qwen3-asr-flash-realtime-2026-02-10 | international | |
qwen3-asr-flash-realtime-2025-10-27 | International |
Fun-ASR
Audio file recognition
Billing rule: Billed per second of input audio. The output is not billed.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Free quotaGuidelines Valid for 90 days after you activate Alibaba Cloud Model Studio |
fun-asr | Chinese mainland | CNY 0.00022 per second | 36,000 seconds (10 hours) |
fun-asr-2025-11-07 | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-2025-08-25 | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-mtl | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-mtl-2025-08-25 | Chinese mainland | 36,000 seconds (10 hours) |
Singapore
Model ID | Deployment scope | Input price |
fun-asr | International | CNY 0.00026 per second |
fun-asr-2025-11-07 | International | |
fun-asr-2025-08-25 | International | |
fun-asr-mtl | International | |
fun-asr-mtl-2025-08-25 | International |
Real-time speech recognition
Billing rule: Billed per second of input audio. The output is not billed.
China (Beijing)
Model ID | Deployment scope | Input price | Free quotaGuidelines Valid for 90 days after you activate Alibaba Cloud Model Studio |
fun-asr-realtime | Chinese mainland | CNY 0.00033 per second | 36,000 seconds (10 hours) |
fun-asr-realtime-2026-02-28 | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-realtime-2025-11-07 | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-realtime-2025-09-15 | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-mtl-realtime | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-mtl-realtime-2025-12-10 | Chinese mainland | 36,000 seconds (10 hours) | |
fun-asr-flash-8k-realtime | Chinese mainland | CNY 0.00022 per second | 36,000 seconds (10 hours) |
fun-asr-flash-8k-realtime-2026-01-28 | Chinese mainland | 36,000 seconds (10 hours) |
Singapore
Model ID | Deployment scope | Input price |
fun-asr-realtime | International | CNY 0.00066 per second |
fun-asr-realtime-2025-11-07 | International |
Paraformer
Audio file recognition
Charges apply per second of input audio. Output is not billed.
China (Beijing)
Model ID | Deployment scope | Input price | Free quota(Note) |
paraformer-v2 | Chinese mainland | CNY 0.00008 per second | 36,000 seconds (10 hours) Issued at 00:00 (UTC+8) on the first day of each month. Expires after one month. |
paraformer-8k-v2 | Chinese mainland | ||
paraformer-v1 | Chinese mainland | ||
paraformer-8k-v1 | Chinese mainland | ||
paraformer-mtl-v1 | Chinese mainland |
Real-time speech recognition
Charges apply per second of input audio. Output is not billed.
China (Beijing)
Model ID | Deployment scope | Input price | Free quota(Note) |
paraformer-realtime-v2 | Chinese mainland | CNY 0.00024 per second | 36,000 seconds (10 hours) Issued at 00:00 (UTC+8) on the first day of each month. Expires after one month. |
paraformer-realtime-v1 | Chinese mainland | ||
paraformer-realtime-8k-v2 | Chinese mainland | ||
paraformer-realtime-8k-v1 | Chinese mainland |
Video generation
You are not charged for input. You are charged for output based on the total duration of successfully generated videos (in seconds).
Formula: Cost = Video unit price × Video duration (seconds).
Notes:
-
Some models charge by output video resolution. Prices differ for resolutions such as 480P, 720P, and 1080P.
-
Some models charge by output video edition. Prices differ for editions such as Standard Edition and Professional Edition.
-
Some models charge by output video aspect ratio. Prices differ for aspect ratios such as 1:1 and 3:4.
-
Some models use a flat rate, regardless of resolution, edition, or aspect ratio.
-
Failed requests incur no cost and do not consume your free quota.
HappyHorse-Text-to-video
Charges are based on output only. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output resolution | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
happyhorse-1.0-t2v | Chinese mainland | 720P | CNY 0.9 per second | 10 seconds |
1080P | CNY 1.6 per second |
US (Virginia)
Model ID | Deployment scope | Output resolution | Output price |
happyhorse-1.0-t2v | Global | 720P | CNY 0.9 per second |
1080P | CNY 1.6 per second |
Singapore
Model ID | Deployment scope | Output resolution | Output price |
happyhorse-1.0-t2v | International | 720P | CNY 1.049188 per second |
1080P | CNY 1.798608 per second |
Germany (Frankfurt)
Model ID | Deployment scope | Output resolution | Output price |
happyhorse-1.0-t2v | Global | 720P | CNY 0.9 per second |
1080P | CNY 1.6 per second |
HappyHorse: Image-to-video (first frame)
Billing applies to output only. For details, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output video resolution | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
happyhorse-1.0-i2v | Chinese mainland | 720P | CNY 0.9 per second | 10 seconds |
1080P | CNY 1.6 per second |
US (Virginia)
Model ID | Deployment scope | Output video resolution | Output price |
happyhorse-1.0-i2v | Global | 720P | CNY 0.9 per second |
1080P | CNY 1.6 per second |
Singapore
Model ID | Deployment scope | Output video resolution | Output price |
happyhorse-1.0-i2v | International | 720P | CNY 1.049188 per second |
1080P | CNY 1.798608 per second |
Germany (Frankfurt)
Model ID | Deployment scope | Output video resolution | Output price |
happyhorse-1.0-i2v | Global | 720P | CNY 0.9 per second |
1080P | CNY 1.6 per second |
HappyHorse: Reference-to-video
You are billed for output only. See video generation for billing rules.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Output video resolution | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
happyhorse-1.0-r2v | Chinese mainland | 720P | CNY 0.9 per second | 10 seconds |
1080P | CNY 1.6 per second |
US (Virginia)
Model id | Deployment scope | Output video resolution | Output price |
happyhorse-1.0-r2v | global | 720P | CNY 0.9 per second |
1080P | CNY 1.6 per second |
Singapore
Model id | Deployment scope | Output video resolution | Output price |
happyhorse-1.0-r2v | international | 720P | CNY 1.049188 per second |
1080P | CNY 1.798608 per second |
Germany (Frankfurt)
Model id | Deployment scope | Output video resolution | Output price |
happyhorse-1.0-r2v | global | 720P | CNY 0.9 per second |
1080P | CNY 1.6 per second |
HappyHorse-Video Editing
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.
Model id | Deployment scope | Output resolution | Price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
happyhorse-1.0-video-edit | Chinese mainland | 720P | CNY 0.9 per second | 10 seconds |
1080P | CNY 1.6 per second |
US (Virginia)
Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.
Model id | Deployment scope | Output resolution | Price |
happyhorse-1.0-video-edit | global | 720P | CNY 1.049188 per second |
1080P | CNY 1.798608 per second |
Singapore
Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.
Model id | Deployment scope | Output resolution | Price |
happyhorse-1.0-video-edit | global | 720P | CNY 1.049188 per second |
1080P | CNY 1.798608 per second |
Germany (Frankfurt)
Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.
Model id | Deployment scope | Output resolution | Price |
happyhorse-1.0-video-edit | global | 720P | CNY 1.049188 per second |
1080P | CNY 1.798608 per second |
Wanx-text-to-video
Only output is billed. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Output resolution | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wan2.7-t2v-2026-04-25 | Chinese mainland | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second | |||
wan2.7-t2v | Chinese mainland | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second | |||
wan2.6-t2v | Chinese mainland | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second | |||
wan2.5-t2v-preview | Chinese mainland | 480P | CNY 0.3 per second | 50 seconds |
720P | CNY 0.6 per second | |||
1080P | CNY 1 per second | |||
wan2.2-t2v-plus | Chinese mainland | 480P | CNY 0.14 per second | 50 seconds |
1080P | CNY 0.70 per second | |||
wanx2.1-t2v-turbo | Chinese mainland | 480P | CNY 0.24 per second | 200 seconds |
720P | CNY 0.24 per second | |||
wanx2.1-t2v-plus | Chinese mainland | 720P | CNY 0.70 per second | 200 seconds |
US (Virginia)
Model id | Deployment scope | Output resolution | Output price |
wan2.6-t2v | Global | 720P | CNY 0.6 per second |
1080P | CNY 1 per second | ||
wan2.6-t2v-us | US | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second |
Singapore
Model id | Deployment scope | Output resolution | Output price |
wan2.7-t2v-2026-04-25 | International | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second | ||
wan2.7-t2v | International | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second | ||
wan2.6-t2v | International | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second | ||
wan2.5-t2v-preview | International | 480P | CNY 0.366961 per second |
720P | CNY 0.733923 per second | ||
1080P | CNY 1.100885 per second | ||
wan2.2-t2v-plus | International | 480P | CNY 0.146785 per second |
1080P | CNY 0.733924 per second | ||
wan2.1-t2v-turbo | International | 480P | CNY 0.264213 per second |
720P | CNY 0.264213 per second | ||
wan2.1-t2v-plus | International | 720P | CNY 0.733924 per second |
Germany (Frankfurt)
Model id | Deployment scope | Output resolution | Output price |
wan2.6-t2v | Global | 720P | CNY 0.6 per second |
1080P | CNY 1 per second |
Wan Image-to-Video
Only output is billed. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output video type | Output video resolution | Price | Free quotaGuidelines Valid for 90 days after activating Model Studio |
wan2.7-i2v-2026-04-25 | Chinese mainland | Video with audio | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second | ||||
wan2.7-i2v | Chinese mainland | Video with audio | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second |
Singapore
Model ID | Deployment scope | Output video type | Output video resolution | Price |
wan2.7-i2v-2026-04-25 | International | Video with audio | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second | |||
wan2.7-i2v | International | Video with audio | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second |
Wan: Image-to-video (first frame)
Only output is billed. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model id | Deployment scope | Output video type | Output video resolution | Unit price | Free quota (Note) Valid for 90 days after activating Model Studio |
wan2.6-i2v-flash | Chinese mainland | video with audio
| 720P | CNY 0.3/second | 50 seconds |
1080P | CNY 0.5/second | ||||
silent video
| 720P | CNY 0.15/second | |||
1080P | CNY 0.25/second | ||||
wan2.6-i2v | Chinese mainland | video with audio | 720P | CNY 0.6/second | 50 seconds |
1080P | CNY 1/second | ||||
wan2.5-i2v-preview | Chinese mainland | video with audio | 480P | CNY 0.3/second | 50 seconds |
720P | CNY 0.6/second | ||||
1080P | CNY 1/second | ||||
wan2.2-i2v-flash | Chinese mainland | silent video | 480P | CNY 0.10/second | 50 seconds |
720P | CNY 0.20/second | ||||
1080P | CNY 0.48/second | ||||
wan2.2-i2v-plus | Chinese mainland | silent video | 480P | CNY 0.14/second | 50 seconds |
1080P | CNY 0.70/second | ||||
wanx2.1-i2v-turbo | Chinese mainland | silent video | 480P | CNY 0.24/second | 200 seconds |
720P | CNY 0.24/second | ||||
wanx2.1-i2v-plus | Chinese mainland | silent video | 720P | CNY 0.70/second | 200 seconds |
US (Virginia)
Model id | Deployment scope | Output video type | Output video resolution | Unit price |
wan2.6-i2v | Global | video with audio | 720P | CNY 0.6/second |
1080P | CNY 1/second | |||
wan2.6-i2v-us | US | video with audio | 720P | CNY 0.733924/second |
1080P | CNY 1.100886/second |
Singapore
Model id | Deployment scope | Output video type | Output video resolution | Unit price |
wan2.6-i2v-flash | International | video with audio
| 720P | CNY 0.366962/second |
1080P | CNY 0.550443/second | |||
silent video
| 720P | CNY 0.183481/second | ||
1080P | CNY 0.275221/second | |||
wan2.6-i2v | International | video with audio | 720P | CNY 0.733924/second |
1080P | CNY 1.100886/second | |||
wan2.5-i2v-preview | International | video with audio | 480P | CNY 0.366961/second |
720P | CNY 0.733923/second | |||
1080P | CNY 1.100885/second | |||
wan2.2-i2v-flash | International | silent video | 480P | CNY 0.110089/second |
720P | CNY 0.264213/second | |||
wan2.2-i2v-plus | International | silent video | 480P | CNY 0.146785/second |
1080P | CNY 0.733924/second | |||
wan2.1-i2v-turbo | International | silent video | 480P | CNY 0.264213/second |
720P | CNY 0.264213/second | |||
wan2.1-i2v-plus | International | silent video | 720P | CNY 0.733924/second |
Germany (Frankfurt)
Model id | Deployment scope | Output video type | Output video resolution | Unit price |
wan2.6-i2v | Global | video with audio | 720P | CNY 0.6/second |
1080P | CNY 1/second |
Wanx - image-to-video (first and last frames)
Only output is billed. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output video resolution | Price | Free quotaGuidelines Valid for 90 days after you activate Alibaba Cloud Model Studio |
wanx2.2-kf2v-flash | chinese mainland | 480P | CNY 0.10/second | 50 seconds |
720P | CNY 0.20/second | |||
1080P | CNY 0.48/second | |||
wanx2.1-kf2v-plus | chinese mainland | 720P | CNY 0.70/second | 200 seconds |
Singapore
Model ID | Deployment scope | Output video resolution | Price |
wanx2.1-kf2v-plus | international | 720P | CNY 0.733924/second |
Wan - Reference-to-video
Billing rules: Both input and output videos are billed based on video duration (seconds). Failed requests are not billed and do not consume the free quota.
Billing formula: Billed duration = Input video duration (up to 5 seconds) + Output video duration.
The billable duration of the input video does not exceed 5 seconds. For calculation rules, see Billing and rate limiting.
The billable duration of the output video is the length of the successfully generated video in seconds.
China (Beijing)
Model ID | Deployment scope | Output video type | Output video resolution | Rate | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
wan2.7-r2v | Chinese mainland | audio video | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second | ||||
wan2.6-r2v-flash | Chinese mainland | audio video
| 720P | CNY 0.3 per second | 50 seconds |
1080P | CNY 0.5 per second | ||||
silent video
| 720P | CNY 0.15 per second | |||
1080P | CNY 0.25 per second | ||||
wan2.6-r2v | Chinese mainland | audio video | 720P | CNY 0.6 per second | 50 seconds |
1080P | CNY 1 per second |
US (Virginia)
Model ID | Deployment scope | Output video type | Output video resolution | Rate |
wan2.6-r2v | global | audio video | 720P | CNY 0.6 per second |
1080P | CNY 1 per second |
Singapore
Model ID | Deployment scope | Output video type | Output video resolution | Rate |
wan2.7-r2v | international | audio video | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second | |||
wan2.6-r2v-flash | international | audio video
| 720P | CNY 0.366962 per second |
1080P | CNY 0.550443 per second | |||
silent video
| 720P | CNY 0.183481 per second | ||
1080P | CNY 0.275221 per second | |||
wan2.6-r2v | international | audio video | 720P | CNY 0.733924 per second |
1080P | CNY 1.100886 per second |
Germany (Frankfurt)
Model ID | Deployment scope | Output video type | Output video resolution | Rate |
wan2.6-r2v | global | audio video | 720P | CNY 0.6 per second |
1080P | CNY 1 per second |
Wan video editing
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Pricing rule: Both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
Model ID | Deployment scope | Output video resolution | Input and output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wan2.7-videoedit | Chinese mainland | 720P | CNY 0.6/second | 50 seconds |
1080P | CNY 1/second |
Pricing rule: Input is not billed. Output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
Model ID | Deployment scope | Output video resolution | Output price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wanx2.1-vace-plus | Chinese mainland | 720P | CNY 0.70/second | 50 seconds |
Singapore
Pricing rule: Both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
Model ID | Deployment scope | Output video resolution | Input and output price |
wan2.7-videoedit | international | 720P | CNY 0.733924/second |
1080P | CNY 1.100886/second |
Pricing rule: Input is not billed. Output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.
Model ID | Deployment scope | Output video resolution | Output price |
wanx2.1-vace-plus | international | 720P | CNY 0.733924/second |
Wan - digital human
wan2.2-s2v-detect: Input is billed per image for each successful request, regardless of the detection result. Output is free.wan2.2-s2v: Input is free, while output is billed based on the duration (in seconds) of successfully generated videos. For billing details, see Video generation.
China (Beijing)
Model ID | Deployment scope | Unit price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
wan2.2-s2v-detect | Chinese mainland | Input image: CNY 0.004/image | 200 images |
wan2.2-s2v | Chinese mainland | Output video:
| 100 seconds |
Wanx-Image-to-Motion
Only output is billed. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output video mode | Output price | Free Tier(note) Valid for 90 days after activation |
wan2.2-animate-move | Chinese mainland | standard mode | CNY 0.4 per second | 50 seconds Valid for 90 days after activation |
professional mode | CNY 0.6/second |
Singapore
Model ID | Deployment scope | Output video mode | Output price |
wan2.2-animate-move | International | standard mode | CNY 0.880709 per second |
professional mode | CNY 1.321063 per second |
Wan - Video face swap
Only output is billed. For billing rules, see video generation.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Output mode | Output price | Free quota(Note) Valid for 90 days after activating Model Studio |
wan2.2-animate-mix | Chinese mainland | standard mode | CNY 0.6 per second | 50 seconds |
professional mode | CNY 0.9 per second |
Singapore
Model ID | Deployment scope | Output mode | Output price |
wan2.2-animate-mix | International | standard mode | CNY 1.321063 per second |
professional mode | CNY 1.908202 per second |
AnimateAnyone
animate-anyone-detect-gen2: Input is billed, but output is free. You are charged for each input image in a successful request, regardless of the detection result.animate-anyone-template-gen2: Input is free, while output is billed by the second for each successfully generated video. For billing rules, see video generation.animate-anyone-gen2: Input is free, while output is billed by the second for each successfully generated video. For billing rules, see video generation.
China (Beijing)
Model ID | Service region | Unit price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
animate-anyone-detect-gen2 | Chinese mainland | input image: CNY 0.004 per image | 200 images |
animate-anyone-template-gen2 | Chinese mainland | output video: CNY 0.08 per second | 1,800 seconds (30 minutes) |
animate-anyone-gen2 | Chinese mainland | output video: CNY 0.08 per second | 1,800 seconds (30 minutes) |
EMO
emo-detect-v1: The input is billed, and the output is free. Billing is based on the number of images processed. You are charged once for each input image in a successful request, regardless of the detection result.emo-v1: The input is free, and the output is billed. Output is billed per second of successfully generated video. For pricing rules, see video generation.
China (Beijing)
Model id | Region | Unit price | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
emo-detect-v1 | Chinese mainland | Input image: CNY 0.004/image | 200 images |
emo-v1 | Chinese mainland | Output video:
| 1,800 seconds (30 minutes) |
LivePortrait
liveportrait-detect: You are billed for input images, while output is free. Each image is billed once for every successful request, regardless of the detection result.
liveportrait: Input is free, but output is billed. You are billed based on the duration (in seconds) of successfully generated videos. For billing rules, see video generation.
China (Beijing)
Model ID | Region | Price | Quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
liveportrait-detect | Chinese mainland | CNY 0.004 per input image | 200 images |
liveportrait | Chinese mainland | CNY 0.02 per second of output video | 1,800 seconds (30 minutes) |
Emoji sticker
emoji-detect-v1: You are billed for each input image processed in a successful request, regardless of the detection result. The output is not billed.emoji-v1: Input is free. You are billed for the output based on the duration (in seconds) of each successfully generated video. For billing rules, see video generation.
China (Beijing)
Model ID | Deployment scope | Unit price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
emoji-detect-v1 | Chinese mainland | Input image: CNY 0.004 per image | 200 images |
emoji-v1 | Chinese mainland | Output video: CNY 0.08 per second | 1,800 seconds (30 minutes) |
VideoRetalk
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model ID | Deployment scope | Output price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
videoretalk | Chinese mainland | CNY 0.08 per second | 1,800 seconds (30 minutes) |
Video style transfer
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model id | Deployment scope | Output video resolution | Unit price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
video-style-transform | chinese mainland | 540P | CNY 0.2 per second | 600 seconds |
720P | CNY 0.5 per second |
Video generation - third-party model
Pixverse text-to-video
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model id | Deployment scope | Video type | Video resolution | Price | Free quotaGuidelines |
pixverse/pixverse-c1-t2v | Chinese mainland | Video with audio
| 360P | CNY 0.24 per second | No free quota |
540P | CNY 0.30 per second | ||||
720P | CNY 0.39 per second | ||||
1080P | CNY 0.71 per second | ||||
Silent video
| 360P | CNY 0.18 per second | |||
540P | CNY 0.24 per second | ||||
720P | CNY 0.30 per second | ||||
1080P | CNY 0.56 per second | ||||
pixverse/pixverse-v6-t2v | Chinese mainland | Video with audio
| 360P | CNY 0.21 per second | No free quota |
540P | CNY 0.27 per second | ||||
720P | CNY 0.36 per second | ||||
1080P | CNY 0.68 per second | ||||
Silent video
| 360P | CNY 0.15 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.53 per second | ||||
pixverse/pixverse-v5.6-t2v | Chinese mainland | Video with audio
| 360P | CNY 0.47 per second | No free quota |
540P | CNY 0.47 per second | ||||
720P | CNY 0.53 per second | ||||
1080P | CNY 0.70 per second | ||||
Silent video
| 360P | CNY 0.21 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.44 per second | ||||
pixverse/pixverse-v5.6-it2v | Chinese mainland | Video with audio
| 360P | CNY 0.47 per second | No free quota |
540P | CNY 0.47 per second | ||||
720P | CNY 0.53 per second | ||||
1080P | CNY 0.70 per second | ||||
Silent video
| 360P | CNY 0.21 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.44 per second |
Pika: Image-to-video (initial frame)
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model ID | Region | Video type | Resolution | Price | Free quota(Note) |
pixverse/pixverse-c1-it2v | Chinese mainland | video with audio
| 360P | CNY 0.24 per second | No free quota |
540P | CNY 0.30 per second | ||||
720P | CNY 0.39 per second | ||||
1080P | CNY 0.71 per second | ||||
silent video
| 360P | CNY 0.18 per second | |||
540P | CNY 0.24 per second | ||||
720P | CNY 0.30 per second | ||||
1080P | CNY 0.56 per second | ||||
pixverse/pixverse-v6-it2v | Chinese mainland | video with audio
| 360P | CNY 0.21 per second | No free quota |
540P | CNY 0.27 per second | ||||
720P | CNY 0.36 per second | ||||
1080P | CNY 0.68 per second | ||||
silent video
| 360P | CNY 0.15 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.53 per second | ||||
pixverse/pixverse-v5.6-it2v | Chinese mainland | video with audio
| 360P | CNY 0.47 per second | No free quota |
540P | CNY 0.47 per second | ||||
720P | CNY 0.53 per second | ||||
1080P | CNY 0.70 per second | ||||
silent video
| 360P | CNY 0.21 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.44 per second |
Pika - Image-to-video (first and last frames)
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model ID | Deployment scope | Output video type | Output video resolution | Output price | Free quota(Note) |
pixverse/pixverse-c1-kf2v | Chinese mainland | video with audio
| 360P | CNY 0.24 per second | No free quota |
540P | CNY 0.30 per second | ||||
720P | CNY 0.39 per second | ||||
1080P | CNY 0.71 per second | ||||
silent video
| 360P | CNY 0.18 per second | |||
540P | CNY 0.24 per second | ||||
720P | CNY 0.30 per second | ||||
1080P | CNY 0.56 per second | ||||
pixverse/pixverse-v6-kf2v | Chinese mainland | video with audio
| 360P | CNY 0.21 per second | No free quota |
540P | CNY 0.27 per second | ||||
720P | CNY 0.36 per second | ||||
1080P | CNY 0.68 per second | ||||
silent video
| 360P | CNY 0.15 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.53 per second | ||||
pixverse/pixverse-v5.6-kf2v | Chinese mainland | video with audio
| 360P | CNY 0.47 per second | No free quota |
540P | CNY 0.47 per second | ||||
720P | CNY 0.53 per second | ||||
1080P | CNY 0.70 per second | ||||
silent video
| 360P | CNY 0.21 per second | |||
540P | CNY 0.21/s | ||||
720P | CNY 0.27/second | ||||
1080P | CNY 0.44 per second |
Pika-Reference-to-Video
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model id | Deployment scope | Output type | Resolution | Price | Free quota(Note) |
pixverse/pixverse-c1-r2v | Chinese mainland | Video with audio
| 360P | CNY 0.24 per second | No free quota |
540P | CNY 0.3 per second | ||||
720P | CNY 0.39 per second | ||||
1080P | CNY 0.71 per second | ||||
silent video
| 360P | CNY 0.18 per second | |||
540P | CNY 0.24 per second | ||||
720P | CNY 0.3 per second | ||||
1080P | CNY 0.56 per second | ||||
pixverse/pixverse-v5.6-r2v | Chinese mainland | Video with audio
| 360P | CNY 0.47 per second | No free quota |
540P | CNY 0.47 per second | ||||
720P | CNY 0.53 per second | ||||
1080P | CNY 0.7 per second | ||||
silent video
| 360P | CNY 0.21 per second | |||
540P | CNY 0.21 per second | ||||
720P | CNY 0.27 per second | ||||
1080P | CNY 0.44 per second |
Kling-Video-Generation
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model id | Deployment scope | Video type | Video resolution | Price | Free quota (Note) |
kling/kling-v3-video-generation | Chinese mainland | Silent video | 720P | CNY 0.6 per second | No free quota |
1080P | CNY 0.8 per second | ||||
Video with audio | 720P | CNY 0.9 per second | |||
1080P | CNY 1.2 per second | ||||
kling/kling-v3-omni-video-generation | Chinese mainland | Silent video (without reference video) | 720P | CNY 0.6 per second | No free quota |
1080P | CNY 0.8 per second | ||||
Silent video (with reference video) | 720P | CNY 0.9 per second | |||
1080P | CNY 1.2 per second | ||||
Video with audio (without reference video) | 720P | CNY 0.9 per second | |||
1080P | CNY 1.2 per second |
Vidu-Text-to-video
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model id | Deployment scope | Output video resolution | Output price | Free quota (Note) |
vidu/viduq3-pro_text2video | Chinese mainland | 540P | CNY 0.3125 per second | No free quota |
720P | CNY 0.78125 per second | |||
1080P | CNY 0.9375 per second | |||
vidu/viduq3-turbo_text2video | Chinese mainland | 540P | CNY 0.25 per second | No free quota |
720P | CNY 0.375 per second | |||
1080P | CNY 0.4375 per second | |||
vidu/viduq2_text2video | Chinese mainland | 540P | CNY 0.1125 per second | No free quota |
720P | CNY 0.21875 per second | |||
1080P | CNY 0.375 per second |
Vidu: Image-to-video (first frame)
Only output is billed. For billing rules, see video generation.
China (Beijing)
Model ID | Deployment scope | Output video resolution | Price | Free quota (Note) |
vidu/viduq3-pro_img2video | Chinese mainland | 540P | CNY 0.3125 per second | No free quota |
720P | CNY 0.78125 per second | |||
1080P | CNY 0.9375 per second | |||
vidu/viduq3-turbo_img2video | Chinese mainland | 540P | CNY 0.25 per second | No free quota |
720P | CNY 0.375 per second | |||
1080P | CNY 0.4375 per second | |||
vidu/viduq2-pro_img2video | Chinese mainland | 540P | CNY 0.15625 per second | No free quota |
720P | CNY 0.34375 per second | |||
1080P | CNY 0.71875 per second | |||
vidu/viduq2-turbo_img2video | Chinese mainland | 540P | CNY 0.0875 per second | No free quota |
720P | CNY 0.25 per second | |||
1080P | CNY 0.46875 per second | |||
vidu/viduq2-pro-fast_img2video | Chinese mainland | 720P | CNY 0.1 per second | No free quota |
1080P | CNY 0.2 per second |
Vidu image-to-video - first and last frames
Only output is billed. For billing rules, see video generation.
North China 2 (Beijing)
Model ID | Region | Output video resolution | Unit price | Free quota(note) |
vidu/viduq3-pro_start-end2video | Chinese mainland | 540P | CNY 0.3125 per second | No free quota |
720P | CNY 0.78125 per second | |||
1080P | CNY 0.9375 per second | |||
vidu/viduq3-turbo_start-end2video | Chinese mainland | 540P | CNY 0.25 per second | No free quota |
720P | CNY 0.375 per second | |||
1080P | CNY 0.4375 per second | |||
vidu/viduq2-pro_start-end2video | Chinese mainland | 540P | CNY 0.15625 per second | No free quota |
720P | CNY 0.34375 per second | |||
1080P | CNY 0.71875 per second | |||
vidu/viduq2-turbo_start-end2video | Chinese mainland | 540P | CNY 0.0875 per second | No free quota |
720P | CNY 0.25 per second | |||
1080P | CNY 0.46875 per second |
Vidu - Video generation from reference
Only output is billed. For billing rules, see video generation.
North China 2 (Beijing)
Model ID | Service region | Output resolution | Unit price | Free quota(Note) |
vidu/viduq3-mix_reference2video | Chinese mainland | 720P | CNY 0.78125 per second | No free quota |
1080P | CNY 0.9375 per second | |||
vidu/viduq3_reference2video | Chinese mainland | 540P | CNY 0.3125 per second | No free quota |
720P | CNY 0.625 per second | |||
1080P | CNY 0.78125 per second | |||
vidu/viduq3-turbo_reference2video | Chinese mainland | 540P | CNY 0.15625 per second | No free quota |
720P | CNY 0.3125 per second | |||
1080P | CNY 0.40625 per second | |||
vidu/viduq2-pro_reference2video | Chinese mainland | 540P | CNY 0.25 per second | No free quota |
720P | CNY 0.3125 per second | |||
1080P | CNY 0.78125 per second | |||
vidu/viduq2_reference2video | Chinese mainland | 540P | CNY 0.21875 per second | No free quota |
720P | CNY 0.28125 per second | |||
1080P | CNY 0.71875 per second |
3D model generation - Third-party models
Tripo-3D model generation
Billing is per output request; input is not charged.
China (Beijing)
Model ID | Deployment scope | Task type | Specification | Price |
Tripo/Tripo-H3.1 | Chinese mainland | text-to-3D | Standard + no texture | CNY 0.7 per request |
Standard + SD texture | CNY 1.4 per request | |||
Standard + HD texture | CNY 2.1 per request | |||
HD + no texture | CNY 2.1 per request | |||
HD + SD texture | CNY 2.8 per request | |||
HD + HD texture | CNY 3.5 per request | |||
single-image-to-3D / multi-image-to-3D | Standard + no texture | CNY 1.4 per request | ||
Standard + SD texture | CNY 2.1 per request | |||
Standard + HD texture | CNY 2.8 per request | |||
HD + no texture | CNY 2.8 per request | |||
HD + SD texture | CNY 3.5 per request | |||
HD + HD texture | CNY 4.2 per request | |||
Tripo/Tripo-P1.0 | Chinese mainland | text-to-3D | no texture | CNY 2.1 per request |
SD texture | CNY 2.8 per request | |||
HD texture | CNY 3.5 per request | |||
single-image-to-3D / multi-image-to-3D | no texture | CNY 2.8 per request | ||
SD texture | CNY 3.5 per request | |||
HD texture | CNY 4.2 per request |
Text embedding
You are charged only for input tokens. Output tokens are not charged.
If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.
China (Beijing)
Model id | Region | Input price | Free quota(Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
text-embedding-v4 50% discount for batch inference | Chinese mainland | CNY 0.5 | 1 million tokens |
text-embedding-v3 50% discount for batch inference | Chinese mainland | CNY 0.5 | 500,000 tokens |
text-embedding-v2 50% discount for batch inference | Chinese mainland | CNY 0.7 | 500,000 tokens |
text-embedding-v1 50% discount for batch inference | Chinese mainland | CNY 0.7 | 500,000 tokens |
text-embedding-async-v2 | Chinese mainland | CNY 0.7 | 20 million tokens |
text-embedding-async-v1 | Chinese mainland | CNY 0.7 | 20 million tokens |
Singapore
Model id | Region | Input price |
text-embedding-v4 | international | CNY 0.514 |
text-embedding-v3 | international | CNY 0.514 |
Multimodal embedding
You are charged for input tokens. Output is free.
China (Beijing)
Model ID | Service region | Input price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio | |
Text | Image/video | |||
qwen3-vl-embedding | Chinese mainland | CNY 0.7 | CNY 1.8 | 1 million tokens |
qwen2.5-vl-embedding | Chinese mainland | 1 million tokens | ||
tongyi-embedding-vision-plus | Chinese mainland | CNY 0.5 | CNY 0.5 | 1 million tokens |
tongyi-embedding-vision-flash | Chinese mainland | CNY 0.15 | CNY 0.15 | 1 million tokens |
multimodal-embedding-v1 | Chinese mainland | CNY 0.7 | CNY 0.9 | 1 million tokens |
Text reranking
Text reranking models
You are billed for input tokens. There is no charge for output.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Price (per 1M tokens) | Free quota (Note) Valid for 90 days after activating Alibaba Cloud Model Studio |
qwen3-vl-rerank | Chinese mainland | Text input: CNY 0.7 Image input: CNY 1.8 | 1 million tokens |
qwen3-rerank | Chinese mainland | Text input: CNY 0.5 | 1 million tokens |
gte-rerank-v2 | Chinese mainland | Text input: CNY 0.8 | 1 million tokens |
Singapore
Model ID | Deployment scope | Price (per 1M tokens) |
qwen3-rerank | International | CNY 0.74942 |
Industry models
Tongyi Farui
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) |
farui-plus | Chinese mainland | CNY 20 | CNY 20 | No free quota |
Intent understanding
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
tongyi-intent-detect-v3 | Chinese mainland | CNY 0.4 | CNY 1 | 1 million tokens |
Role play
You are charged for input tokens and output tokens.
The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
qwen-plus-character Discount available with Session Cache | Chinese mainland | CNY 0.8 | CNY 2 | 1 million tokens |
qwen-flash-character Discount available with Session Cache | Chinese mainland | CNY 0.25 | CNY 1.5 | 1 million tokens |
qwen-flash-character-2026-02-26 Discount available with Session Cache | Chinese mainland | CNY 0.18 | CNY 1.5 | 1 million tokens |
Asia Pacific SE 1 (Singapore)
Model ID | Deployment scope | Input price | Output price |
qwen-plus-character Discount available with Session Cache | International | CNY 3.747 | CNY 10.492 |
qwen-flash-character Discount available with Session Cache | International | CNY 0.375 | CNY 2.998 |
qwen-plus-character-ja | International | CNY 3.67 | CNY 10.275 |
UI interaction
You are charged for input tokens and output tokens.
China (Beijing)
Model ID | Deployment scope | Input price | Output price | Free quota (Note) Valid for 90 days after you activate Alibaba Cloud Model Studio |
gui-plus | Chinese mainland | CNY 1.5 | CNY 4.5 | 1 million tokens |
gui-plus-2026-02-26 | Chinese mainland |
Error codes
If a model call fails and returns an error message, see Error codes.