Model inference pricing

更新时间:
复制 MD 格式

Tiered pricing

Some Model Studio models use tiered pricing. The unit price is determined by the total number of input tokens in a single request. All tokens in that request are billed at the corresponding tier's unit price.

For example, a model might have two pricing tiers: 0 < tokens ≤ 32K and 32K < tokens ≤ 128K. If a request contains 100K input tokens, it falls into the second tier (32K < 100K ≤ 128K), and all tokens are billed at that tier's unit price.

Text generation: Qwen

Qwen-Max

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Mode

Input tokens

Input price (per million tokens)

Output price (per million tokens)

Chain of thought and answer

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3.7-max

Alias for qwen3.7-max-2026-05-20
50% batch inference discount
context caching discount

Chinese mainland

non-thinking and thinking modes

0<token≤1M

CNY 12

CNY 36

1 million tokens

qwen3.7-max-2026-06-08

Chinese mainland

non-thinking and thinking modes

0<token≤1M

CNY 12

CNY 36

1 million tokens

qwen3.7-max-2026-05-20

Chinese mainland

non-thinking and thinking modes

0<token≤1M

CNY 12

CNY 36

1 million tokens

qwen3.7-max-preview

Alias for qwen3.7-max-2026-05-17

Chinese mainland

thinking mode only

0<token≤1M

CNY 12

CNY 36

1 million tokens

qwen3.7-max-2026-05-17

Chinese mainland

thinking mode only

0<token≤1M

CNY 12

CNY 36

1 million tokens

qwen3.6-max-preview

context caching discount

Chinese mainland

non-thinking and thinking modes

0<token≤128K

CNY 9

CNY 54

1 million tokens

128K<token≤256K

CNY 15

CNY 90

qwen3-max

Alias for qwen3-max-2026-01-23
50% batch inference discount
context caching discount

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 2.5

CNY 10

1 million tokens

32K<token≤128K

CNY 4

CNY 16

128K<token≤256K

CNY 7

CNY 28

qwen3-max-2026-01-23

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 2.5

CNY 10

1 million tokens

32K<token≤128K

CNY 4

CNY 16

128K<token≤256K

CNY 7

CNY 28

qwen3-max-2025-09-23

Chinese mainland

non-thinking mode only

0<token≤32K

CNY 6

CNY 24

1 million tokens

32K<token≤128K

CNY 10

CNY 40

128K<token≤256K

CNY 15

CNY 60

qwen3-max-preview

context caching discount

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 6

CNY 24

1 million tokens

32K<token≤128K

CNY 10

CNY 40

128K<token≤256K

CNY 15

CNY 60

More models

Model ID

Deployment scope

Mode

Input tokens

Input price (per million tokens)

Output price (per million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-max

50% batch inference discount

Chinese mainland

non-thinking mode only

No tiered pricing

CNY 2.4

CNY 9.6

1 million tokens

US (Virginia)

Model id

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

qwen3.7-max

Equivalent to qwen3.7-max-2026-05-20
Eligible for context caching discounts

Global

non-thinking and thinking modes

0<token≤1M

CNY 12

CNY 36

qwen3.7-max-2026-06-08

Global

non-thinking and thinking modes

0<token≤1M

CNY 12

CNY 36

qwen3.7-max-2026-05-20

Global

non-thinking and thinking modes

0<token≤1M

CNY 12

CNY 36

qwen3-max

Equivalent to qwen3-max-2026-01-23
Eligible for context caching discounts

Global

non-thinking mode only

0<token≤32K

CNY 2.5

CNY 10

32K<token≤128K

CNY 4

CNY 16

128K<token≤256K

CNY 7

CNY 28

qwen3-max-2025-09-23

Global

non-thinking mode only

0<token≤32K

CNY 6

CNY 24

32K<token≤128K

CNY 10

CNY 40

128K<token≤256K

CNY 15

CNY 60

qwen3-max-preview

Eligible for context caching discounts

Global

non-thinking and thinking modes

0<token≤32K

CNY 6

CNY 24

32K<token≤128K

CNY 10

CNY 40

128K<token≤256K

CNY 15

CNY 60

Singapore

Model id

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

qwen3.7-max

Currently maps to qwen3.7-max-2026-05-20
Discount for context caching

International

non-thinking and thinking modes

0 < tokens ≤ 1M

CNY 18.736

CNY 56.207

qwen3.7-max-2026-06-08

International

non-thinking and thinking modes

0 < tokens ≤ 1M

CNY 18.736

CNY 56.207

qwen3.7-max-2026-05-20

International

non-thinking and thinking modes

0 < tokens ≤ 1M

CNY 18.736

CNY 56.207

qwen3.6-max-preview

Discount for context caching

International

non-thinking and thinking modes

0 < tokens ≤ 128K

CNY 9.742

CNY 58.455

128K < tokens ≤ 256K

CNY 14.988

CNY 89.93

qwen3-max

Currently maps to qwen3-max-2026-01-23
Discount for context caching

International

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 8.807

CNY 44.035

32K < tokens ≤ 128K

CNY 17.614

CNY 88.071

128K < tokens ≤ 256K

CNY 22.018

CNY 110.089

qwen3-max-2026-01-23

International

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 8.807

CNY 44.035

32K < tokens ≤ 128K

CNY 17.614

CNY 88.071

128K < tokens ≤ 256K

CNY 22.018

CNY 110.089

qwen3-max-2025-09-23

International

non-thinking mode only

0 < tokens ≤ 32K

CNY 8.807

CNY 44.035

32K < tokens ≤ 128K

CNY 17.614

CNY 88.071

128K < tokens ≤ 256K

CNY 22.018

CNY 110.089

qwen3-max-preview

Discount for context caching

International

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 8.807

CNY 44.035

32K < tokens ≤ 128K

CNY 17.614

CNY 88.071

128K < tokens ≤ 256K

CNY 22.018

CNY 110.089

More models

Model id

Deployment scope

Mode

Input tokens

Input price

Output price

qwen-max

50% discount for batch inference

International

non-thinking mode only

no tiered pricing

CNY 11.743

CNY 46.971

Germany (Frankfurt)

Model id

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

qwen3.7-max

Equivalent to qwen3.7-max-2026-05-20
context caching qualifies for a discount

global

Non-thinking and thinking modes

0<tokens≤1M

CNY 12

CNY 36

qwen3.7-max-2026-06-08

global

Non-thinking and thinking modes

0<tokens≤1M

CNY 12

CNY 36

qwen3.7-max-2026-05-20

global

Non-thinking and thinking modes

0<tokens≤1M

CNY 12

CNY 36

qwen3-max

Equivalent to qwen3-max-2026-01-23
context caching qualifies for a discount

global

Non-thinking mode only

0<tokens≤32K

CNY 2.5

CNY 10

32K<tokens≤128K

CNY 4

CNY 16

128K<tokens≤256K

CNY 7

CNY 28

qwen3-max

Equivalent to qwen3-max-2026-01-23

EU

Non-thinking and thinking modes

0<tokens≤32K

CNY 8.993

CNY 44.965

32K<tokens≤128K

CNY 17.986

CNY 89.93

128K<tokens≤256K

CNY 22.483

CNY 112.413

qwen3-max-2026-01-23

EU

Non-thinking and thinking modes

0<tokens≤32K

CNY 8.993

CNY 44.965

32K<tokens≤128K

CNY 17.986

CNY 89.93

128K<tokens≤256K

CNY 22.483

CNY 112.413

qwen3-max-2025-09-23

global

Non-thinking mode only

0<tokens≤32K

CNY 6

CNY 24

32K<tokens≤128K

CNY 10

CNY 40

128K<tokens≤256K

CNY 15

CNY 60

qwen3-max-preview

context caching qualifies for a discount

global

Non-thinking and thinking modes

0<tokens≤32K

CNY 6

CNY 24

32K<tokens≤128K

CNY 10

CNY 40

128K<tokens≤256K

CNY 15

CNY 60

Qwen-Plus

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Input tokens

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-thinking mode

Thinking mode

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
50% batch inference discount
context caching discount

Chinese mainland

0<token≤256K

CNY 2

CNY 8

CNY 8

1 million tokens

256K<token≤1M

CNY 6

CNY 24

CNY 24

qwen3.7-plus-2026-05-26

context caching discount

Chinese mainland

0<token≤256K

CNY 2

CNY 8

CNY 8

1 million tokens

256K<token≤1M

CNY 6

CNY 24

CNY 24

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

Chinese mainland

0<token≤256K

CNY 2

CNY 12

CNY 12

1 million tokens

256K<token≤1M

CNY 8

CNY 48

CNY 48

qwen3.6-plus-2026-04-02

Chinese mainland

0<token≤256K

CNY 2

CNY 12

CNY 12

1 million tokens

256K<token≤1M

CNY 8

CNY 48

CNY 48

qwen3.5-plus

Currently equivalent to qwen3.5-plus-2026-02-15

Chinese mainland

0<token≤128K

CNY 0.8

CNY 4.8

CNY 4.8

1 million tokens

128K<token≤256K

CNY 2

CNY 12

CNY 12

256K<token≤1M

CNY 4

CNY 24

CNY 24

qwen3.5-plus-2026-04-20

Chinese mainland

0<token≤128K

CNY 0.8

CNY 4.8

CNY 4.8

1 million tokens

128K<token≤256K

CNY 2

CNY 12

CNY 12

256K<token≤1M

CNY 4

CNY 24

CNY 24

qwen3.5-plus-2026-02-15

Chinese mainland

0<token≤128K

CNY 0.8

CNY 4.8

CNY 4.8

1 million tokens

128K<token≤256K

CNY 2

CNY 12

CNY 12

256K<token≤1M

CNY 4

CNY 24

CNY 24

qwen-plus

Currently equivalent to qwen-plus-2025-12-01
50% batch inference discount

Chinese mainland

0<token≤128K

CNY 0.8

CNY 2

CNY 8

1 million tokens

128K<token≤256K

CNY 2.4

CNY 20

CNY 24

256K<token≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-latest

50% batch inference discount

Chinese mainland

0<token≤128K

CNY 0.8

CNY 2

CNY 8

1 million tokens

128K<token≤256K

CNY 2.4

CNY 20

CNY 24

256K<token≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-12-01

Chinese mainland

0<token≤128K

CNY 0.8

CNY 2

CNY 8

1 million tokens

128K<token≤256K

CNY 2.4

CNY 20

CNY 24

256K<token≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-09-11

Chinese mainland

0<token≤128K

CNY 0.8

CNY 2

CNY 8

1 million tokens

128K<token≤256K

CNY 2.4

CNY 20

CNY 24

256K<token≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-07-28

Chinese mainland

0<token≤128K

CNY 0.8

CNY 2

CNY 8

1 million tokens

128K<token≤256K

CNY 2.4

CNY 20

CNY 24

256K<token≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-07-14

Chinese mainland

No tiered pricing

CNY 0.8

CNY 2

CNY 8

1 million tokens

qwen-plus-2025-04-28

Chinese mainland

No tiered pricing

CNY 0.8

CNY 2

CNY 8

1 million tokens

More models

Model id

Deployment scope

Input tokens

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-plus-2025-01-25

Chinese mainland

No tiered pricing

CNY 0.8

CNY 2

1 million tokens

qwen-plus-2025-01-12

Chinese mainland

No tiered pricing

CNY 0.8

CNY 2

1 million tokens

qwen-plus-2024-12-20

Chinese mainland

No tiered pricing

CNY 0.8

CNY 2

1 million tokens

US (Virginia)

Model ID

Availability

Input token range

Input price

Output price

Non-thinking mode

Thinking mode

qwen3.7-plus

Equivalent to qwen3.7-plus-2026-05-26
context caching is eligible for discounts.

Global

0<tokens≤256K

CNY 2

CNY 8

CNY 8

256K<tokens≤1M

CNY 6

CNY 24

CNY 24

qwen3.7-plus-2026-05-26

context caching is eligible for discounts.

Global

0<tokens≤256K

CNY 2

CNY 8

CNY 8

256K<tokens≤1M

CNY 6

CNY 24

CNY 24

qwen3.6-plus

Equivalent to qwen3.6-plus-2026-04-02

Global

0<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 8

CNY 48

CNY 48

qwen3.6-plus-2026-04-02

Global

0<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 8

CNY 48

CNY 48

qwen3.5-plus

Equivalent to qwen3.5-plus-2026-02-15

Global

0<tokens≤128K

CNY 0.8

CNY 4.8

CNY 4.8

128K<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 4

CNY 24

CNY 24

qwen3.5-plus-2026-02-15

Global

0<tokens≤128K

CNY 0.8

CNY 4.8

CNY 4.8

128K<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 4

CNY 24

CNY 24

qwen-plus

Equivalent to qwen-plus-2025-12-01

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-us

context caching is eligible for discounts.

US

0<tokens≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<tokens≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-2025-12-01

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-12-01-us

US

0<tokens≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<tokens≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-2025-09-11

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-07-28

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

Singapore

Model id

Deployment scope

Input token range

Input price (per 1M tokens)

Output price (per 1M tokens)

Non-thinking mode

Thinking mode

qwen3.7-plus

Alias for qwen3.7-plus-2026-05-26
context caching discount applies

International

0<Token≤256K

CNY 2.998

CNY 11.991

CNY 11.991

256K<Token≤1M

CNY 8.993

CNY 35.972

CNY 35.972

qwen3.7-plus-2026-05-26

context caching discount applies

International

0<Token≤256K

CNY 2.998

CNY 11.991

CNY 11.991

256K<Token≤1M

CNY 8.993

CNY 35.972

CNY 35.972

qwen3.6-plus

Alias for qwen3.6-plus-2026-04-02

International

0<Token≤256K

CNY 3.7471

CNY 22.4826

CNY 22.4826

256K<Token≤1M

CNY 14.9884

CNY 44.965

CNY 44.965

qwen3.6-plus-2026-04-02

International

0<Token≤256K

CNY 3.7471

CNY 22.4826

CNY 22.4826

256K<Token≤1M

CNY 14.9884

CNY 44.965

CNY 44.965

qwen3.5-plus

Alias for qwen3.5-plus-2026-02-15

International

0<Token≤256K

CNY 2.936

CNY 17.614

CNY 17.614

256K<Token≤1M

CNY 3.67

CNY 22.018

CNY 22.018

qwen3.5-plus-2026-04-20

International

0<Token≤256K

CNY 2.936

CNY 17.614

CNY 17.614

256K<Token≤1M

CNY 3.67

CNY 22.018

CNY 22.018

qwen3.5-plus-2026-02-15

International

0<Token≤256K

CNY 2.936

CNY 17.614

CNY 17.614

256K<Token≤1M

CNY 3.67

CNY 22.018

CNY 22.018

qwen-plus

Alias for qwen-plus-2025-12-01

International

0<Token≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<Token≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-latest

International

0<Token≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<Token≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-2025-12-01

International

0<Token≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<Token≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-2025-09-11

International

0<Token≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<Token≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-2025-07-28

International

0<Token≤256K

CNY 2.936

CNY 8.807

CNY 29.357

256K<Token≤1M

CNY 8.807

CNY 26.421

CNY 88.071

qwen-plus-2025-07-14

International

No tiered pricing

CNY 2.936

CNY 8.807

CNY 29.357

qwen-plus-2025-04-28

International

No tiered pricing

CNY 2.936

CNY 8.807

CNY 29.357

More models

Model id

Deployment scope

Input token range

Input price (per 1M tokens)

Output price (per 1M tokens)

qwen-plus-2025-01-25

International

No tiered pricing

CNY 2.936

CNY 8.807

Germany (Frankfurt)

Model ID

Deployment scope

Input token range

Input price

Output price

Non-thinking mode

Thinking mode

qwen3.7-plus

Currently equivalent to qwen3.7-plus-2026-05-26
context caching discounts apply

Global

0<tokens≤256K

CNY 2

CNY 8

CNY 8

256K<tokens≤1M

CNY 6

CNY 24

CNY 24

qwen3.7-plus-2026-05-26

context caching discounts apply

Global

0<tokens≤256K

CNY 2

CNY 8

CNY 8

256K<tokens≤1M

CNY 6

CNY 24

CNY 24

qwen3.6-plus

Currently equivalent to qwen3.6-plus-2026-04-02

Global

0<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 8

CNY 48

CNY 48

qwen3.6-plus-2026-04-02

Global

0<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 8

CNY 48

CNY 48

qwen3.5-plus

Currently equivalent to qwen3.5-plus-2026-02-15

Global

0<tokens≤128K

CNY 0.8

CNY 4.8

CNY 4.8

128K<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 4

CNY 24

CNY 24

qwen3.5-plus-2026-02-15

Global

0<tokens≤128K

CNY 0.8

CNY 4.8

CNY 4.8

128K<tokens≤256K

CNY 2

CNY 12

CNY 12

256K<tokens≤1M

CNY 4

CNY 24

CNY 24

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus

Currently equivalent to qwen-plus-2025-12-01

EU

0<tokens≤256K

CNY 2.998

CNY 8.993

CNY 29.977

256K<tokens≤1M

CNY 8.993

CNY 26.979

CNY 89.93

qwen-plus-2025-12-01

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-12-01

EU

0<tokens≤256K

CNY 2.998

CNY 8.993

CNY 29.977

256K<tokens≤1M

CNY 8.993

CNY 26.979

CNY 89.93

qwen-plus-2025-09-11

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

qwen-plus-2025-07-28

Global

0<tokens≤128K

CNY 0.8

CNY 2

CNY 8

128K<tokens≤256K

CNY 2.4

CNY 20

CNY 24

256K<tokens≤1M

CNY 4.8

CNY 48

CNY 64

Qwen-Flash

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price. If the model supports context cache, only input tokens receive a discount. These two discounts cannot apply simultaneously.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Region

Mode

Input token range

Input price

Output price

Chain of thought and answer

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3.6-flash

Currently equivalent to qwen3.6-flash-2026-04-16
50% discount for batch inference
Discounts apply to context caching

Chinese mainland

non-thinking and thinking modes

0<Token≤256K

CNY 1.2

CNY 7.2

1 million tokens

256K<Token≤1M

CNY 4.8

CNY 28.8

qwen3.6-flash-2026-04-16

Chinese mainland

non-thinking and thinking modes

0<Token≤256K

CNY 1.2

CNY 7.2

1 million tokens

256K<Token≤1M

CNY 4.8

CNY 28.8

qwen3.5-flash

Currently equivalent to qwen3.5-flash-2026-02-23
50% discount for batch inference
Discounts apply to context caching

Chinese mainland

non-thinking and thinking modes

0<Token≤128K

CNY 0.2

CNY 2

1 million tokens

128K<Token≤256K

CNY 0.8

CNY 8

256K<Token≤1M

CNY 1.2

CNY 12

qwen3.5-flash-2026-02-23

Chinese mainland

non-thinking and thinking modes

0<Token≤128K

CNY 0.2

CNY 2

1 million tokens

128K<Token≤256K

CNY 0.8

CNY 8

256K<Token≤1M

CNY 1.2

CNY 12

qwen-flash

Currently equivalent to qwen-flash-2025-07-28
50% discount for batch inference
Discounts apply to context caching

Chinese mainland

non-thinking and thinking modes

0<Token≤128K

CNY 0.15

CNY 1.5

1 million tokens

128K<Token≤256K

CNY 0.6

CNY 6

256K<Token≤1M

CNY 1.2

CNY 12

qwen-flash-2025-07-28

Chinese mainland

non-thinking and thinking modes

0<Token≤128K

CNY 0.15

CNY 1.5

1 million tokens

128K<Token≤256K

CNY 0.6

CNY 6

256K<Token≤1M

CNY 1.2

CNY 12

US (Virginia)

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought + answer

qwen3.6-flash

Currently points to qwen3.6-flash-2026-04-16
Discount for context caching

global

non-thinking and thinking modes

0<token≤256K

CNY 1.2

CNY 7.2

256K<token≤1M

CNY 4.8

CNY 28.8

qwen3.6-flash-2026-04-16

global

non-thinking and thinking modes

0<token≤256K

CNY 1.2

CNY 7.2

256K<token≤1M

CNY 4.8

CNY 28.8

qwen3.5-flash

Currently points to qwen3.5-flash-2026-02-23
Discount for context caching

global

non-thinking and thinking modes

0<token≤128K

CNY 0.2

CNY 2

128K<token≤256K

CNY 0.8

CNY 8

256K<token≤1M

CNY 1.2

CNY 12

qwen3.5-flash-2026-02-23

global

non-thinking and thinking modes

0<token≤128K

CNY 0.2

CNY 2

128K<token≤256K

CNY 0.8

CNY 8

256K<token≤1M

CNY 1.2

CNY 12

qwen-flash

Currently points to qwen-flash-2025-07-28
50% discount for batch inference
Discount for context caching

global

non-thinking and thinking modes

0<token≤128K

CNY 0.15

CNY 1.5

128K<token≤256K

CNY 0.6

CNY 6

256K<token≤1M

CNY 1.2

CNY 12

qwen-flash-us

Discount for context caching

US

0<token≤256K

CNY 0.367

CNY 2.936

256K<token≤1M

CNY 1.835

CNY 14.678

qwen-flash-2025-07-28

global

non-thinking and thinking modes

0<token≤128K

CNY 0.15

CNY 1.5

128K<token≤256K

CNY 0.6

CNY 6

256K<token≤1M

CNY 1.2

CNY 12

qwen-flash-2025-07-28-us

US

0<token≤256K

CNY 0.367

CNY 2.936

256K<token≤1M

CNY 1.835

CNY 14.678

Singapore

Model ID

Deployment scope

Mode

Input token range

Input price

Output price

Chain of thought and answer

qwen3.6-flash

Equivalent to qwen3.6-flash-2026-04-16
context caching discounts apply

international

non-thinking and thinking modes

0<tokens≤256K

CNY 1.87355

CNY 11.2413

256K<tokens≤1M

CNY 7.4942

CNY 29.9758

qwen3.6-flash-2026-04-16

international

non-thinking and thinking modes

0<tokens≤256K

CNY 1.87355

CNY 11.2413

256K<tokens≤1M

CNY 7.4942

CNY 29.9758

qwen3.5-flash

Equivalent to qwen3.5-flash-2026-02-23
batch inference 50% discount
context caching discounts apply

international

non-thinking and thinking modes

0<tokens≤1M

CNY 0.734

CNY 2.936

qwen3.5-flash-2026-02-23

international

non-thinking and thinking modes

0<tokens≤1M

CNY 0.734

CNY 2.936

qwen-flash

Equivalent to qwen-flash-2025-07-28
batch inference 50% discount
context caching discounts apply

international

non-thinking and thinking modes

0<tokens≤256K

CNY 0.367

CNY 2.936

256K<tokens≤1M

CNY 1.835

CNY 14.678

qwen-flash-2025-07-28

international

non-thinking and thinking modes

0<tokens≤256K

CNY 0.367

CNY 2.936

256K<tokens≤1M

CNY 1.835

CNY 14.678

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

qwen3.6-flash

Alias for qwen3.6-flash-2026-04-16
Discounts for context caching

Global

non-thinking and thinking modes

0<Token≤256K

CNY 1.2

CNY 7.2

256K<Token≤1M

CNY 4.8

CNY 28.8

qwen3.6-flash-2026-04-16

Global

non-thinking and thinking modes

0<Token≤256K

CNY 1.2

CNY 7.2

256K<Token≤1M

CNY 4.8

CNY 28.8

qwen3.5-flash

Alias for qwen3.5-flash-2026-02-23
Discounts for context caching

Global

non-thinking and thinking modes

0<Token≤128K

CNY 0.2

CNY 2

128K<Token≤256K

CNY 0.8

CNY 8

256K<Token≤1M

CNY 1.2

CNY 12

qwen3.5-flash

Alias for qwen3.5-flash-2026-02-23

EU

non-thinking and thinking modes

CNY 0.749

CNY 2.998

qwen3.5-flash-2026-02-23

Global

non-thinking and thinking modes

0<Token≤128K

CNY 0.2

CNY 2

128K<Token≤256K

CNY 0.8

CNY 8

256K<Token≤1M

CNY 1.2

CNY 12

qwen3.5-flash-2026-02-23

EU

non-thinking and thinking modes

CNY 0.749

CNY 2.998

qwen-flash

Alias for qwen-flash-2025-07-28
50% discount for batch inference
Discounts for context caching

Global

non-thinking and thinking modes

0<Token≤128K

CNY 0.15

CNY 1.5

128K<Token≤256K

CNY 0.6

CNY 6

256K<Token≤1M

CNY 1.2

CNY 12

qwen-flash-2025-07-28

Global

non-thinking and thinking modes

0<Token≤128K

CNY 0.15

CNY 1.5

128K<Token≤256K

CNY 0.6

CNY 6

256K<Token≤1M

CNY 1.2

CNY 12

Qwen-Turbo

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Non-thinking mode

Thinking mode (chain of thought + answer)

qwen-turbo

50% discount for batch inference

Chinese mainland

Non-thinking and thinking

CNY 0.3

CNY 0.6

CNY 3

1 million tokens

Singapore

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-thinking mode

Thinking mode (chain of thought + answer)

qwen-turbo

50% discount for batch inference

International

Non-thinking and thinking

CNY 0.367

CNY 1.468

CNY 3.67

QwQ

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Mode

Input price

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwq-plus

50% discount for batch inference

Chinese mainland

Thinking mode only

CNY 1.6

CNY 4

1 million tokens

Singapore

Model ID

Deployment scope

Mode

Input price

Output price

qwq-plus

International

Thinking mode only

CNY 5.871

CNY 17.614

Qwen-Long

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

China (Beijing)

Model ID

Region

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-long

50% discount for batch inference

Chinese mainland

CNY 0.5

CNY 2

1 million tokens

qwen-long-latest

Chinese mainland

CNY 0.5

CNY 2

1 million tokens

qwen-long-2025-01-25

Chinese mainland

CNY 0.5

CNY 2

1 million tokens

Qwen-Omni

Billing rules: You are billed for input and output tokens. For details on how tokens are calculated for different modalities, see Billing and rate limits.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text/image/video

Audio

Text

Multimodal input

Text and audio

Only audio is billed

qwen3.5-omni-plus

Currently equivalent to qwen3.5-omni-plus-2026-03-15

chinese mainland

CNY 7

CNY 53

CNY 40

CNY 213

1 million tokens

qwen3.5-omni-plus-2026-03-15

chinese mainland

CNY 7

CNY 53

CNY 40

CNY 213

1 million tokens

qwen3.5-omni-flash

Currently equivalent to qwen3.5-omni-flash-2026-03-15

chinese mainland

CNY 2.2

CNY 18

CNY 13.3

CNY 72

1 million tokens

qwen3.5-omni-flash-2026-03-15

chinese mainland

CNY 2.2

CNY 18

CNY 13.3

CNY 72

1 million tokens

More models

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text and audio

Only audio is billed

qwen3-omni-flash

Currently equivalent to qwen3-omni-flash-2025-12-01

chinese mainland

non-thinking and thinking modes

CNY 1.8

CNY 15.8

CNY 3.3

CNY 6.9

CNY 12.7

CNY 62.6

1 million tokens

qwen3-omni-flash-2025-12-01

chinese mainland

non-thinking and thinking modes

CNY 1.8

CNY 15.8

CNY 3.3

CNY 6.9

CNY 12.7

CNY 62.6

1 million tokens

qwen3-omni-flash-2025-09-15

chinese mainland

non-thinking and thinking modes

CNY 1.8

CNY 15.8

CNY 3.3

CNY 6.9

CNY 12.7

CNY 62.6

1 million tokens

qwen-omni-turbo

Currently equivalent to qwen-omni-turbo-2025-03-26

chinese mainland

non-thinking mode

CNY 0.4

CNY 25

CNY 1.5

CNY 1.6

CNY 4.5

CNY 50

1 million tokens

qwen-omni-turbo-latest

chinese mainland

non-thinking mode

CNY 0.4

CNY 25

CNY 1.5

CNY 1.6

CNY 4.5

CNY 50

1 million tokens

qwen-omni-turbo-2025-03-26

chinese mainland

non-thinking mode

CNY 0.4

CNY 25

CNY 1.5

CNY 1.6

CNY 4.5

CNY 50

1 million tokens

qwen-omni-turbo-2025-01-19

chinese mainland

non-thinking mode

CNY 0.4

CNY 25

CNY 1.5

CNY 1.6

CNY 4.5

CNY 50

1 million tokens

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text/image/video

Audio

Text

Multimodal input

Text and audio

Only audio is billed

qwen3.5-omni-plus

Currently equivalent to qwen3.5-omni-plus-2026-03-15

international

CNY 10.49

CNY 82.44

CNY 62.2

CNY 329.74

qwen3.5-omni-plus-2026-03-15

international

CNY 10.49

CNY 82.44

CNY 62.2

CNY 329.74

qwen3.5-omni-flash

Currently equivalent to qwen3.5-omni-flash-2026-03-15

international

CNY 3

CNY 22.48

CNY 16.49

CNY 89.18

qwen3.5-omni-flash-2026-03-15

international

CNY 3

CNY 22.48

CNY 16.49

CNY 89.18

More models

Model ID

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text and audio

Only audio is billed

qwen3-omni-flash

Currently equivalent to qwen3-omni-flash-2025-12-01

international

non-thinking and thinking modes

CNY 3.156

CNY 27.962

CNY 5.725

CNY 12.183

CNY 22.458

CNY 110.896

qwen3-omni-flash-2025-12-01

international

non-thinking and thinking modes

CNY 3.156

CNY 27.962

CNY 5.725

CNY 12.183

CNY 22.458

CNY 110.896

qwen3-omni-flash-2025-09-15

international

non-thinking and thinking modes

CNY 3.156

CNY 27.962

CNY 5.725

CNY 12.183

CNY 22.458

CNY 110.896

qwen-omni-turbo

Currently equivalent to qwen-omni-turbo-2025-03-26

international

non-thinking mode

CNY 0.514

CNY 32.586

CNY 1.541

CNY 1.982

CNY 4.624

CNY 65.246

qwen-omni-turbo-latest

international

non-thinking mode

CNY 0.514

CNY 32.586

CNY 1.541

CNY 1.982

CNY 4.624

CNY 65.246

qwen-omni-turbo-2025-03-26

international

non-thinking mode

CNY 0.514

CNY 32.586

CNY 1.541

CNY 1.982

CNY 4.624

CNY 65.246

Qwen-Omni-Realtime

Billing rules: Billing is based on input and output tokens. For token calculation rules for different modalities, see billing and rate limits.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text and image

Audio

Text

multimodal input

Text and audio

billed for audio only

qwen3.5-omni-plus-realtime

Chinese mainland

CNY 10

CNY 80

CNY 60

CNY 300

1 million tokens

qwen3.5-omni-plus-realtime-2026-03-15

Chinese mainland

CNY 10

CNY 80

CNY 60

CNY 300

1 million tokens

qwen3.5-omni-flash-realtime

Chinese mainland

CNY 3.3

CNY 27

CNY 20

CNY 107

1 million tokens

qwen3.5-omni-flash-realtime-2026-03-15

Chinese mainland

CNY 3.3

CNY 27

CNY 20

CNY 107

1 million tokens

More models

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Audio

Image

Text

text-only input

Text

multimodal input

Text and audio

billed for audio only

qwen3-omni-flash-realtime

Chinese mainland

CNY 2.2

CNY 18.9

CNY 3.9

CNY 8.3

CNY 15.2

CNY 75.1

1 million tokens

qwen3-omni-flash-realtime-2025-12-01

Chinese mainland

CNY 2.2

CNY 18.9

CNY 3.9

CNY 8.3

CNY 15.2

CNY 75.1

1 million tokens

qwen3-omni-flash-realtime-2025-09-15

Chinese mainland

CNY 2.2

CNY 18.9

CNY 3.9

CNY 8.3

CNY 15.2

CNY 75.1

1 million tokens

qwen-omni-turbo-realtime

Chinese mainland

CNY 1.6

CNY 25

CNY 6

CNY 6.4

CNY 18

CNY 50

1 million tokens

qwen-omni-turbo-realtime-latest

Chinese mainland

CNY 1.6

CNY 25

CNY 6

CNY 6.4

CNY 18

CNY 50

1 million tokens

qwen-omni-turbo-realtime-2025-05-08

Chinese mainland

CNY 1.6

CNY 25

CNY 6

CNY 6.4

CNY 18

CNY 50

1 million tokens

Singapore

Model ID

Deployment scope

Input price

Output price

Text and image

Audio

Text

multimodal input

Text and audio

billed for audio only

qwen3.5-omni-plus-realtime

International

CNY 15.74

CNY 123.65

CNY 92.93

CNY 464.64

qwen3.5-omni-plus-realtime-2026-03-15

International

CNY 15.74

CNY 123.65

CNY 92.93

CNY 464.64

qwen3.5-omni-flash-realtime

International

CNY 4.12

CNY 33.72

CNY 24.73

CNY 132.65

qwen3.5-omni-flash-realtime-2026-03-15

International

CNY 4.12

CNY 33.72

CNY 24.73

CNY 132.65

More models

Model ID

Deployment scope

Input price

Output price

Text

Audio

Image

Text

text-only input

Text

multimodal input

Text and audio

billed for audio only

qwen3-omni-flash-realtime

International

CNY 3.816

CNY 33.54

CNY 6.899

CNY 14.605

CNY 26.935

CNY 133.06

qwen3-omni-flash-realtime-2025-12-01

International

CNY 3.816

CNY 33.54

CNY 6.899

CNY 14.605

CNY 26.935

CNY 133.06

qwen3-omni-flash-realtime-2025-09-15

International

CNY 3.816

CNY 33.54

CNY 6.899

CNY 14.605

CNY 26.935

CNY 133.06

qwen-omni-turbo-realtime

International

CNY 1.982

CNY 32.586

CNY 6.165

CNY 7.853

CNY 18.495

CNY 65.246

qwen-omni-turbo-realtime-latest

International

CNY 1.982

CNY 32.586

CNY 6.165

CNY 7.853

CNY 18.495

CNY 65.246

qwen-omni-turbo-realtime-2025-05-08

International

CNY 1.982

CNY 32.586

CNY 6.165

CNY 7.853

CNY 18.495

CNY 65.246

QVQ

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qvq-max

Chinese mainland

CNY 8

CNY 32

1 million tokens

qvq-plus

Chinese mainland

CNY 2

CNY 5

1 million tokens

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qvq-max

International

CNY 8.807

CNY 35.228

Qwen-VL

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Mode

Input tokens per request

Input price (per 1M tokens)

Output price (per 1M tokens)

CoT and answer

Free quota(Note)

Valid for 90 days after activating Model Studio

qwen3-vl-plus

Equivalent to qwen3-vl-plus-2025-12-19
50% discount on batch inference
Discounts apply to context caching

Chinese mainland

non-thinking & thinking modes

0<tokens≤32K

CNY 1

CNY 10

1M tokens

32K<tokens≤128K

CNY 1.5

CNY 15

128K<tokens≤256K

CNY 3

CNY 30

qwen3-vl-plus-2025-12-19

Chinese mainland

non-thinking & thinking modes

0<tokens≤32K

CNY 1

CNY 10

1M tokens

32K<tokens≤128K

CNY 1.5

CNY 15

128K<tokens≤256K

CNY 3

CNY 30

qwen3-vl-plus-2025-09-23

Chinese mainland

non-thinking & thinking modes

0<tokens≤32K

CNY 1

CNY 10

1M tokens

32K<tokens≤128K

CNY 1.5

CNY 15

128K<tokens≤256K

CNY 3

CNY 30

qwen3-vl-flash

Equivalent to qwen3-vl-flash-2026-01-22
50% discount on batch inference
Discounts apply to context caching

Chinese mainland

non-thinking & thinking modes

0<tokens≤32K

CNY 0.15

CNY 1.5

1M tokens

32K<tokens≤128K

CNY 0.3

CNY 3

128K<tokens≤256K

CNY 0.6

CNY 6

qwen3-vl-flash-2026-01-22

Chinese mainland

non-thinking & thinking modes

0<tokens≤32K

CNY 0.15

CNY 1.5

1M tokens

32K<tokens≤128K

CNY 0.3

CNY 3

128K<tokens≤256K

CNY 0.6

CNY 6

qwen3-vl-flash-2025-10-15

Chinese mainland

non-thinking & thinking modes

0<tokens≤32K

CNY 0.15

CNY 1.5

1M tokens

32K<tokens≤128K

CNY 0.3

CNY 3

128K<tokens≤256K

CNY 0.6

CNY 6

More models

Model ID

Deployment scope

Input tokens per request

Input price (per 1M tokens)

Output price (per 1M tokens)

Free quota(Note)

Valid for 90 days after activating Model Studio

qwen-vl-max

50% discount on batch inference
Discounts apply to context caching

Chinese mainland

no tiered pricing

CNY 1.6

CNY 4

1M tokens

qwen-vl-plus

50% discount on batch inference
Discounts apply to context caching

Chinese mainland

no tiered pricing

CNY 0.8

CNY 2

1M tokens

US (Virginia)

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2025-10-15
Context caching discount

Global

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 0.15

CNY 1.5

32K < tokens ≤ 128K

CNY 0.3

CNY 3

128K < tokens ≤ 256K

CNY 0.6

CNY 6

qwen3-vl-flash-us

Context caching discount

US

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 0.367

CNY 2.936

32K < tokens ≤ 128K

CNY 0.55

CNY 4.404

128K < tokens ≤ 256K

CNY 0.881

CNY 7.046

qwen3-vl-flash-2026-01-22-us

US

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 0.367

CNY 2.936

32K < tokens ≤ 128K

CNY 0.55

CNY 4.404

128K < tokens ≤ 256K

CNY 0.881

CNY 7.046

qwen3-vl-flash-2025-10-15

Global

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 0.15

CNY 1.5

32K < tokens ≤ 128K

CNY 0.3

CNY 3

128K < tokens ≤ 256K

CNY 0.6

CNY 6

qwen3-vl-flash-2025-10-15-us

US

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 0.367

CNY 2.936

32K < tokens ≤ 128K

CNY 0.55

CNY 4.404

128K < tokens ≤ 256K

CNY 0.881

CNY 7.046

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-09-23
Context caching discount

Global

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 1

CNY 10

32K < tokens ≤ 128K

CNY 1.5

CNY 15

128K < tokens ≤ 256K

CNY 3

CNY 30

qwen3-vl-plus-2025-09-23

Global

Non-Thinking and Thinking modes

0 < tokens ≤ 32K

CNY 1

CNY 10

32K < tokens ≤ 128K

CNY 1.5

CNY 15

128K < tokens ≤ 256K

CNY 3

CNY 30

Singapore

Model ID

Deployment scope

Mode

Input tokens

Input price (per million tokens)

Output price (per million tokens)

qwen3-vl-plus

Currently equivalent to qwen3-vl-plus-2025-12-19
Context caching is eligible for discounts

International

Non-Thinking and Thinking modes

0<Token≤32K

CNY 1.468

CNY 11.743

32K<Token≤128K

CNY 2.202

CNY 17.614

128K<Token≤256K

CNY 4.404

CNY 35.228

qwen3-vl-plus-2025-12-19

International

Non-Thinking and Thinking modes

0<Token≤32K

CNY 1.468

CNY 11.743

32K<Token≤128K

CNY 2.202

CNY 17.614

128K<Token≤256K

CNY 4.404

CNY 35.228

qwen3-vl-plus-2025-09-23

International

Non-Thinking and Thinking modes

0<Token≤32K

CNY 1.468

CNY 11.743

32K<Token≤128K

CNY 2.202

CNY 17.614

128K<Token≤256K

CNY 4.404

CNY 35.228

qwen3-vl-flash

Currently equivalent to qwen3-vl-flash-2026-01-22
Context caching discounts apply

International

Non-Thinking and Thinking modes

0<Token≤32K

CNY 0.367

CNY 2.936

32K<Token≤128K

CNY 0.55

CNY 4.404

128K<Token≤256K

CNY 0.881

CNY 7.046

qwen3-vl-flash-2026-01-22

International

Non-Thinking and Thinking modes

0<Token≤32K

CNY 0.367

CNY 2.936

32K<Token≤128K

CNY 0.55

CNY 4.404

128K<Token≤256K

CNY 0.881

CNY 7.046

qwen3-vl-flash-2025-10-15

International

Non-Thinking and Thinking modes

0<Token≤32K

CNY 0.367

CNY 2.936

32K<Token≤128K

CNY 0.55

CNY 4.404

128K<Token≤256K

CNY 0.881

CNY 7.046

More models

Model ID

Deployment scope

Input tokens

Input price (per million tokens)

Output price (per million tokens)

qwen-vl-max

Context caching discounts apply

International

No tiered pricing

CNY 5.871

CNY 23.486

qwen-vl-plus

Context caching discounts apply

International

No tiered pricing

CNY 1.541

CNY 4.624

Germany (Frankfurt)

Model ID

Service region

Mode

Input tokens

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Chain of thought and answer

qwen3-vl-flash

Equivalent to qwen3-vl-flash-2025-10-15
context cache (Discount available)

Global

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 0.15

CNY 1.5

32K < tokens ≤ 128K

CNY 0.3

CNY 3

128K < tokens ≤ 256K

CNY 0.6

CNY 6

qwen3-vl-flash

Equivalent to qwen3-vl-flash-2026-01-22

EU

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 0.375

CNY 2.998

32K < tokens ≤ 128K

CNY 0.562

CNY 4.497

128K < tokens ≤ 256K

CNY 0.899

CNY 7.194

qwen3-vl-flash-2026-01-22

EU

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 0.375

CNY 2.998

32K < tokens ≤ 128K

CNY 0.562

CNY 4.497

128K < tokens ≤ 256K

CNY 0.899

CNY 7.194

qwen3-vl-flash-2025-10-15

Global

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 0.15

CNY 1.5

32K < tokens ≤ 128K

CNY 0.3

CNY 3

128K < tokens ≤ 256K

CNY 0.6

CNY 6

qwen3-vl-flash-2025-10-15

EU

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 0.375

CNY 2.998

32K < tokens ≤ 128K

CNY 0.562

CNY 4.497

128K < tokens ≤ 256K

CNY 0.899

CNY 7.194

qwen3-vl-plus

Equivalent to qwen3-vl-plus-2025-12-19
context cache (Discount available)

Global

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 1

CNY 10

32K < tokens ≤ 128K

CNY 1.5

CNY 15

128K < tokens ≤ 256K

CNY 3

CNY 30

qwen3-vl-plus

EU

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 1.499

CNY 11.991

32K < tokens ≤ 128K

CNY 2.248

CNY 17.986

128K < tokens ≤ 256K

CNY 4.497

CNY 35.972

qwen3-vl-plus-2025-09-23

Global

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 1

CNY 10

32K < tokens ≤ 128K

CNY 1.5

CNY 15

128K < tokens ≤ 256K

CNY 3

CNY 30

qwen3-vl-plus-2025-09-23

EU

non-thinking and thinking modes

0 < tokens ≤ 32K

CNY 1.499

CNY 11.991

32K < tokens ≤ 128K

CNY 2.248

CNY 17.986

128K < tokens ≤ 256K

CNY 4.497

CNY 35.972

Qwen-OCR

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Input price

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-vl-ocr

50% discount for batch inference

Chinese mainland

CNY 0.3

CNY 0.5

1 million tokens

qwen-vl-ocr-latest

50% discount for batch inference

Chinese mainland

CNY 0.3

CNY 0.5

1 million tokens

qwen-vl-ocr-2025-11-20

Chinese mainland

CNY 0.3

CNY 0.5

1 million tokens

qwen-vl-ocr-2025-08-28

Chinese mainland

CNY 5

CNY 5

1 million tokens

qwen-vl-ocr-2025-04-13

Chinese mainland

CNY 5

CNY 5

1 million tokens

qwen-vl-ocr-2024-10-28

Chinese mainland

CNY 5

CNY 5

1 million tokens

US (Virginia)

Model id

Deployment scope

Input price

Output price

qwen-vl-ocr

global

CNY 0.3

CNY 0.5

qwen-vl-ocr-2025-11-20

global

CNY 0.3

CNY 0.5

Singapore

Model id

Deployment scope

Input price

Output price

qwen-vl-ocr

international

CNY 0.514

CNY 1.174

qwen-vl-ocr-2025-11-20

international

CNY 0.514

CNY 1.174

Germany (Frankfurt)

Model id

Deployment scope

Input price

Output price

qwen-vl-ocr

global

CNY 0.3

CNY 0.5

qwen-vl-ocr-2025-11-20

global

CNY 0.3

CNY 0.5

Qwen-Audio

You are charged for input tokens and output tokens.

One second of audio is calculated as 25 tokens. Audio clips shorter than one second are also billed as 25 tokens.

China (Beijing)

Model ID

Deployment region

Input price (per 1M tokens)

Output price (per 1M tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-audio-turbo

Chinese mainland

Currently available for free trial only.

After the free quota is exhausted, you can no longer call the model. We recommend using Omni (Qwen-Omni) as an alternative.

100,000 tokens

qwen-audio-turbo-latest

Chinese mainland

Qwen Math

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quotaGuidelines

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-math-plus

Chinese mainland

CNY 4

CNY 12

1 million tokens

qwen-math-turbo

Chinese mainland

CNY 2

CNY 6

Qwen-Coder

You are charged for input tokens and output tokens.

If the model supports context cache, only input tokens receive a discount.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input tokens

Input price

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3-coder-plus

Currently equivalent to qwen3-coder-plus-2025-09-23
Discount for context caching

Chinese mainland

0<tokens≤32K

CNY 4

CNY 16

1 million tokens

32K<tokens≤128K

CNY 6

CNY 24

128K<tokens≤256K

CNY 10

CNY 40

256K<tokens≤1M

CNY 20

CNY 200

qwen3-coder-plus-2025-09-23

Chinese mainland

0<tokens≤32K

CNY 4

CNY 16

1 million tokens

32K<tokens≤128K

CNY 6

CNY 24

128K<tokens≤256K

CNY 10

CNY 40

256K<tokens≤1M

CNY 20

CNY 200

qwen3-coder-plus-2025-07-22

Chinese mainland

0<tokens≤32K

CNY 4

CNY 16

1 million tokens

32K<tokens≤128K

CNY 6

CNY 24

128K<tokens≤256K

CNY 10

CNY 40

256K<tokens≤1M

CNY 20

CNY 200

qwen3-coder-flash

Currently equivalent to qwen3-coder-flash-2025-07-28

Chinese mainland

0<tokens≤32K

CNY 1

CNY 4

1 million tokens

32K<tokens≤128K

CNY 1.5

CNY 6

128K<tokens≤256K

CNY 2.5

CNY 10

256K<tokens≤1M

CNY 5

CNY 25

qwen3-coder-flash-2025-07-28

Chinese mainland

0<tokens≤32K

CNY 1

CNY 4

1 million tokens

32K<tokens≤128K

CNY 1.5

CNY 6

128K<tokens≤256K

CNY 2.5

CNY 10

256K<tokens≤1M

CNY 5

CNY 25

More models

Model ID

Deployment scope

Input tokens

Input price

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-coder-plus

Chinese mainland

No tiered pricing

CNY 3.5

CNY 7

1 million tokens

qwen-coder-turbo

Chinese mainland

No tiered pricing

CNY 2

CNY 6

1 million tokens

US (Virginia)

Model ID

Deployment scope

Input tokens

Input price (per million tokens)

Output price (per million tokens)

qwen3-coder-plus

Alias for qwen3-coder-plus-2025-09-23

global

0<token≤32K

CNY 4

CNY 16

32K<token≤128K

CNY 6

CNY 24

128K<token≤256K

CNY 10

CNY 40

256K<token≤1M

CNY 20

CNY 200

qwen3-coder-plus-2025-09-23

global

0<token≤32K

CNY 4

CNY 16

32K<token≤128K

CNY 6

CNY 24

128K<token≤256K

CNY 10

CNY 40

256K<token≤1M

CNY 20

CNY 200

qwen3-coder-plus-2025-07-22

global

0<token≤32K

CNY 4

CNY 16

32K<token≤128K

CNY 6

CNY 24

128K<token≤256K

CNY 10

CNY 40

256K<token≤1M

CNY 20

CNY 200

qwen3-coder-flash

Alias for qwen3-coder-flash-2025-07-28

global

0<token≤32K

CNY 1

CNY 4

32K<token≤128K

CNY 1.5

CNY 6

128K<token≤256K

CNY 2.5

CNY 10

256K<token≤1M

CNY 5

CNY 25

qwen3-coder-flash-2025-07-28

global

0<token≤32K

CNY 1

CNY 4

32K<token≤128K

CNY 1.5

CNY 6

128K<token≤256K

CNY 2.5

CNY 10

256K<token≤1M

CNY 5

CNY 25

Singapore

Model ID

Deployment scope

Request tokens

Input price (per 1 million tokens)

Output price (per 1 million tokens)

qwen3-coder-plus

This is an alias for qwen3-coder-plus-2025-09-23

International

0<token≤32K

CNY 7.339

CNY 36.696

32K<token≤128K

CNY 13.211

CNY 66.053

128K<token≤256K

CNY 22.018

CNY 110.089

256K<token≤1M

CNY 44.035

CNY 440.354

qwen3-coder-plus-2025-09-23

International

0<token≤32K

CNY 7.339

CNY 36.696

32K<token≤128K

CNY 13.211

CNY 66.053

128K<token≤256K

CNY 22.018

CNY 110.089

256K<token≤1M

CNY 44.035

CNY 440.354

qwen3-coder-plus-2025-07-22

International

0<token≤32K

CNY 7.339

CNY 36.696

32K<token≤128K

CNY 13.211

CNY 66.053

128K<token≤256K

CNY 22.018

CNY 110.089

256K<token≤1M

CNY 44.035

CNY 440.354

qwen3-coder-flash

This is an alias for qwen3-coder-flash-2025-07-28

International

0<token≤32K

CNY 2.202

CNY 11.009

32K<token≤128K

CNY 3.67

CNY 18.348

128K<token≤256K

CNY 5.871

CNY 29.357

256K<token≤1M

CNY 11.743

CNY 70.457

qwen3-coder-flash-2025-07-28

International

0<token≤32K

CNY 2.202

CNY 11.009

32K<token≤128K

CNY 3.67

CNY 18.348

128K<token≤256K

CNY 5.871

CNY 29.357

256K<token≤1M

CNY 11.743

CNY 70.457

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens

Input price (per 1M tokens)

Output price (per 1M tokens)

qwen3-coder-plus

Currently an alias for qwen3-coder-plus-2025-09-23

global

0 < tokens ≤ 32K

CNY 4

CNY 16

32K<Token≤128K

CNY 6

CNY 24

128K < tokens ≤ 256K

CNY 10

CNY 40

256K < tokens ≤ 1M

CNY 20

CNY 200

qwen3-coder-plus-2025-09-23

global

0 < tokens ≤ 32K

CNY 4

CNY 16

32K < tokens ≤ 128K

CNY 6

CNY 24

128K < tokens ≤ 256K

CNY 10

CNY 40

256K < tokens ≤ 1M

CNY 20

CNY 200

qwen3-coder-plus-2025-07-22

global

0 < tokens ≤ 32K

CNY 4

CNY 16

32K < tokens ≤ 128K

CNY 6

CNY 24

128K < tokens ≤ 256K

CNY 10

CNY 40

256K < tokens ≤ 1M

CNY 20

CNY 200

qwen3-coder-flash

Currently an alias for qwen3-coder-flash-2025-07-28

global

0 < tokens ≤ 32K

CNY 1

CNY 4

32K < tokens ≤ 128K

CNY 1.5

CNY 6

128K < tokens ≤ 256K

CNY 2.5

CNY 10

256K < tokens ≤ 1M

CNY 5

CNY 25

qwen3-coder-flash-2025-07-28

global

0 < tokens ≤ 32K

CNY 1

CNY 4

32K < tokens ≤ 128K

CNY 1.5

CNY 6

128K < tokens ≤ 256K

CNY 2.5

CNY 10

256K < tokens ≤ 1M

CNY 5

CNY 25

Qwen translation models

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-mt-plus

Chinese mainland

CNY 1.8

CNY 5.4

1 million tokens

qwen-mt-flash

Chinese mainland

CNY 0.7

CNY 1.95

1 million tokens

qwen-mt-lite

Chinese mainland

CNY 0.6

CNY 1.6

1 million tokens

qwen-mt-turbo

Chinese mainland

CNY 0.7

CNY 1.95

1 million tokens

US (Virginia)

Model ID

Deployment scope

Input price

Output price

qwen-mt-flash

Global

CNY 0.7

CNY 1.95

qwen-mt-lite

Global

CNY 0.6

CNY 1.6

qwen-mt-lite-us

US

CNY 0.881

CNY 2.642

qwen-mt-plus

Global

CNY 1.8

CNY 5.4

Singapore

Model ID

Deployment scope

Input price

Output price

qwen-mt-plus

International

CNY 18.055

CNY 54.09

qwen-mt-flash

International

CNY 1.174

CNY 3.596

qwen-mt-lite

International

CNY 0.881

CNY 2.642

qwen-mt-turbo

International

CNY 1.174

CNY 3.596

Germany (Frankfurt)

Model ID

Deployment scope

Input price

Output price

qwen-mt-plus

Global

CNY 1.8

CNY 5.4

qwen-mt-flash

Global

CNY 0.7

CNY 1.95

qwen-mt-lite

Global

CNY 0.6

CNY 1.6

Qwen data mining

You are charged for input tokens and output tokens.

China (Beijing)

Model id

Region

Input price

Output price

Free quota (Note)

qwen-doc-turbo

Chinese mainland

CNY 0.6

CNY 1

No free quota

Qwen Deep Research

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Service region

Input price

Output price

Free quota(Note)

qwen-deep-research

Chinese mainland

CNY 54

CNY 163

No free quota

qwen-deep-research-2025-12-15

Chinese mainland

CNY 79

CNY 236

No free quota

Tongyi Xiaomi conversation analysis

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price (per 1M tokens)

Output price (per 1M tokens)

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio.

tongyi-xiaomi-analysis-flash

Chinese mainland

CNY 0.2

CNY 0.4

1 million tokens

tongyi-xiaomi-analysis-pro

Chinese mainland

CNY 1.0

CNY 2.7

1 million tokens

Text generation - Qwen (open-source)

Qwen3.6

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input token range

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

Non-thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

Chinese mainland

0<token≤256K

CNY 1.8

CNY 10.8

CNY 10.8

1 million tokens

qwen3.6-27b

Chinese mainland

0<token≤256K

CNY 3

CNY 18

CNY 18

1 million tokens

US (Virginia)

Model ID

Deployment scope

Input token range

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

Global

0<token≤256K

CNY 1.8

CNY 10.8

CNY 10.8

Singapore

Model ID

Deployment scope

Input token range

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

International

0<token≤256K

CNY 2.810325

CNY 16.86195

CNY 16.86195

qwen3.6-27b

International

0<token≤256K

CNY 4.49652

CNY 26.97912

CNY 26.97912

Germany (Frankfurt)

Model ID

Deployment scope

Input token range

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Non-thinking mode

Thinking mode (chain of thought + answer)

qwen3.6-35b-a3b

Global

0<token≤256K

CNY 1.8

CNY 10.8

CNY 10.8

Qwen3.5

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input tokens

Input price

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

Non-thinking mode

Thinking mode

qwen3.5-397b-a17b

Chinese mainland

0<token≤128K

CNY 1.2

CNY 7.2

CNY 7.2

1 million tokens

128K<token≤256K

CNY 3

CNY 18

CNY 18

qwen3.5-122b-a10b

Chinese mainland

0<token≤128K

CNY 0.8

CNY 6.4

CNY 6.4

1 million tokens

128K<token≤256K

CNY 2

CNY 16

CNY 16

qwen3.5-27b

Chinese mainland

0<token≤128K

CNY 0.6

CNY 4.8

CNY 4.8

1 million tokens

128K<token≤256K

CNY 1.8

CNY 14.4

CNY 14.4

qwen3.5-35b-a3b

Chinese mainland

0<token≤128K

CNY 0.4

CNY 3.2

CNY 3.2

1 million tokens

128K<token≤256K

CNY 1.6

CNY 12.8

CNY 12.8

US (Virginia)

Model ID

Deployment scope

Input tokens

Input price

Output price

Non-thinking mode

Thinking mode

qwen3.5-397b-a17b

Global

0<token≤128K

CNY 1.2

CNY 7.2

CNY 7.2

128K<token≤256K

CNY 3

CNY 18

CNY 18

qwen3.5-122b-a10b

Global

0<token≤128K

CNY 0.8

CNY 6.4

CNY 6.4

128K<token≤256K

CNY 2

CNY 16

CNY 16

qwen3.5-27b

Global

0<token≤128K

CNY 0.6

CNY 4.8

CNY 4.8

128K<token≤256K

CNY 1.8

CNY 14.4

CNY 14.4

qwen3.5-35b-a3b

Global

0<token≤128K

CNY 0.4

CNY 3.2

CNY 3.2

128K<token≤256K

CNY 1.6

CNY 12.8

CNY 12.8

Singapore

Model ID

Deployment scope

Input tokens

Input price

Output price

Non-thinking mode

Thinking mode

qwen3.5-397b-a17b

International

0<token≤256K

CNY 4.404

CNY 26.421

CNY 26.421

qwen3.5-122b-a10b

International

0<token≤256K

CNY 2.936

CNY 23.486

CNY 23.486

qwen3.5-27b

International

0<token≤256K

CNY 2.202

CNY 17.614

CNY 17.614

qwen3.5-35b-a3b

International

0<token≤256K

CNY 1.835

CNY 14.678

CNY 14.678

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens

Input price

Output price

Non-thinking mode

Thinking mode

qwen3.5-397b-a17b

Global

0<token≤128K

CNY 1.2

CNY 7.2

CNY 7.2

128K<token≤256K

CNY 3

CNY 18

CNY 18

qwen3.5-122b-a10b

Global

0<token≤128K

CNY 0.8

CNY 6.4

CNY 6.4

128K<token≤256K

CNY 2

CNY 16

CNY 16

qwen3.5-27b

Global

0<token≤128K

CNY 0.6

CNY 4.8

CNY 4.8

128K<token≤256K

CNY 1.8

CNY 14.4

CNY 14.4

qwen3.5-35b-a3b

Global

0<token≤128K

CNY 0.4

CNY 3.2

CNY 3.2

128K<token≤256K

CNY 1.6

CNY 12.8

CNY 12.8

Qwen3

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Mode

Input price (per 1M tokens)

Output price (per 1M tokens)

Free quota (Note)

Valid for 90 days after activating Model Studio

Non-thinking mode

Thinking mode (CoT + answer)

qwen3-next-80b-a3b-thinking

chinese mainland

Thinking mode only

CNY 1

-

CNY 10

1 million tokens

qwen3-next-80b-a3b-instruct

chinese mainland

Non-Thinking mode only

CNY 1

CNY 4

-

1 million tokens

qwen3-235b-a22b-thinking-2507

chinese mainland

Thinking mode only

CNY 2

-

CNY 20

1 million tokens

qwen3-235b-a22b-instruct-2507

chinese mainland

Non-Thinking mode only

CNY 2

CNY 8

-

1 million tokens

qwen3-30b-a3b-thinking-2507

chinese mainland

Thinking mode only

CNY 0.75

-

CNY 7.5

1 million tokens

qwen3-30b-a3b-instruct-2507

chinese mainland

Non-Thinking mode only

CNY 0.75

CNY 3

-

1 million tokens

qwen3-235b-a22b

chinese mainland

Non-Thinking and Thinking modes

CNY 2

CNY 8

CNY 20

1 million tokens

qwen3-32b

chinese mainland

Non-Thinking and Thinking modes

CNY 2

CNY 8

CNY 20

1 million tokens

qwen3-30b-a3b

chinese mainland

Non-Thinking and Thinking modes

CNY 0.75

CNY 3

CNY 7.5

1 million tokens

qwen3-14b

chinese mainland

Non-Thinking and Thinking modes

CNY 1

CNY 4

CNY 10

1 million tokens

qwen3-8b

chinese mainland

Non-Thinking and Thinking modes

CNY 0.5

CNY 2

CNY 5

1 million tokens

US (Virginia)

Model ID

Deployment scope

Mode

Input price (per 1M tokens)

Output price (per 1M tokens)

Non-thinking mode

Thinking mode (CoT + answer)

qwen3-next-80b-a3b-thinking

global

Thinking mode only

CNY 1

-

CNY 10

qwen3-next-80b-a3b-instruct

global

Non-Thinking mode only

CNY 1

CNY 4

-

qwen3-235b-a22b-thinking-2507

global

Thinking mode only

CNY 1.688

-

CNY 16.88

qwen3-235b-a22b-instruct-2507

global

Non-Thinking mode only

CNY 1.688

CNY 6.752

-

qwen3-30b-a3b-thinking-2507

global

Thinking mode only

CNY 0.75

-

CNY 7.5

qwen3-30b-a3b-instruct-2507

global

Non-Thinking mode only

CNY 0.75

CNY 3

-

qwen3-235b-a22b

global

Non-Thinking and Thinking modes

CNY 2

CNY 8

CNY 20

qwen3-32b

global

Non-Thinking and Thinking modes

CNY 1.174

CNY 4.697

CNY 4.697

qwen3-30b-a3b

global

Non-Thinking and Thinking modes

CNY 0.75

CNY 3

CNY 7.5

qwen3-14b

global

Non-Thinking and Thinking modes

CNY 1

CNY 4

CNY 10

qwen3-8b

global

Non-Thinking and Thinking modes

CNY 0.5

CNY 2

CNY 5

Singapore

Model ID

Deployment scope

Mode

Input price (per 1M tokens)

Output price (per 1M tokens)

Free quota (Note)

Valid for 90 days after activating Model Studio

Non-thinking mode

Thinking mode (CoT + answer)

qwen3-next-80b-a3b-thinking

international

Thinking mode only

CNY 1.101

-

CNY 8.807

No free quota

qwen3-next-80b-a3b-instruct

international

Non-Thinking mode only

CNY 1.101

CNY 8.807

-

No free quota

qwen3-235b-a22b-thinking-2507

international

Thinking mode only

CNY 1.688

-

CNY 16.88

No free quota

qwen3-235b-a22b-instruct-2507

international

Non-Thinking mode only

CNY 1.688

CNY 6.752

-

No free quota

qwen3-30b-a3b-thinking-2507

international

Thinking mode only

CNY 1.468

-

CNY 17.614

No free quota

qwen3-30b-a3b-instruct-2507

international

Non-Thinking mode only

CNY 1.468

CNY 5.871

-

No free quota

qwen3-235b-a22b

international

Non-Thinking and Thinking modes

CNY 5.137

CNY 20.55

CNY 61.65

No free quota

qwen3-32b

international

Non-Thinking and Thinking modes

CNY 1.174

CNY 4.697

CNY 4.697

No free quota

qwen3-30b-a3b

international

Non-Thinking and Thinking modes

CNY 1.468

CNY 5.871

CNY 17.614

No free quota

qwen3-14b

international

Non-Thinking and Thinking modes

CNY 2.569

CNY 10.275

CNY 30.825

No free quota

qwen3-8b

international

Non-Thinking and Thinking modes

CNY 1.321

CNY 5.137

CNY 15.412

No free quota

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input price (per 1M tokens)

Output price (per 1M tokens)

Non-thinking mode

Thinking mode (CoT + answer)

qwen3-next-80b-a3b-thinking

global

Thinking mode only

CNY 1

-

CNY 10

qwen3-next-80b-a3b-instruct

global

Non-Thinking mode only

CNY 1

CNY 4

-

qwen3-235b-a22b-thinking-2507

global

Thinking mode only

CNY 1.688

-

CNY 16.88

qwen3-235b-a22b-instruct-2507

global

Non-Thinking mode only

CNY 1.688

CNY 6.752

-

qwen3-30b-a3b-thinking-2507

global

Thinking mode only

CNY 0.75

-

CNY 7.5

qwen3-30b-a3b-instruct-2507

global

Non-Thinking mode only

CNY 0.75

CNY 3

-

qwen3-235b-a22b

global

Non-Thinking and Thinking modes

CNY 2

CNY 8

CNY 20

qwen3-32b

global

Non-Thinking and Thinking modes

CNY 1.174

CNY 4.697

CNY 4.697

qwen3-30b-a3b

global

Non-Thinking and Thinking modes

CNY 0.75

CNY 3

CNY 7.5

qwen3-14b

global

Non-Thinking and Thinking modes

CNY 1

CNY 4

CNY 10

qwen3-8b

global

Non-Thinking and Thinking modes

CNY 0.5

CNY 2

CNY 5

Qwen-Omni

You are billed for input and output tokens. For details on how tokens are calculated for different modalities, see billing and rate limits.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quotaGuidelines

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text + audio

Billed for audio only

qwen2.5-omni-7b

Chinese mainland

CNY 0.6

CNY 38

CNY 2

CNY 2.4

CNY 6

CNY 76

1 million tokens (any modality)

Singapore

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Text

Audio

Image/video

Text

Text-only input

Text

Multimodal input

Text + audio

Billed for audio only

qwen2.5-omni-7b

international

CNY 0.734

CNY 49.613

CNY 2.055

CNY 2.936

CNY 6.165

CNY 99.153

Qwen3-Omni-Captioner

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-omni-30b-a3b-captioner

Chinese mainland

CNY 15.8

CNY 12.7

1 million tokens

Singapore

Model ID

Deployment scope

Input price

Output price

qwen3-omni-30b-a3b-captioner

international

CNY 27.962

CNY 22.458

Qwen-VL

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

chain of thought + answer

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3-vl-235b-a22b-thinking

Chinese mainland

Thinking mode

CNY 2

CNY 20

1 million tokens

qwen3-vl-235b-a22b-instruct

Chinese mainland

Non-thinking mode

CNY 2

CNY 8

1 million tokens

qwen3-vl-32b-thinking

Chinese mainland

Thinking mode

CNY 2

CNY 20

1 million tokens

qwen3-vl-32b-instruct

Chinese mainland

Non-thinking mode

CNY 2

CNY 8

1 million tokens

qwen3-vl-30b-a3b-thinking

Chinese mainland

Thinking mode

CNY 0.75

CNY 7.5

1 million tokens

qwen3-vl-30b-a3b-instruct

Chinese mainland

Non-thinking mode

CNY 0.75

CNY 3

1 million tokens

qwen3-vl-8b-thinking

Chinese mainland

Thinking mode

CNY 0.5

CNY 5

1 million tokens

qwen3-vl-8b-instruct

Chinese mainland

Non-thinking mode

CNY 0.5

CNY 2

1 million tokens

US (Virginia)

Model id

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

chain of thought + answer

qwen3-vl-235b-a22b-thinking

Global

Thinking mode

CNY 2

CNY 20

qwen3-vl-235b-a22b-instruct

Global

Non-thinking mode

CNY 2

CNY 8

qwen3-vl-32b-thinking

Global

Thinking mode

CNY 1.174

CNY 4.697

qwen3-vl-32b-instruct

Global

Non-thinking mode

CNY 1.174

CNY 4.697

qwen3-vl-30b-a3b-thinking

Global

Thinking mode

CNY 0.75

CNY 7.5

qwen3-vl-30b-a3b-instruct

Global

Non-thinking mode

CNY 0.75

CNY 3

qwen3-vl-8b-thinking

Global

Thinking mode

CNY 0.5

CNY 5

qwen3-vl-8b-instruct

Global

Non-thinking mode

CNY 0.5

CNY 2

Singapore

Model id

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

chain of thought + answer

qwen3-vl-235b-a22b-thinking

International

Thinking mode

CNY 2.936

CNY 29.357

qwen3-vl-235b-a22b-instruct

International

Non-thinking mode

CNY 2.936

CNY 11.743

qwen3-vl-32b-thinking

International

Thinking mode

CNY 1.174

CNY 4.697

qwen3-vl-32b-instruct

International

Non-thinking mode

CNY 1.174

CNY 4.697

qwen3-vl-30b-a3b-thinking

International

Thinking mode

CNY 1.468

CNY 17.614

qwen3-vl-30b-a3b-instruct

International

Non-thinking mode

CNY 1.468

CNY 5.871

qwen3-vl-8b-thinking

International

Thinking mode

CNY 1.321

CNY 15.412

qwen3-vl-8b-instruct

International

Non-thinking mode

CNY 1.321

CNY 5.137

Germany (Frankfurt)

Model id

Deployment scope

Mode

Input price (per 1 million tokens)

Output price (per 1 million tokens)

chain of thought + answer

qwen3-vl-235b-a22b-thinking

Global

Thinking mode

CNY 2

CNY 20

qwen3-vl-235b-a22b-instruct

Global

Non-thinking mode

CNY 2

CNY 8

qwen3-vl-32b-thinking

Global

Thinking mode

CNY 1.174

CNY 4.697

qwen3-vl-32b-instruct

Global

Non-thinking mode

CNY 1.174

CNY 4.697

qwen3-vl-30b-a3b-thinking

Global

Thinking mode

CNY 0.75

CNY 7.5

qwen3-vl-30b-a3b-instruct

Global

Non-thinking mode

CNY 0.75

CNY 3

qwen3-vl-8b-thinking

Global

Thinking mode

CNY 0.5

CNY 5

qwen3-vl-8b-instruct

Global

Non-thinking mode

CNY 0.5

CNY 2

Qwen-Audio

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment region

Input price

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen2-audio-instruct

Chinese mainland

Currently available for a free trial only.

Once the free quota is exhausted, you can no longer call the model. We recommend Omni (Qwen-Omni) as an alternative.

100,000 tokens

qwen-audio-chat

Chinese mainland

Qwen-Coder

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input tokens

Input price

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3-coder-next

Chinese mainland

0<Token≤32K

CNY 1

CNY 4

1 million tokens

32K<Token≤128K

CNY 1.5

CNY 6

128K<Token≤256K

CNY 2.5

CNY 10

qwen3-coder-480b-a35b-instruct

Chinese mainland

0<Token≤32K

CNY 6

CNY 24

1 million tokens

32K<Token≤128K

CNY 9

CNY 36

128K<Token≤200K

CNY 15

CNY 60

qwen3-coder-30b-a3b-instruct

Chinese mainland

0<Token≤32K

CNY 1.5

CNY 6

1 million tokens

32K<Token≤128K

CNY 2.25

CNY 9

128K<Token≤200K

CNY 3.75

CNY 15

US (Virginia)

Model ID

Deployment scope

Input tokens

Input price

Output price

qwen3-coder-480b-a35b-instruct

global

0<Token≤32K

CNY 6

CNY 24

32K<Token≤128K

CNY 9

CNY 36

128K<Token≤200K

CNY 15

CNY 60

qwen3-coder-30b-a3b-instruct

global

0<Token≤32K

CNY 1.5

CNY 6

32K<Token≤128K

CNY 2.25

CNY 9

128K<Token≤200K

CNY 3.75

CNY 15

Singapore

Model ID

Deployment scope

Input tokens

Input price

Output price

qwen3-coder-next

international

0<Token≤32K

CNY 2.202

CNY 11.009

32K<Token≤128K

CNY 3.67

CNY 18.348

128K<Token≤256K

CNY 5.871

CNY 29.357

qwen3-coder-480b-a35b-instruct

international

0<Token≤32K

CNY 11.009

CNY 55.044

32K<Token≤128K

CNY 19.816

CNY 99.08

128K<Token≤200K

CNY 33.027

CNY 165.133

qwen3-coder-30b-a3b-instruct

international

0<Token≤32K

CNY 3.303

CNY 16.513

32K<Token≤128K

CNY 5.504

CNY 27.522

128K<Token≤200K

CNY 8.807

CNY 44.035

Germany (Frankfurt)

Model ID

Deployment scope

Input tokens

Input price

Output price

qwen3-coder-30b-a3b-instruct

global

0<Token≤32K

CNY 1.5

CNY 6

32K<Token≤128K

CNY 2.25

CNY 9

128K<Token≤200K

CNY 3.75

CNY 15

qwen3-coder-480b-a35b-instruct

global

0<Token≤32K

CNY 6

CNY 24

32K<Token≤128K

CNY 9

CNY 36

128K<Token≤200K

CNY 15

CNY 60

qwen3-coder-next

EU

0<Token≤32K

CNY 2.248

CNY 11.241

32K<Token≤128K

CNY 3.747

CNY 18.736

128K<Token≤256K

CNY 5.995

CNY 29.977

Text generation - third-party models

DeepSeek

You are charged for input tokens and output tokens.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Chain of thought + answer

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

deepseek-v4-pro

context caching discount

Chinese mainland

CNY 12

CNY 24

1 million tokens

deepseek-v4-flash

context caching discount

Chinese mainland

CNY 1

CNY 2

1 million tokens

deepseek-v3.2

context caching discount

Chinese mainland

CNY 2

CNY 3

1 million tokens

deepseek-v3.2-exp

Chinese mainland

CNY 2

CNY 3

1 million tokens

deepseek-v3.1

Chinese mainland

CNY 4

CNY 12

1 million tokens

deepseek-r1

50% discount for batch inference

Chinese mainland

CNY 4

CNY 16

1 million tokens

deepseek-r1-0528

Chinese mainland

CNY 4

CNY 16

1 million tokens

deepseek-v3

50% discount for batch inference

Chinese mainland

CNY 2

CNY 8

1 million tokens

deepseek-r1-distill-qwen-1.5b

Chinese mainland

Limited-time free

deepseek-r1-distill-qwen-7b

Chinese mainland

CNY 0.5

CNY 1

1 million tokens

deepseek-r1-distill-qwen-14b

Chinese mainland

CNY 1

CNY 3

1 million tokens

deepseek-r1-distill-qwen-32b

Chinese mainland

CNY 2

CNY 6

1 million tokens

deepseek-r1-distill-llama-8b

Chinese mainland

Limited-time free

deepseek-r1-distill-llama-70b

Chinese mainland

Currently available for free trial only.

You cannot call the model after the free quota is exhausted. Alternative models include Deep thinking, DeepSeek - Alibaba Cloud, and Kimi - Alibaba Cloud.

1 million tokens

US (Virginia)

Model ID

Deployment scope

Input price

Output price

Chain of thought + answer

deepseek-v4-pro

context caching discount

Global

CNY 12

CNY 24

deepseek-v4-flash

context caching discount

Global

CNY 1

CNY 2

Singapore

Model ID

Deployment scope

Input price

Output price

Chain of thought + answer

deepseek-v4-pro

context caching discount

International

CNY 17.986

CNY 35.972

deepseek-v4-flash

context caching discount

International

CNY 1.499

CNY 2.998

deepseek-v3.2

context caching discount

International

CNY 4.272

CNY 12.815

Germany (Frankfurt)

Model ID

Deployment scope

Input price

Output price

Chain of thought + answer

deepseek-v4-pro

context caching discount

Global

CNY 12

CNY 24

deepseek-v4-flash

context caching discount

Global

CNY 1

CNY 2

DeepSeek-SiliconFlow

China (Beijing)

Model ID

Service region

Input price

Output price

Chain of thought and answer

Free quota

siliconflow/deepseek-v3.2

Chinese mainland

CNY 2

CNY 3

None

siliconflow/deepseek-v3.1-terminus

Chinese mainland

CNY 4

CNY 12

siliconflow/deepseek-r1-0528

Chinese mainland

CNY 4

CNY 16

siliconflow/deepseek-v3-0324

Chinese mainland

CNY 2

CNY 8

DeepSeek-Kuaishou Wanqing

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Chain of thought + answer

Free quota

vanchin/deepseek-v3.2-think

Discount for context caching

Chinese mainland

CNY 2

CNY 3

None

vanchin/deepseek-v3.1-terminus

Discount for context caching

Chinese mainland

CNY 4

CNY 12

vanchin/deepseek-r1

Discount for context caching

Chinese mainland

CNY 4

CNY 16

vanchin/deepseek-v3

Discount for context caching

Chinese mainland

CNY 2

CNY 8

vanchin/deepseek-ocr

Chinese mainland

CNY 0.216

CNY 0.216

Kimi

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Mode

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

kimi-k2.7-code

Chinese mainland

Thinking mode only

CNY 6.5

CNY 27

1 million tokens

kimi-k2.6

Chinese mainland

Thinking and Non-Thinking modes

CNY 6.5

CNY 27

1 million tokens

kimi-k2.5

Chinese mainland

Thinking and Non-Thinking modes

CNY 4

CNY 21

1 million tokens

kimi-k2-thinking

Chinese mainland

Thinking mode only

CNY 4

CNY 16

1 million tokens

Moonshot-Kimi-K2-Instruct

Chinese mainland

Non-Thinking mode only

CNY 4

CNY 16

1 million tokens

US (Virginia)

Model id

Deployment scope

Mode

Input price

Output price

kimi-k2.7-code

global

Thinking mode only

CNY 6.5

CNY 27

kimi-k2.5

global

Thinking and Non-Thinking modes

CNY 4

CNY 21

Germany (Frankfurt)

Model id

Deployment scope

Mode

Input price

Output price

kimi-k2.7-code

global

Thinking mode only

CNY 6.5

CNY 27

kimi-k2.5

global

Thinking and Non-Thinking modes

CNY 4

CNY 21

Kimi-Moonshot AI

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Chain of thought

Free quota (Note)

kimi/kimi-k2.7-code

Discount for context caching

Chinese mainland

CNY 6.5

CNY 27

None

kimi/kimi-k2.6

Discount for context caching

Chinese mainland

CNY 6.5

CNY 27

kimi/kimi-k2.5

Discount for context caching

Chinese mainland

CNY 4

CNY 21

GLM

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

glm-5.1

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 6

CNY 24

1 million tokens

32K<token≤200K

CNY 8

CNY 28

glm-5

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 4

CNY 18

1 million tokens

32K<token≤198K

CNY 6

CNY 22

glm-4.7

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 3

CNY 14

1 million tokens

32K<token≤166K

CNY 4

CNY 16

glm-4.6

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 3

CNY 14

1 million tokens

32K<token≤166K

CNY 4

CNY 16

glm-4.5

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 3

CNY 14

1 million tokens

32K<token≤96K

CNY 4

CNY 16

glm-4.5-air

Chinese mainland

non-thinking and thinking modes

0<token≤32K

CNY 0.8

CNY 6

1 million tokens

32K<token≤96K

CNY 1.2

CNY 8

US (Virginia)

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

glm-5.1

Global

non-thinking and thinking modes

0<token≤32K

CNY 6

CNY 24

32K<token≤200K

CNY 8

CNY 28

Singapore

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

glm-5.1

International

non-thinking and thinking modes

0<token≤200K

CNY 10.492

CNY 32.974

Not available

Germany (Frankfurt)

Model ID

Deployment scope

Mode

Input tokens

Input price

Output price

Chain of thought and answer

glm-5.1

Global

non-thinking and thinking modes

0<token≤32K

CNY 6

CNY 24

32K<token≤200K

CNY 8

CNY 28

GLM-Zhipu AI

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Mode

Input price

Output price

Chain of thought and answer

Free quota (Note)

ZHIPU/GLM-5.1

Chinese mainland

Non-thinking and Thinking modes

CNY 8

CNY 28

None

ZHIPU/GLM-5

Chinese mainland

Non-thinking and Thinking modes

CNY 6

CNY 22

None

MiniMax

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment region

Mode

Input price

Output price

Chain of thought

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

MiniMax-M2.5

Chinese mainland

Chain of thought mode only

CNY 2.1

CNY 8.4

1 million tokens

MiniMax-M2.1

Chinese mainland

Chain of thought mode only

CNY 2.1

CNY 8.4

MiniMax

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Region

Mode

Input price

Output price

Chain of thought

Free quota (note)

MiniMax/MiniMax-M3

Discount on context caching

Chinese mainland

Thinking and non-thinking modes

CNY 4.2

CNY 16.8

None

MiniMax/MiniMax-M2.7

Discount on context caching

Chinese mainland

Thinking mode only

CNY 2.1

CNY 8.4

MiniMax/MiniMax-M2.5

Discount on context caching

Chinese mainland

Thinking mode only

CNY 2.1

CNY 8.4

MiniMax/MiniMax-M2.1

Discount on context caching

Chinese mainland

Thinking mode only

CNY 2.1

CNY 8.4

MiMo-Xiaomi

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Region

Input tokens

Input price

Output price

Chain of thought

Free quota (Note)

xiaomi/mimo-v2.5-pro

Chinese mainland

0 < tokens ≤ 256K

CNY 7

CNY 21

None

256K < tokens ≤ 1M

CNY 14

CNY 42

Stepfun-StepFun

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Chain of thought and answer

Free quota (Note)

stepfun/step-3.7-flash

Chinese mainland

CNY 1.35

CNY 8.1

None

Image generation

You are not charged for input. You are charged for output based on the number of successfully generated images.

Formula: Cost = Image unit price × Number of images generated.

Notes:

  • Cost does not depend on image resolution or aspect ratio.

  • Failed requests incur no cost and do not consume your free quota.

Billing example: Some images fail to generate

Assume the image unit price is CNY 0.10 per image. If you call the API to generate four images but only three image URLs return successfully, the system charges only for the three successfully generated images.

  • Number billed: 3 images.

  • Cost calculation: 0.1 × 3 = CNY 0.3.

Qwen text-to-image

You are billed for output only. For billing rules, see image generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-image-2.0-pro

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-2.0-pro-2026-04-22

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-2.0-pro-2026-03-03

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-2.0

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-2.0-2026-03-03

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-max

Currently an alias for qwen-image-max-2025-12-30

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-max-2025-12-30

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-plus

Currently an alias for qwen-image

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-plus-2026-01-09

Chinese mainland

CNY 0.2 per image

100 images

qwen-image

Chinese mainland

CNY 0.25 per image

100 images

Singapore

Model ID

Deployment scope

Output price

qwen-image-2.0-pro

International

CNY 0.550443 per image

qwen-image-2.0-pro-2026-04-22

International

CNY 0.550443 per image

qwen-image-2.0-pro-2026-03-03

International

CNY 0.550443 per image

qwen-image-2.0

International

CNY 0.256873 per image

qwen-image-2.0-2026-03-03

International

CNY 0.256873 per image

qwen-image-max

Currently an alias for qwen-image-max-2025-12-30

International

CNY 0.550443 per image

qwen-image-max-2025-12-30

International

CNY 0.550443 per image

qwen-image-plus

Currently an alias for qwen-image

International

CNY 0.220177 per image

qwen-image-plus-2026-01-09

International

CNY 0.220177 per image

qwen-image

International

CNY 0.256873 per image

Qwen image editing

You are billed for output only. For billing rules, see image generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Region

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-image-2.0-pro

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-2.0-pro-2026-04-22

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-2.0-pro-2026-03-03

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-2.0

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-2.0-2026-03-03

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-edit-max

Currently equivalent to qwen-image-edit-max-2026-01-16

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-edit-max-2026-01-16

Chinese mainland

CNY 0.5 per image

100 images

qwen-image-edit-plus

Currently equivalent to qwen-image-edit-plus-2025-10-30

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-edit-plus-2025-12-15

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-edit-plus-2025-10-30

Chinese mainland

CNY 0.2 per image

100 images

qwen-image-edit

Chinese mainland

CNY 0.3 per image

100 images

Singapore

Model ID

Region

Output price

qwen-image-2.0-pro

International

CNY 0.550443 per image

qwen-image-2.0-pro-2026-04-22

International

CNY 0.550443 per image

qwen-image-2.0-pro-2026-03-03

International

CNY 0.550443 per image

qwen-image-2.0

International

CNY 0.256873 per image

qwen-image-2.0-2026-03-03

International

CNY 0.256873 per image

qwen-image-edit-max

Currently equivalent to qwen-image-edit-max-2026-01-16

International

CNY 0.550443 per image

qwen-image-edit-max-2026-01-16

International

CNY 0.550443 per image

qwen-image-edit-plus

Currently equivalent to qwen-image-edit-plus-2025-10-30

International

CNY 0.220177 per image

qwen-image-edit-plus-2025-12-15

International

CNY 0.220177 per image

qwen-image-edit-plus-2025-10-30

International

CNY 0.220177 per image

qwen-image-edit

International

CNY 0.330266 per image

Qwen Image Translation

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Price

Free quota(Note)

Expires 90 days after you activate Alibaba Cloud Model Studio

qwen-mt-image

Chinese mainland

CNY 0.003 per image

100 images

Z-Image

You are billed for output only. For billing rules, see image generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

z-image-turbo

Chinese mainland

Prompt rewriting disabled (prompt_extend=false): CNY 0.1 per image

Prompt rewriting enabled (prompt_extend=true): CNY 0.2 per image

100 images

Singapore

Model id

Deployment scope

Output price

z-image-turbo

international

Prompt rewriting disabled (prompt_extend=false): CNY 0.110089 per image

Prompt rewriting enabled (prompt_extend=true): CNY 0.220177 per image

Wanx text-to-image

You are billed for output only. For billing rules, see image generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.6-t2i

Chinese mainland

CNY 0.20 per image

50 images

wan2.5-t2i-preview

Chinese mainland

CNY 0.20 per image

50 images

wan2.2-t2i-plus

Chinese mainland

CNY 0.20 per image

100 images

wan2.2-t2i-flash

Chinese mainland

CNY 0.14 per image

100 images

wanx2.1-t2i-plus

Chinese mainland

CNY 0.20 per image

500 images

wanx2.1-t2i-turbo

Chinese mainland

CNY 0.14 per image

500 images

wanx2.0-t2i-turbo

Chinese mainland

CNY 0.04 per image

500 images

wanx-v1

Chinese mainland

CNY 0.16 per image

500 images

US (Virginia)

Model ID

Deployment scope

Output price

wan2.6-t2i

global

CNY 0.20 per image

Singapore

Model ID

Deployment scope

Output price

wan2.6-t2i

international

CNY 0.220177 per image

wan2.5-t2i-preview

international

CNY 0.220177 per image

wan2.2-t2i-plus

international

CNY 0.366962 per image

wan2.2-t2i-flash

international

CNY 0.183481 per image

wanx2.1-t2i-plus

international

CNY 0.366962 per image

wanx2.1-t2i-turbo

international

CNY 0.183481 per image

Germany (Frankfurt)

Model ID

Deployment scope

Output price

wan2.6-t2i

global

CNY 0.20 per image

Wanx image generation and editing

You are billed for output only. For billing rules, see image generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-image-pro

chinese mainland

CNY 0.50 per image

50 images

wan2.7-image

chinese mainland

CNY 0.20 per image

50 images

wan2.6-image

chinese mainland

CNY 0.20 per image

50 images

US (Virginia)

Model ID

Deployment scope

Output price

wan2.6-image

global

CNY 0.20 per image

Singapore

Model ID

Deployment scope

Output price

wan2.7-image-pro

international

CNY 0.562065 per image

wan2.7-image

international

CNY 0.220177 per image

wan2.6-image

international

CNY 0.220177 per image

Germany (Frankfurt)

Model ID

Deployment scope

Output price

wan2.6-image

global

CNY 0.20 per image

Wan general-purpose image editing

You are billed for output only. For billing rules, see image generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.5-i2i-preview

Chinese mainland

CNY 0.20 per image

50 images

wanx2.1-imageedit

Chinese mainland

CNY 0.14 per image

500 images

Singapore

Model ID

Deployment scope

Output price

wan2.5-i2i-preview

International

CNY 0.220177 per image

Wanx Sketch-to-Image

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Available regions

Price per image

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wanx-sketch-to-image-lite

Chinese mainland

CNY 0.06 per image

500 images

Wanx Image Inpainting

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wanx-x-painting

Chinese mainland

Currently available for a free trial only.

After you exhaust the free quota, you can no longer call this model. For alternatives, see Image editing - Qwen or Image editing - Wan 2.1.

500 images

Portrait style repaint

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Service region

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio.

wanx-style-repaint-v1

Chinese mainland

CNY 0.12/image

500 images

Image background generation

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after activation of Alibaba Cloud Model Studio

wanx-background-generation-v2

Chinese mainland

CNY 0.08 per image

500 images

Image outpainting

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model id

Deployment scope

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

image-out-painting

Chinese mainland

CNY 0.18 per image

500 images

Human instance segmentation

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Availability

Price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio.

image-instance-segmentation

Chinese mainland

Currently available as a free trial only.

You cannot call the model once you exhaust the free quota.

500 images

Image erasing and inpainting

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio.

image-erase-completion

Chinese mainland

Free trial only.

Once the free quota is exhausted, you will be unable to call the model. Consider using Image editing - Qwen or Image editing - Wan2.1 as alternatives.

500 images

Virtual model

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Service region

Output price

Free Tier (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wanx-virtualmodel

Chinese mainland

Currently available for a free trial only.

Once your free quota is exhausted, you cannot call the model. Consider using image editing - Qwen or image editing - Wan2.1 as alternatives.

500 images each

virtualmodel-v2

Chinese mainland

Footwear model

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Expires 90 days after you activate Alibaba Cloud Model Studio

shoemodel-v1

Chinese mainland

Available for free trial only.

Once you use up the free quota, you can no longer call the model.

500 images

Creative poster generation

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wanx-poster-generation-v1

Chinese mainland

Currently available as a free trial only.

Once the free quota is used up, this model becomes unavailable. Consider using Image editing - Qwen or Image editing - Wan2.1 as alternatives.

500 images

Portrait generation - FaceChain

  • facechain-facedetect: Free for a limited time.

  • facechain-finetune: Billed per training run. You are not charged for failed requests.

  • facechain-generation: You are not billed for input. You are billed for each successfully generated image. For billing rules, see image generation.

China (Beijing)

Model ID

Service region

Unit price

Free quota(note)

facechain-facedetect

Chinese mainland

Free for a limited time

Free for a limited time

facechain-finetune

Chinese mainland

CNY 2.5 per training run

50 training runs

Valid for 90 days after application approval

facechain-generation

Chinese mainland

CNY 0.18 per image

500 images

Valid for 90 days after application approval

Creative text generation - WordArt

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model id

Region

Price

Free quota (Note)

Expires 90 days after you activate Alibaba Cloud Model Studio.

wordart-texture

Chinese mainland

CNY 0.08 per image

500 images

wordart-semantic

Chinese mainland

CNY 0.24 per image

AI Virtual Try-on - OutfitAnyone

  • aitryon: You are billed for output only. For billing details, see image generation.

  • aitryon-plus: You are billed for output only. For billing details, see image generation.

  • aitryon-parsing-v1: You are billed per input image, and output is free. You are not charged for failed requests.

  • aitryon-refiner: You are billed for output only. For billing details, see image generation.

China (Beijing)

Model id

Deployment scope

Free quotaGuidelines

Valid for 90 days after you activate Alibaba Cloud Model Studio

aitryon

Chinese mainland

400 images

aitryon-plus

Chinese mainland

400 images

aitryon-parsing-v1

Chinese mainland

400 images

aitryon-refiner

Chinese mainland

100 images

China (Beijing)

Model id

Deployment scope

Unit price

Discount

Pricing tier

aitryon

Chinese mainland

CNY 0.20 per image

None

None

aitryon-plus

Chinese mainland

CNY 0.50 per image

None

None

aitryon-parsing-v1

Chinese mainland

CNY 0.004 per image

None

None

aitryon-refiner

Chinese mainland

CNY 0.30 per image

None

Generated quantity ≤ 25 images

CNY 0.275 per image

8%

25 images < generated quantity ≤ 125 images

CNY 0.25 per image

16%

125 images < generated quantity ≤ 250 images

CNY 0.225 per image

25%

250 images < generated quantity ≤ 1,250 images

CNY 0.20 per image

33%

1,250 images < generated quantity ≤ 2,500 images

CNY 0.175 per image

42%

2,500 images < generated quantity ≤ 25,000 images

CNY 0.15 per image

50%

Generated quantity > 25,000 images

Image generation - third-party models

Kling-Image-Generation

You are billed for output only. For billing rules, see image generation.

China (Beijing)

Model ID

Deployment scope

Output resolution

Price

Free quota (Note)

kling/kling-v3-image-generation

Chinese mainland

1K

CNY 0.2 per image

No free quota.

2K

CNY 0.2 per image

kling/kling-v3-omni-image-generation

Chinese mainland

1K

CNY 0.2 per image

2K

CNY 0.2 per image

4K

CNY 0.4 per image

Music generation

Billing: Charged per second of audio output. Input is free of charge.

China (Beijing)

Model ID

Region

Price (per second)

Free quotaNote

Valid for 90 days after you activate Alibaba Cloud Model Studio

fun-music-preview

Chinese mainland

CNY 0.005

1,000 seconds

fun-music-v1

Chinese mainland

CNY 0.002

Text-to-speech

Qwen-TTS

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Qwen3-TTS-Instruct-Flash

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-instruct-flash

Chinese mainland

CNY 0.8

Free

10,000 characters

qwen3-tts-instruct-flash-2026-01-26

Chinese mainland

CNY 0.8

Free

10,000 characters

Qwen3-TTS-VD

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vd-2026-01-26

Chinese mainland

CNY 0.8

Free

10,000 characters

Qwen3-TTS-VC

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vc-2026-01-22

Chinese mainland

CNY 0.8

Free

10,000 characters

Qwen3-TTS-Flash

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-flash

Currently an alias for qwen3-tts-flash-2025-11-27

Chinese mainland

CNY 0.8

Free

10,000 characters

qwen3-tts-flash-2025-11-27

Chinese mainland

CNY 0.8

Free

10,000 characters

qwen3-tts-flash-2025-09-18

Chinese mainland

CNY 0.8

Free

For accounts activating Alibaba Cloud Model Studio after 00:00 (UTC+8) on November 13, 2025: 10,000 characters

Qwen-TTS

Pricing is based on both input and output tokens.

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-tts-flash

Chinese mainland

CNY 1.6

CNY 10

1 million tokens

qwen-tts-latest

Chinese mainland

CNY 1.6

CNY 10

1 million tokens

qwen-tts-2025-05-22

Chinese mainland

CNY 1.6

CNY 10

1 million tokens

qwen-tts-2025-04-10

Chinese mainland

CNY 1.6

CNY 10

1 million tokens

Singapore

Qwen3-TTS-Instruct-Flash

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-instruct-flash

International

CNY 0.8

qwen3-tts-instruct-flash-2026-01-26

International

CNY 0.8

Qwen3-TTS-VD

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-vd-2026-01-26

International

CNY 0.8

Qwen3-TTS-VC

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-vc-2026-01-22

International

CNY 0.8

Qwen3-TTS-Flash

Pricing is based on the number of characters in the input text. Output is free of charge.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-flash

Currently an alias for qwen3-tts-flash-2025-11-27

International

CNY 0.733924

qwen3-tts-flash-2025-11-27

International

CNY 0.733924

qwen3-tts-flash-2025-09-18

International

CNY 0.733924

Qwen-TTS-Realtime

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Qwen3-TTS-Instruct-Flash-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-instruct-flash-realtime

Chinese mainland

CNY 1

Not charged

10,000 characters

qwen3-tts-instruct-flash-realtime-2026-01-22

Chinese mainland

CNY 1

Not charged

10,000 characters

Qwen3-TTS-VD-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vd-realtime-2026-01-15

Chinese mainland

CNY 1

Not charged

10,000 characters

qwen3-tts-vd-realtime-2025-12-16

Chinese mainland

CNY 1

Not charged

10,000 characters

Qwen3-TTS-VC-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-vc-realtime-2026-01-15

Chinese mainland

CNY 1

Not charged

10,000 characters

qwen3-tts-vc-realtime-2025-11-27

Chinese mainland

10,000 characters

Qwen3-TTS-Flash-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen3-tts-flash-realtime

Chinese mainland

CNY 1

Not charged

10,000 characters for users activating Alibaba Cloud Model Studio after 00:00 (UTC+8) on November 13, 2025.

qwen3-tts-flash-realtime-2025-11-27

Chinese mainland

CNY 1

Not charged

10,000 characters

qwen3-tts-flash-realtime-2025-09-18

Chinese mainland

CNY 1

Not charged

10,000 characters for users activating Alibaba Cloud Model Studio after 00:00 (UTC+8) on November 13, 2025.

Qwen-TTS-Realtime

Pricing rule: Billed based on the number of input and output tokens.

Model ID

Deployment scope

Input price (per 1 million tokens)

Output price (per 1 million tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-tts-realtime

Chinese mainland

CNY 2.4

CNY 12

1 million tokens

qwen-tts-realtime-latest

Chinese mainland

CNY 2.4

CNY 12

1 million tokens

qwen-tts-realtime-2025-07-15

Chinese mainland

CNY 2.4

CNY 12

1 million tokens

Singapore

Qwen3-TTS-Instruct-Flash-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-instruct-flash-realtime

International

CNY 1

qwen3-tts-instruct-flash-realtime-2026-01-22

International

CNY 1

Qwen3-TTS-VD-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-vd-realtime-2026-01-15

International

CNY 0.954101

qwen3-tts-vd-realtime-2025-12-16

International

CNY 0.954101

Qwen3-TTS-VC-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-vc-realtime-2026-01-15

International

CNY 0.954101

qwen3-tts-vc-realtime-2025-11-27

International

Qwen3-TTS-Flash-Realtime

Pricing rule: Billed based on the number of input characters. Output is not charged.

Model ID

Deployment scope

Input price (per 10,000 characters)

qwen3-tts-flash-realtime

International

CNY 0.954101

qwen3-tts-flash-realtime-2025-11-27

International

CNY 0.954101

qwen3-tts-flash-realtime-2025-09-18

International

CNY 0.954101

Qwen-TTS voice cloning

You are billed for each new voice clone you create.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Price (per clone)

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen-voice-enrollment

Chinese mainland

CNY 0.01

1,000 voice clones/account

Singapore

Model ID

Deployment scope

Price (per clone)

qwen-voice-enrollment

International

CNY 0.01

Qwen-TTS voice design

Billing rules: You are billed for creating each new voice clone.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Price (per voice clone)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-voice-design

chinese mainland

CNY 0.2

10 voice clones/account

Singapore

Model ID

Deployment scope

Price (per voice clone)

qwen-voice-design

international

CNY 0.2

CosyVoice

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Billing rule: Billing is based on the number of input characters; output is not charged.

Model ID

Deployment scope

Input price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

cosyvoice-v3.5-plus

Chinese mainland

CNY 1.5

10,000 characters

cosyvoice-v3.5-flash

Chinese mainland

CNY 0.8

10,000 characters

cosyvoice-v3-plus

Chinese mainland

CNY 2

10,000 characters

cosyvoice-v3-flash

Chinese mainland

CNY 1

10,000 characters

cosyvoice-v2

Chinese mainland

CNY 2

10,000 characters

cosyvoice-v1

Chinese mainland

CNY 2

10,000 characters

Singapore

Billing rule: Billing is based on the number of input characters; output is not charged.

Model ID

Deployment scope

Input price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

cosyvoice-v3-plus

International

CNY 1.9082

N/A

cosyvoice-v3-flash

International

CNY 0.9541

N/A

Sambert

Billing is based on the number of input characters. Output is free.

China (Beijing)

Model ID

Deployment scope

Input price (per 10,000 characters)

Free quota(Note)

See the Java SDK

Chinese mainland

CNY 1

Each Alibaba Cloud account receives a free monthly quota of 30,000 characters per model.

MiniMax

Billing is based on the number of characters in the input text, and the output is free.

Voice cloning incurs a one-time fee. This fee is billed along with the speech synthesis fee when the cloned voice is first used.

Model name

Deployment scope

Price (per 10,000 characters)

Voice cloning fee

Free quota (Note)

MiniMax/speech-2.8-hd

Chinese mainland

CNY 3.5

CNY 9.9

(Charged when first used for speech synthesis)

None

MiniMax/speech-02-hd

Chinese mainland

CNY 3.5

MiniMax/speech-2.8-turbo

Chinese mainland

CNY 2

MiniMax/speech-02-turbo

Chinese mainland

CNY 2

Speech recognition and translation

Qwen-LiveTranslate-Flash-Realtime

Billing is based on the number of input and output tokens. For details about how tokens are calculated for different modalities, see Billing.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price (per 1,000,000 tokens)

Output price (per 1,000,000 tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio.

Input: audio

Input: image

Output: text

Output: audio

qwen3.5-livetranslate-flash-realtime

Chinese mainland

CNY 40

CNY 3.3

CNY 100

CNY 160

1 million tokens

qwen3.5-livetranslate-flash-realtime-2026-05-19

Chinese mainland

CNY 40

CNY 3.3

CNY 100

CNY 160

1 million tokens

qwen3-livetranslate-flash-realtime

Chinese mainland

CNY 64

CNY 8

CNY 64

CNY 240

1 million tokens

qwen3-livetranslate-flash-realtime-2025-09-22

Chinese mainland

CNY 64

CNY 8

CNY 64

CNY 240

1 million tokens

Singapore

Model ID

Deployment scope

Input price (per 1,000,000 tokens)

Output price (per 1,000,000 tokens)

Input: audio

Input: image

Output: text

Output: audio

qwen3.5-livetranslate-flash-realtime

International

CNY 56.207

CNY 4.122

CNY 149.884

CNY 224.826

qwen3.5-livetranslate-flash-realtime-2026-05-19

International

CNY 56.207

CNY 4.122

CNY 149.884

CNY 224.826

qwen3-livetranslate-flash-realtime

International

CNY 73.392

CNY 9.541

CNY 73.392

CNY 278.891

qwen3-livetranslate-flash-realtime-2025-09-22

International

CNY 73.392

CNY 9.541

CNY 73.392

CNY 278.891

Qwen-LiveTranslate-Flash

Billing is based on input and output tokens. For details on the token calculation rules for different modalities, see Billing Details.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price (per 1M tokens)

Output price (per 1M tokens)

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Input: audio

Input: image

Output: text

Output: audio

qwen3-livetranslate-flash

chinese mainland

CNY 10

CNY 4

CNY 10

CNY 40

1 million tokens

qwen3-livetranslate-flash-2025-12-01

chinese mainland

CNY 10

CNY 4

CNY 10

CNY 40

1 million tokens

Singapore

Model ID

Deployment scope

Input price (per 1M tokens)

Output price (per 1M tokens)

Input: audio

Input: image

Output: text

Output: audio

qwen3-livetranslate-flash

international

CNY 11.573

CNY 4.629

CNY 11.573

CNY 46.292

qwen3-livetranslate-flash-2025-12-01

international

CNY 11.573

CNY 4.629

CNY 11.573

CNY 46.292

Qwen-ASR

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Pricing rule: You are billed per second of input audio. Output is free of charge.

Model ID

Deployment scope

Input price

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3-asr-flash-filetrans

Chinese mainland

CNY 0.00022 per second

Not charged

36,000 seconds (10 hours)

qwen3-asr-flash-filetrans-2025-11-17

Chinese mainland

36,000 seconds (10 hours)

qwen3-asr-flash

Currently equivalent to qwen3-asr-flash-2025-09-08

Chinese mainland

36,000 seconds (10 hours)

qwen3-asr-flash-2026-02-10

Chinese mainland

36,000 seconds (10 hours)

qwen3-asr-flash-2025-09-08

Chinese mainland

36,000 seconds (10 hours)

US (Virginia)

Pricing rule: You are billed per second of input audio. Output is free of charge.

Model ID

Deployment scope

Input price

Output price

qwen3-asr-flash-us

US

CNY 0.000035 per second

Not charged

qwen3-asr-flash-2025-09-08-us

US

CNY 0.000035 per second

Singapore

Pricing rule: You are billed per second of input audio. Output is free of charge.

Model ID

Deployment scope

Input price

Output price

qwen3-asr-flash-filetrans

international

CNY 0.00026 per second

Not charged

qwen3-asr-flash-filetrans-2025-11-17

international

CNY 0.00026 per second

qwen3-asr-flash

Currently equivalent to qwen3-asr-flash-2025-09-08

international

CNY 0.00026 per second

qwen3-asr-flash-2026-02-10

international

CNY 0.00026 per second

qwen3-asr-flash-2025-09-08

international

CNY 0.00026 per second

Qwen-ASR-Realtime

Charges for input audio are calculated per second. Output is not charged.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3-asr-flash-realtime

chinese mainland

CNY 0.00033 per second

36,000 seconds (10 hours)

qwen3-asr-flash-realtime-2026-02-10

chinese mainland

36,000 seconds (10 hours)

qwen3-asr-flash-realtime-2025-10-27

chinese mainland

36,000 seconds (10 hours)

Singapore

Model ID

Deployment scope

Input price

qwen3-asr-flash-realtime

international

CNY 0.00066 per second

qwen3-asr-flash-realtime-2026-02-10

international

qwen3-asr-flash-realtime-2025-10-27

International

Fun-ASR

Audio file recognition

Billing rule: Billed per second of input audio. The output is not billed.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Free quotaGuidelines

Valid for 90 days after you activate Alibaba Cloud Model Studio

fun-asr

Chinese mainland

CNY 0.00022 per second

36,000 seconds (10 hours)

fun-asr-2025-11-07

Chinese mainland

36,000 seconds (10 hours)

fun-asr-2025-08-25

Chinese mainland

36,000 seconds (10 hours)

fun-asr-mtl

Chinese mainland

36,000 seconds (10 hours)

fun-asr-mtl-2025-08-25

Chinese mainland

36,000 seconds (10 hours)

Singapore

Model ID

Deployment scope

Input price

fun-asr

International

CNY 0.00026 per second

fun-asr-2025-11-07

International

fun-asr-2025-08-25

International

fun-asr-mtl

International

fun-asr-mtl-2025-08-25

International

Real-time speech recognition

Billing rule: Billed per second of input audio. The output is not billed.

China (Beijing)

Model ID

Deployment scope

Input price

Free quotaGuidelines

Valid for 90 days after you activate Alibaba Cloud Model Studio

fun-asr-realtime

Chinese mainland

CNY 0.00033 per second

36,000 seconds (10 hours)

fun-asr-realtime-2026-02-28

Chinese mainland

36,000 seconds (10 hours)

fun-asr-realtime-2025-11-07

Chinese mainland

36,000 seconds (10 hours)

fun-asr-realtime-2025-09-15

Chinese mainland

36,000 seconds (10 hours)

fun-asr-mtl-realtime

Chinese mainland

36,000 seconds (10 hours)

fun-asr-mtl-realtime-2025-12-10

Chinese mainland

36,000 seconds (10 hours)

fun-asr-flash-8k-realtime

Chinese mainland

CNY 0.00022 per second

36,000 seconds (10 hours)

fun-asr-flash-8k-realtime-2026-01-28

Chinese mainland

36,000 seconds (10 hours)

Singapore

Model ID

Deployment scope

Input price

fun-asr-realtime

International

CNY 0.00066 per second

fun-asr-realtime-2025-11-07

International

Paraformer

Audio file recognition

Charges apply per second of input audio. Output is not billed.

China (Beijing)

Model ID

Deployment scope

Input price

Free quota(Note)

paraformer-v2

Chinese mainland

CNY 0.00008 per second

36,000 seconds (10 hours)

Issued at 00:00 (UTC+8) on the first day of each month.

Expires after one month.

paraformer-8k-v2

Chinese mainland

paraformer-v1

Chinese mainland

paraformer-8k-v1

Chinese mainland

paraformer-mtl-v1

Chinese mainland

Real-time speech recognition

Charges apply per second of input audio. Output is not billed.

China (Beijing)

Model ID

Deployment scope

Input price

Free quota(Note)

paraformer-realtime-v2

Chinese mainland

CNY 0.00024 per second

36,000 seconds (10 hours)

Issued at 00:00 (UTC+8) on the first day of each month.

Expires after one month.

paraformer-realtime-v1

Chinese mainland

paraformer-realtime-8k-v2

Chinese mainland

paraformer-realtime-8k-v1

Chinese mainland

Video generation

You are not charged for input. You are charged for output based on the total duration of successfully generated videos (in seconds).

Formula: Cost = Video unit price × Video duration (seconds).

Notes:

  • Some models charge by output video resolution. Prices differ for resolutions such as 480P, 720P, and 1080P.

  • Some models charge by output video edition. Prices differ for editions such as Standard Edition and Professional Edition.

  • Some models charge by output video aspect ratio. Prices differ for aspect ratios such as 1:1 and 3:4.

  • Some models use a flat rate, regardless of resolution, edition, or aspect ratio.

  • Failed requests incur no cost and do not consume your free quota.

HappyHorse-Text-to-video

Charges are based on output only. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output resolution

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

happyhorse-1.0-t2v

Chinese mainland

720P

CNY 0.9 per second

10 seconds

1080P

CNY 1.6 per second

US (Virginia)

Model ID

Deployment scope

Output resolution

Output price

happyhorse-1.0-t2v

Global

720P

CNY 0.9 per second

1080P

CNY 1.6 per second

Singapore

Model ID

Deployment scope

Output resolution

Output price

happyhorse-1.0-t2v

International

720P

CNY 1.049188 per second

1080P

CNY 1.798608 per second

Germany (Frankfurt)

Model ID

Deployment scope

Output resolution

Output price

happyhorse-1.0-t2v

Global

720P

CNY 0.9 per second

1080P

CNY 1.6 per second

HappyHorse: Image-to-video (first frame)

Billing applies to output only. For details, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output video resolution

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

happyhorse-1.0-i2v

Chinese mainland

720P

CNY 0.9 per second

10 seconds

1080P

CNY 1.6 per second

US (Virginia)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.0-i2v

Global

720P

CNY 0.9 per second

1080P

CNY 1.6 per second

Singapore

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.0-i2v

International

720P

CNY 1.049188 per second

1080P

CNY 1.798608 per second

Germany (Frankfurt)

Model ID

Deployment scope

Output video resolution

Output price

happyhorse-1.0-i2v

Global

720P

CNY 0.9 per second

1080P

CNY 1.6 per second

HappyHorse: Reference-to-video

You are billed for output only. See video generation for billing rules.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Output video resolution

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

happyhorse-1.0-r2v

Chinese mainland

720P

CNY 0.9 per second

10 seconds

1080P

CNY 1.6 per second

US (Virginia)

Model id

Deployment scope

Output video resolution

Output price

happyhorse-1.0-r2v

global

720P

CNY 0.9 per second

1080P

CNY 1.6 per second

Singapore

Model id

Deployment scope

Output video resolution

Output price

happyhorse-1.0-r2v

international

720P

CNY 1.049188 per second

1080P

CNY 1.798608 per second

Germany (Frankfurt)

Model id

Deployment scope

Output video resolution

Output price

happyhorse-1.0-r2v

global

720P

CNY 0.9 per second

1080P

CNY 1.6 per second

HappyHorse-Video Editing

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.

Model id

Deployment scope

Output resolution

Price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

happyhorse-1.0-video-edit

Chinese mainland

720P

CNY 0.9 per second

10 seconds

1080P

CNY 1.6 per second

US (Virginia)

Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.

Model id

Deployment scope

Output resolution

Price

happyhorse-1.0-video-edit

global

720P

CNY 1.049188 per second

1080P

CNY 1.798608 per second

Singapore

Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.

Model id

Deployment scope

Output resolution

Price

happyhorse-1.0-video-edit

global

720P

CNY 1.049188 per second

1080P

CNY 1.798608 per second

Germany (Frankfurt)

Billing rules: Billing is based on the video duration (seconds) for both input and output videos. Failed requests do not incur charges or consume your free quota.

Model id

Deployment scope

Output resolution

Price

happyhorse-1.0-video-edit

global

720P

CNY 1.049188 per second

1080P

CNY 1.798608 per second

Wanx-text-to-video

Only output is billed. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Output resolution

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wan2.7-t2v-2026-04-25

Chinese mainland

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

wan2.7-t2v

Chinese mainland

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

wan2.6-t2v

Chinese mainland

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

wan2.5-t2v-preview

Chinese mainland

480P

CNY 0.3 per second

50 seconds

720P

CNY 0.6 per second

1080P

CNY 1 per second

wan2.2-t2v-plus

Chinese mainland

480P

CNY 0.14 per second

50 seconds

1080P

CNY 0.70 per second

wanx2.1-t2v-turbo

Chinese mainland

480P

CNY 0.24 per second

200 seconds

720P

CNY 0.24 per second

wanx2.1-t2v-plus

Chinese mainland

720P

CNY 0.70 per second

200 seconds

US (Virginia)

Model id

Deployment scope

Output resolution

Output price

wan2.6-t2v

Global

720P

CNY 0.6 per second

1080P

CNY 1 per second

wan2.6-t2v-us

US

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

Singapore

Model id

Deployment scope

Output resolution

Output price

wan2.7-t2v-2026-04-25

International

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

wan2.7-t2v

International

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

wan2.6-t2v

International

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

wan2.5-t2v-preview

International

480P

CNY 0.366961 per second

720P

CNY 0.733923 per second

1080P

CNY 1.100885 per second

wan2.2-t2v-plus

International

480P

CNY 0.146785 per second

1080P

CNY 0.733924 per second

wan2.1-t2v-turbo

International

480P

CNY 0.264213 per second

720P

CNY 0.264213 per second

wan2.1-t2v-plus

International

720P

CNY 0.733924 per second

Germany (Frankfurt)

Model id

Deployment scope

Output resolution

Output price

wan2.6-t2v

Global

720P

CNY 0.6 per second

1080P

CNY 1 per second

Wan Image-to-Video

Only output is billed. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output video type

Output video resolution

Price

Free quotaGuidelines

Valid for 90 days after activating Model Studio

wan2.7-i2v-2026-04-25

Chinese mainland

Video with audio

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

wan2.7-i2v

Chinese mainland

Video with audio

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

Singapore

Model ID

Deployment scope

Output video type

Output video resolution

Price

wan2.7-i2v-2026-04-25

International

Video with audio

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

wan2.7-i2v

International

Video with audio

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

Wan: Image-to-video (first frame)

Only output is billed. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model id

Deployment scope

Output video type

Output video resolution

Unit price

Free quota (Note)

Valid for 90 days after activating Model Studio

wan2.6-i2v-flash

Chinese mainland

video with audio

audio=true

720P

CNY 0.3/second

50 seconds

1080P

CNY 0.5/second

silent video

audio=false

720P

CNY 0.15/second

1080P

CNY 0.25/second

wan2.6-i2v

Chinese mainland

video with audio

720P

CNY 0.6/second

50 seconds

1080P

CNY 1/second

wan2.5-i2v-preview

Chinese mainland

video with audio

480P

CNY 0.3/second

50 seconds

720P

CNY 0.6/second

1080P

CNY 1/second

wan2.2-i2v-flash

Chinese mainland

silent video

480P

CNY 0.10/second

50 seconds

720P

CNY 0.20/second

1080P

CNY 0.48/second

wan2.2-i2v-plus

Chinese mainland

silent video

480P

CNY 0.14/second

50 seconds

1080P

CNY 0.70/second

wanx2.1-i2v-turbo

Chinese mainland

silent video

480P

CNY 0.24/second

200 seconds

720P

CNY 0.24/second

wanx2.1-i2v-plus

Chinese mainland

silent video

720P

CNY 0.70/second

200 seconds

US (Virginia)

Model id

Deployment scope

Output video type

Output video resolution

Unit price

wan2.6-i2v

Global

video with audio

720P

CNY 0.6/second

1080P

CNY 1/second

wan2.6-i2v-us

US

video with audio

720P

CNY 0.733924/second

1080P

CNY 1.100886/second

Singapore

Model id

Deployment scope

Output video type

Output video resolution

Unit price

wan2.6-i2v-flash

International

video with audio

audio=true

720P

CNY 0.366962/second

1080P

CNY 0.550443/second

silent video

audio=false

720P

CNY 0.183481/second

1080P

CNY 0.275221/second

wan2.6-i2v

International

video with audio

720P

CNY 0.733924/second

1080P

CNY 1.100886/second

wan2.5-i2v-preview

International

video with audio

480P

CNY 0.366961/second

720P

CNY 0.733923/second

1080P

CNY 1.100885/second

wan2.2-i2v-flash

International

silent video

480P

CNY 0.110089/second

720P

CNY 0.264213/second

wan2.2-i2v-plus

International

silent video

480P

CNY 0.146785/second

1080P

CNY 0.733924/second

wan2.1-i2v-turbo

International

silent video

480P

CNY 0.264213/second

720P

CNY 0.264213/second

wan2.1-i2v-plus

International

silent video

720P

CNY 0.733924/second

Germany (Frankfurt)

Model id

Deployment scope

Output video type

Output video resolution

Unit price

wan2.6-i2v

Global

video with audio

720P

CNY 0.6/second

1080P

CNY 1/second

Wanx - image-to-video (first and last frames)

Only output is billed. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output video resolution

Price

Free quotaGuidelines

Valid for 90 days after you activate Alibaba Cloud Model Studio

wanx2.2-kf2v-flash

chinese mainland

480P

CNY 0.10/second

50 seconds

720P

CNY 0.20/second

1080P

CNY 0.48/second

wanx2.1-kf2v-plus

chinese mainland

720P

CNY 0.70/second

200 seconds

Singapore

Model ID

Deployment scope

Output video resolution

Price

wanx2.1-kf2v-plus

international

720P

CNY 0.733924/second

Wan - Reference-to-video

Billing rules: Both input and output videos are billed based on video duration (seconds). Failed requests are not billed and do not consume the free quota.

Billing formula: Billed duration = Input video duration (up to 5 seconds) + Output video duration.

  • The billable duration of the input video does not exceed 5 seconds. For calculation rules, see Billing and rate limiting.

  • The billable duration of the output video is the length of the successfully generated video in seconds.

China (Beijing)

Model ID

Deployment scope

Output video type

Output video resolution

Rate

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

wan2.7-r2v

Chinese mainland

audio video

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

wan2.6-r2v-flash

Chinese mainland

audio video

audio=true

720P

CNY 0.3 per second

50 seconds

1080P

CNY 0.5 per second

silent video

audio=false

720P

CNY 0.15 per second

1080P

CNY 0.25 per second

wan2.6-r2v

Chinese mainland

audio video

720P

CNY 0.6 per second

50 seconds

1080P

CNY 1 per second

US (Virginia)

Model ID

Deployment scope

Output video type

Output video resolution

Rate

wan2.6-r2v

global

audio video

720P

CNY 0.6 per second

1080P

CNY 1 per second

Singapore

Model ID

Deployment scope

Output video type

Output video resolution

Rate

wan2.7-r2v

international

audio video

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

wan2.6-r2v-flash

international

audio video

audio=true

720P

CNY 0.366962 per second

1080P

CNY 0.550443 per second

silent video

audio=false

720P

CNY 0.183481 per second

1080P

CNY 0.275221 per second

wan2.6-r2v

international

audio video

720P

CNY 0.733924 per second

1080P

CNY 1.100886 per second

Germany (Frankfurt)

Model ID

Deployment scope

Output video type

Output video resolution

Rate

wan2.6-r2v

global

audio video

720P

CNY 0.6 per second

1080P

CNY 1 per second

Wan video editing

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Pricing rule: Both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wan2.7-videoedit

Chinese mainland

720P

CNY 0.6/second

50 seconds

1080P

CNY 1/second

Pricing rule: Input is not billed. Output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Output price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wanx2.1-vace-plus

Chinese mainland

720P

CNY 0.70/second

50 seconds

Singapore

Pricing rule: Both input and output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Input and output price

wan2.7-videoedit

international

720P

CNY 0.733924/second

1080P

CNY 1.100886/second

Pricing rule: Input is not billed. Output videos are billed by video duration (seconds). Failed requests are not billed and do not consume the free quota.

Model ID

Deployment scope

Output video resolution

Output price

wanx2.1-vace-plus

international

720P

CNY 0.733924/second

Wan - digital human

  • wan2.2-s2v-detect: Input is billed per image for each successful request, regardless of the detection result. Output is free.

  • wan2.2-s2v: Input is free, while output is billed based on the duration (in seconds) of successfully generated videos. For billing details, see Video generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

wan2.2-s2v-detect

Chinese mainland

Input image: CNY 0.004/image

200 images

wan2.2-s2v

Chinese mainland

Output video:

  • 480p: CNY 0.5/second

  • 720p: CNY 0.9/second

100 seconds

Wanx-Image-to-Motion

Only output is billed. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output video mode

Output price

Free Tier(note)

Valid for 90 days after activation

wan2.2-animate-move

Chinese mainland

standard modewan-std

CNY 0.4 per second

50 seconds

Valid for 90 days after activation

professional modewan-pro

CNY 0.6/second

Singapore

Model ID

Deployment scope

Output video mode

Output price

wan2.2-animate-move

International

standard modewan-std

CNY 0.880709 per second

professional modewan-pro

CNY 1.321063 per second

Wan - Video face swap

Only output is billed. For billing rules, see video generation.
Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Output mode

Output price

Free quota(Note)

Valid for 90 days after activating Model Studio

wan2.2-animate-mix

Chinese mainland

standard modewan-std

CNY 0.6 per second

50 seconds

professional modewan-pro

CNY 0.9 per second

Singapore

Model ID

Deployment scope

Output mode

Output price

wan2.2-animate-mix

International

standard modewan-std

CNY 1.321063 per second

professional modewan-pro

CNY 1.908202 per second

AnimateAnyone

  • animate-anyone-detect-gen2: Input is billed, but output is free. You are charged for each input image in a successful request, regardless of the detection result.

  • animate-anyone-template-gen2: Input is free, while output is billed by the second for each successfully generated video. For billing rules, see video generation.

  • animate-anyone-gen2: Input is free, while output is billed by the second for each successfully generated video. For billing rules, see video generation.

China (Beijing)

Model ID

Service region

Unit price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

animate-anyone-detect-gen2

Chinese mainland

input image: CNY 0.004 per image

200 images

animate-anyone-template-gen2

Chinese mainland

output video: CNY 0.08 per second

1,800 seconds (30 minutes)

animate-anyone-gen2

Chinese mainland

output video: CNY 0.08 per second

1,800 seconds (30 minutes)

EMO

  • emo-detect-v1: The input is billed, and the output is free. Billing is based on the number of images processed. You are charged once for each input image in a successful request, regardless of the detection result.

  • emo-v1: The input is free, and the output is billed. Output is billed per second of successfully generated video. For pricing rules, see video generation.

China (Beijing)

Model id

Region

Unit price

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

emo-detect-v1

Chinese mainland

Input image: CNY 0.004/image

200 images

emo-v1

Chinese mainland

Output video:

  • 1:1 video: CNY 0.08/second

  • 3:4 video: CNY 0.16/second

1,800 seconds (30 minutes)

LivePortrait

  • liveportrait-detect: You are billed for input images, while output is free. Each image is billed once for every successful request, regardless of the detection result.

  • liveportrait: Input is free, but output is billed. You are billed based on the duration (in seconds) of successfully generated videos. For billing rules, see video generation.

China (Beijing)

Model ID

Region

Price

Quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

liveportrait-detect

Chinese mainland

CNY 0.004 per input image

200 images

liveportrait

Chinese mainland

CNY 0.02 per second of output video

1,800 seconds (30 minutes)

Emoji sticker

  • emoji-detect-v1: You are billed for each input image processed in a successful request, regardless of the detection result. The output is not billed.

  • emoji-v1: Input is free. You are billed for the output based on the duration (in seconds) of each successfully generated video. For billing rules, see video generation.

China (Beijing)

Model ID

Deployment scope

Unit price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

emoji-detect-v1

Chinese mainland

Input image: CNY 0.004 per image

200 images

emoji-v1

Chinese mainland

Output video: CNY 0.08 per second

1,800 seconds (30 minutes)

VideoRetalk

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model ID

Deployment scope

Output price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

videoretalk

Chinese mainland

CNY 0.08 per second

1,800 seconds (30 minutes)

Video style transfer

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model id

Deployment scope

Output video resolution

Unit price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

video-style-transform

chinese mainland

540P

CNY 0.2 per second

600 seconds

720P

CNY 0.5 per second

Video generation - third-party model

Pixverse text-to-video

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model id

Deployment scope

Video type

Video resolution

Price

Free quotaGuidelines

pixverse/pixverse-c1-t2v

Chinese mainland

Video with audio

audio=true

360P

CNY 0.24 per second

No free quota

540P

CNY 0.30 per second

720P

CNY 0.39 per second

1080P

CNY 0.71 per second

Silent video

audio=false

360P

CNY 0.18 per second

540P

CNY 0.24 per second

720P

CNY 0.30 per second

1080P

CNY 0.56 per second

pixverse/pixverse-v6-t2v

Chinese mainland

Video with audio

audio=true

360P

CNY 0.21 per second

No free quota

540P

CNY 0.27 per second

720P

CNY 0.36 per second

1080P

CNY 0.68 per second

Silent video

audio=false

360P

CNY 0.15 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.53 per second

pixverse/pixverse-v5.6-t2v

Chinese mainland

Video with audio

audio=true

360P

CNY 0.47 per second

No free quota

540P

CNY 0.47 per second

720P

CNY 0.53 per second

1080P

CNY 0.70 per second

Silent video

audio=false

360P

CNY 0.21 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.44 per second

pixverse/pixverse-v5.6-it2v

Chinese mainland

Video with audio

audio=true

360P

CNY 0.47 per second

No free quota

540P

CNY 0.47 per second

720P

CNY 0.53 per second

1080P

CNY 0.70 per second

Silent video

audio=false

360P

CNY 0.21 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.44 per second

Pika: Image-to-video (initial frame)

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model ID

Region

Video type

Resolution

Price

Free quota(Note)

pixverse/pixverse-c1-it2v

Chinese mainland

video with audio

audio=true

360P

CNY 0.24 per second

No free quota

540P

CNY 0.30 per second

720P

CNY 0.39 per second

1080P

CNY 0.71 per second

silent video

audio=false

360P

CNY 0.18 per second

540P

CNY 0.24 per second

720P

CNY 0.30 per second

1080P

CNY 0.56 per second

pixverse/pixverse-v6-it2v

Chinese mainland

video with audio

audio=true

360P

CNY 0.21 per second

No free quota

540P

CNY 0.27 per second

720P

CNY 0.36 per second

1080P

CNY 0.68 per second

silent video

audio=false

360P

CNY 0.15 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.53 per second

pixverse/pixverse-v5.6-it2v

Chinese mainland

video with audio

audio=true

360P

CNY 0.47 per second

No free quota

540P

CNY 0.47 per second

720P

CNY 0.53 per second

1080P

CNY 0.70 per second

silent video

audio=false

360P

CNY 0.21 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.44 per second

Pika - Image-to-video (first and last frames)

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model ID

Deployment scope

Output video type

Output video resolution

Output price

Free quota(Note)

pixverse/pixverse-c1-kf2v

Chinese mainland

video with audio

audio=true

360P

CNY 0.24 per second

No free quota

540P

CNY 0.30 per second

720P

CNY 0.39 per second

1080P

CNY 0.71 per second

silent video

audio=false

360P

CNY 0.18 per second

540P

CNY 0.24 per second

720P

CNY 0.30 per second

1080P

CNY 0.56 per second

pixverse/pixverse-v6-kf2v

Chinese mainland

video with audio

audio=true

360P

CNY 0.21 per second

No free quota

540P

CNY 0.27 per second

720P

CNY 0.36 per second

1080P

CNY 0.68 per second

silent video

audio=false

360P

CNY 0.15 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.53 per second

pixverse/pixverse-v5.6-kf2v

Chinese mainland

video with audio

audio=true

360P

CNY 0.47 per second

No free quota

540P

CNY 0.47 per second

720P

CNY 0.53 per second

1080P

CNY 0.70 per second

silent video

audio=false

360P

CNY 0.21 per second

540P

CNY 0.21/s

720P

CNY 0.27/second

1080P

CNY 0.44 per second

Pika-Reference-to-Video

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model id

Deployment scope

Output type

Resolution

Price

Free quota(Note)

pixverse/pixverse-c1-r2v

Chinese mainland

Video with audio

audio=true

360P

CNY 0.24 per second

No free quota

540P

CNY 0.3 per second

720P

CNY 0.39 per second

1080P

CNY 0.71 per second

silent video

audio=false

360P

CNY 0.18 per second

540P

CNY 0.24 per second

720P

CNY 0.3 per second

1080P

CNY 0.56 per second

pixverse/pixverse-v5.6-r2v

Chinese mainland

Video with audio

audio=true

360P

CNY 0.47 per second

No free quota

540P

CNY 0.47 per second

720P

CNY 0.53 per second

1080P

CNY 0.7 per second

silent video

audio=false

360P

CNY 0.21 per second

540P

CNY 0.21 per second

720P

CNY 0.27 per second

1080P

CNY 0.44 per second

Kling-Video-Generation

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model id

Deployment scope

Video type

Video resolution

Price

Free quota (Note)

kling/kling-v3-video-generation

Chinese mainland

Silent video

720P

CNY 0.6 per second

No free quota

1080P

CNY 0.8 per second

Video with audio

720P

CNY 0.9 per second

1080P

CNY 1.2 per second

kling/kling-v3-omni-video-generation

Chinese mainland

Silent video (without reference video)

720P

CNY 0.6 per second

No free quota

1080P

CNY 0.8 per second

Silent video (with reference video)

720P

CNY 0.9 per second

1080P

CNY 1.2 per second

Video with audio (without reference video)

720P

CNY 0.9 per second

1080P

CNY 1.2 per second

Vidu-Text-to-video

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model id

Deployment scope

Output video resolution

Output price

Free quota (Note)

vidu/viduq3-pro_text2video

Chinese mainland

540P

CNY 0.3125 per second

No free quota

720P

CNY 0.78125 per second

1080P

CNY 0.9375 per second

vidu/viduq3-turbo_text2video

Chinese mainland

540P

CNY 0.25 per second

No free quota

720P

CNY 0.375 per second

1080P

CNY 0.4375 per second

vidu/viduq2_text2video

Chinese mainland

540P

CNY 0.1125 per second

No free quota

720P

CNY 0.21875 per second

1080P

CNY 0.375 per second

Vidu: Image-to-video (first frame)

Only output is billed. For billing rules, see video generation.

China (Beijing)

Model ID

Deployment scope

Output video resolution

Price

Free quota (Note)

vidu/viduq3-pro_img2video

Chinese mainland

540P

CNY 0.3125 per second

No free quota

720P

CNY 0.78125 per second

1080P

CNY 0.9375 per second

vidu/viduq3-turbo_img2video

Chinese mainland

540P

CNY 0.25 per second

No free quota

720P

CNY 0.375 per second

1080P

CNY 0.4375 per second

vidu/viduq2-pro_img2video

Chinese mainland

540P

CNY 0.15625 per second

No free quota

720P

CNY 0.34375 per second

1080P

CNY 0.71875 per second

vidu/viduq2-turbo_img2video

Chinese mainland

540P

CNY 0.0875 per second

No free quota

720P

CNY 0.25 per second

1080P

CNY 0.46875 per second

vidu/viduq2-pro-fast_img2video

Chinese mainland

720P

CNY 0.1 per second

No free quota

1080P

CNY 0.2 per second

Vidu image-to-video - first and last frames

Only output is billed. For billing rules, see video generation.

North China 2 (Beijing)

Model ID

Region

Output video resolution

Unit price

Free quota(note)

vidu/viduq3-pro_start-end2video

Chinese mainland

540P

CNY 0.3125 per second

No free quota

720P

CNY 0.78125 per second

1080P

CNY 0.9375 per second

vidu/viduq3-turbo_start-end2video

Chinese mainland

540P

CNY 0.25 per second

No free quota

720P

CNY 0.375 per second

1080P

CNY 0.4375 per second

vidu/viduq2-pro_start-end2video

Chinese mainland

540P

CNY 0.15625 per second

No free quota

720P

CNY 0.34375 per second

1080P

CNY 0.71875 per second

vidu/viduq2-turbo_start-end2video

Chinese mainland

540P

CNY 0.0875 per second

No free quota

720P

CNY 0.25 per second

1080P

CNY 0.46875 per second

Vidu - Video generation from reference

Only output is billed. For billing rules, see video generation.

North China 2 (Beijing)

Model ID

Service region

Output resolution

Unit price

Free quota(Note)

vidu/viduq3-mix_reference2video

Chinese mainland

720P

CNY 0.78125 per second

No free quota

1080P

CNY 0.9375 per second

vidu/viduq3_reference2video

Chinese mainland

540P

CNY 0.3125 per second

No free quota

720P

CNY 0.625 per second

1080P

CNY 0.78125 per second

vidu/viduq3-turbo_reference2video

Chinese mainland

540P

CNY 0.15625 per second

No free quota

720P

CNY 0.3125 per second

1080P

CNY 0.40625 per second

vidu/viduq2-pro_reference2video

Chinese mainland

540P

CNY 0.25 per second

No free quota

720P

CNY 0.3125 per second

1080P

CNY 0.78125 per second

vidu/viduq2_reference2video

Chinese mainland

540P

CNY 0.21875 per second

No free quota

720P

CNY 0.28125 per second

1080P

CNY 0.71875 per second

3D model generation - Third-party models

Tripo-3D model generation

Billing is per output request; input is not charged.

China (Beijing)

Model ID

Deployment scope

Task type

Specification

Price

Tripo/Tripo-H3.1

Chinese mainland

text-to-3D

Standard + no texture

CNY 0.7 per request

Standard + SD texture

CNY 1.4 per request

Standard + HD texture

CNY 2.1 per request

HD + no texture

CNY 2.1 per request

HD + SD texture

CNY 2.8 per request

HD + HD texture

CNY 3.5 per request

single-image-to-3D / multi-image-to-3D

Standard + no texture

CNY 1.4 per request

Standard + SD texture

CNY 2.1 per request

Standard + HD texture

CNY 2.8 per request

HD + no texture

CNY 2.8 per request

HD + SD texture

CNY 3.5 per request

HD + HD texture

CNY 4.2 per request

Tripo/Tripo-P1.0

Chinese mainland

text-to-3D

no texture

CNY 2.1 per request

SD texture

CNY 2.8 per request

HD texture

CNY 3.5 per request

single-image-to-3D / multi-image-to-3D

no texture

CNY 2.8 per request

SD texture

CNY 3.5 per request

HD texture

CNY 4.2 per request

Text embedding

You are charged only for input tokens. Output tokens are not charged.

If the model supports batch calls, the unit price for both input and output tokens is 50% of the real-time inference price.

China (Beijing)

Model id

Region

Input price

Free quota(Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

text-embedding-v4

50% discount for batch inference

Chinese mainland

CNY 0.5

1 million tokens

text-embedding-v3

50% discount for batch inference

Chinese mainland

CNY 0.5

500,000 tokens

text-embedding-v2

50% discount for batch inference

Chinese mainland

CNY 0.7

500,000 tokens

text-embedding-v1

50% discount for batch inference

Chinese mainland

CNY 0.7

500,000 tokens

text-embedding-async-v2

Chinese mainland

CNY 0.7

20 million tokens

text-embedding-async-v1

Chinese mainland

CNY 0.7

20 million tokens

Singapore

Model id

Region

Input price

text-embedding-v4

international

CNY 0.514

text-embedding-v3

international

CNY 0.514

Multimodal embedding

You are charged for input tokens. Output is free.

China (Beijing)

Model ID

Service region

Input price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

Text

Image/video

qwen3-vl-embedding

Chinese mainland

CNY 0.7

CNY 1.8

1 million tokens

qwen2.5-vl-embedding

Chinese mainland

1 million tokens

tongyi-embedding-vision-plus

Chinese mainland

CNY 0.5

CNY 0.5

1 million tokens

tongyi-embedding-vision-flash

Chinese mainland

CNY 0.15

CNY 0.15

1 million tokens

multimodal-embedding-v1

Chinese mainland

CNY 0.7

CNY 0.9

1 million tokens

Text reranking

Text reranking models

You are billed for input tokens. There is no charge for output.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Price (per 1M tokens)

Free quota (Note)

Valid for 90 days after activating Alibaba Cloud Model Studio

qwen3-vl-rerank

Chinese mainland

Text input: CNY 0.7

Image input: CNY 1.8

1 million tokens

qwen3-rerank

Chinese mainland

Text input: CNY 0.5

1 million tokens

gte-rerank-v2

Chinese mainland

Text input: CNY 0.8

1 million tokens

Singapore

Model ID

Deployment scope

Price (per 1M tokens)

qwen3-rerank

International

CNY 0.74942

Industry models

Tongyi Farui

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

farui-plus

Chinese mainland

CNY 20

CNY 20

No free quota

Intent understanding

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

tongyi-intent-detect-v3

Chinese mainland

CNY 0.4

CNY 1

1 million tokens

Role play

You are charged for input tokens and output tokens.

Note

The following models offer a free quota only in the Chinese mainland service deployment scope. No free quota is available in other service deployment scopes.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

qwen-plus-character

Discount available with Session Cache

Chinese mainland

CNY 0.8

CNY 2

1 million tokens

qwen-flash-character

Discount available with Session Cache

Chinese mainland

CNY 0.25

CNY 1.5

1 million tokens

qwen-flash-character-2026-02-26

Discount available with Session Cache

Chinese mainland

CNY 0.18

CNY 1.5

1 million tokens

Asia Pacific SE 1 (Singapore)

Model ID

Deployment scope

Input price

Output price

qwen-plus-character

Discount available with Session Cache

International

CNY 3.747

CNY 10.492

qwen-flash-character

Discount available with Session Cache

International

CNY 0.375

CNY 2.998

qwen-plus-character-ja

International

CNY 3.67

CNY 10.275

UI interaction

You are charged for input tokens and output tokens.

China (Beijing)

Model ID

Deployment scope

Input price

Output price

Free quota (Note)

Valid for 90 days after you activate Alibaba Cloud Model Studio

gui-plus

Chinese mainland

CNY 1.5

CNY 4.5

1 million tokens

gui-plus-2026-02-26

Chinese mainland

Error codes

If a model call fails and returns an error message, see Error codes.