Product Announcement: Function Compute introduces a billable item for shallow hibernation (formerly idle) GPU usage

更新时间:
复制 MD 格式

Starting from 00:00 on June 14, 2024, Function Compute introduced idle billing for GPU usage. When a provisioned GPU-accelerated instance is waiting between requests, its GPU resources are frozen and billed at USD 0.000007 per compute unit (CU) — significantly less than the active rate of USD 0.000018/CU. This reduces costs for AI inference workloads with bursty or periodic traffic patterns, where instances spend meaningful time idle between invocations.

How it works

Enable idle mode when you configure a provisioned instance policy for your function. Once enabled:

  • GPU resources are frozen when no requests are being processed.

  • The instance enters the idle state and is billed at the idle rate.

For setup instructions, see Configure provisioned instances. For pricing details, see CNY 0.00004/CUBilling overview.

Billing example

The following example shows the cost difference for a Stable Diffusion application running on a 16 GB Tesla GPU card, with provisioned instances reserved for 1 hour. The function receives 1,800 invocations during that hour, each taking 1 second to complete, leaving 1,800 seconds of idle time.

Summary

Idle mode offIdle mode on
Active GPU feeUSD 1.0368USD 0.5184
Idle GPU feeUSD 0USD 0.2016
TotalUSD 1.0368USD 0.72
Savings--~30.6%

Idle mode off

Without idle mode, GPU-accelerated instances stay fully active for the entire provisioned duration, regardless of whether requests are being processed.

ItemCalculationResult
Active GPU usage16 GB x 3,600 s57,600 CUs (Tier 0)
FeeUSD 0.000018/CU x 57,600 CUsUSD 1.0368

Fee = Tier 0 unit price × Usage = CNY 0.00011/CU × 57,600 CU = CNY 6.336

For Tier 0 pricing details, see Billing overview.

Idle mode on

With idle mode, billing splits into active usage (while processing requests) and idle usage (while waiting).

  • Active duration: 1,800 s (1,800 invocations x 1 s/invocation)

  • Idle duration: 1,800 s (3,600 s provisioned duration - 1,800 s active duration)

ItemCalculationResult
Active GPU usage16 GB x 1,800 s28,800 CUs (Tier 0)
Idle GPU usage16 GB x 1,800 s28,800 CUs
Active GPU feeUSD 0.000018/CU x 28,800 CUsUSD 0.5184
Idle GPU feeUSD 0.000007/CU x 28,800 CUsUSD 0.2016
Total feeUSD 0.72

Fee = (Tier 0 unit price × Active GPU usage) + (Shallow hibernation (formerly idle) GPU unit price × Shallow hibernation (formerly idle) GPU usage) = (CNY 0.00011/CU × 28,800 CU) + (CNY 0.00004/CU × 28,800 CU) = CNY 4.32

For Tier 0 pricing details, see Billing overview.

GPU compute unit mapping

Function Compute provides GPU-accelerated instances in two GPU series. For both series, 1 CU equals 1 GB-second.

GPU card typeCUGB-second
Tesla series11
Ampere series11