Retired GPU-accelerated instance types

更新时间:
复制 MD 格式

This topic describes the details of all retired GPU-accelerated instance types. You can use recommended instance types based on your business requirements on use scenarios and computing capabilities.

Note

GPU compute-optimized bare metal instance family ebmgn7vx

The ebmgn7vx instance family includes the following features:

  • The ebmgn7vx instance family builds on the fourth-generation SHENLONG architecture and uses Alibaba Cloud's latest CIPU architecture. Multiple bare metal instances interconnect through an eRDMA network for RDMA communication at 160 Gbit/s. After enabling eRDMA, you can elastically scale your cluster to meet the demands of large-scale AI training.

  • Compute

    • Processor: 3rd Generation Intel ® Xeon ® Scalable processors (Ice Lake) with a base frequency of 2.9 GHz, an all-core turbo of 3.5 GHz, and PCIe 4.0 support.

  • Storage

    • I/O optimized instance

    • Supports only ESSDs and ESSD AutoPL disks.

  • Network

    • Supports IPv4 and IPv6.

    • Supports physical network cards.

    • Provides high-throughput networking with a packet forwarding rate of 24,000,000 PPS.

    • Supports Elastic RDMA Interface (ERI), which enables RDMA passthrough and accelerated interconnection over a VPC network to increase bandwidth to 160 Gbit/s.

      Note

      For more information about how to use ERI, see Enable eRDMA for enterprise-level instances or Enable eRDMA for GPU-accelerated instances.

  • Use cases

    • Various deep learning training and development workloads.

    • HPC-accelerated computing and simulation.

      Note

      When you run AI training workloads with high communication demands, such as Transformer models, you must enable NVLink for data communication between GPUs. Otherwise, large-scale data transmission over PCIe links may cause unexpected failures, resulting in data corruption. If you are unsure about the communication topology for your training jobs, submit a ticket for technical support from Alibaba Cloud experts.

The following table lists the instance types and specifications of the ebmgn7vx family.

Instance type

vCPU

Memory (GiB)

GPU memory

Network bandwidth (Gbit/s)

Packet forwarding rate (PPS)

IPv4 addresses per NIC

IPv6 addresses per NIC

Physical NICs

Multi-queue (Primary NIC/Secondary ENI)

ENIs

ecs.ebmgn7vx.32xlarge

128

1024

80 GB × 8

160 (80 × 2)

24,000,000

30

30

2

32/32

16

Note
  • You can go to the Instance Types Available for Each Region page to view the instance types available in each region.

  • The ebmgn7vx instance family requires images to use the UEFI boot mode. If you use a custom image, ensure that it supports this mode and its boot mode property is set to UEFI. For more information, see Set the boot mode of a custom image to UEFI by using an API call.

  • For more information about these specifications, see the "Instance type specifications" section in Overview of instance families. Packet forwarding rates vary significantly based on business scenarios. We recommend that you perform business stress tests on instances to choose appropriate instance types.

  • Currently, you cannot retrieve basic CPU monitoring data for ECS Bare Metal Instances. You can obtain this data by installing the CloudMonitor agent. For more information, see Install the CloudMonitor agent.

ebmgn6ia: GPU compute-optimized bare metal instance family

  • Instance family overview:

    • Powered by the third-generation X-Dragon architecture and chip-level fast-path acceleration, this instance family delivers stable, predictable, ultra-high performance in compute, storage, and networking.

    • These instances use NVIDIA T4 GPUs to accelerate graphics and AI workloads. Combined with container technology, they can host over 60 virtual Android terminals, providing hardware-accelerated video transcoding for each display.

  • Use cases:

    • Providing remote app services based on Android, such as always-on cloud applications, cloud gaming, cloud phones, and Android-based web crawlers.

  • Compute:

    • vCPU-to-memory ratio of approximately 1:3.

    • Processor: 2.8 GHz Ampere® Altra® processor with a turbo frequency of 3.0 GHz. The native Arm compute platform provides efficient performance and excellent app compatibility for Android servers.

  • Storage:

  • Network:

    • These instances support IPv4 and IPv6. For information about IPv6 communication, see IPv6 communication.

The following table describes the instance types and key metrics of the ebmgn6ia instance family.

Instance type

vCPU

Memory (GiB)

GPU

GPU memory

Baseline bandwidth (Gbit/s)

Packet rate (pps)

Multi-queue

Elastic network interfaces

Private IPv4s per ENI

IPv6s per ENI

ecs.ebmgn6ia.20xlarge

80

256

NVIDIA T4 × 2

16 GB × 2

32

24,000,000

32

15

10

1

Note

Ampere® Altra® processors have specific requirements for operating system kernel versions. When you create an ECS instance of this instance type, you can directly use Alibaba Cloud Linux 3 or CentOS 8.4 or later images. We recommend that you use Alibaba Cloud Linux 3 images. If you need to use a different operating system, see the Ampere Altra (TM) Linux Kernel Porting Guide. You must patch the kernel on an ECS instance running the desired operating system, create a custom image from the instance, and then use that image to create a new instance of this type.

vGPU-accelerated instance family vgn6i

The vgn6i instance family provides the following features:

  • Compute:

    • Uses NVIDIA T4 GPU accelerators.

    • Instances include vGPUs created by partitioning a physical GPU.

      • vGPUs provide 1/4 or 1/2 of the computing power of a physical NVIDIA Tesla T4 GPU.

      • This provides 4 GB and 8 GB of GPU memory, respectively.

    • A vCPU-to-memory ratio of approximately 1:5.

    • Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake).

  • Storage:

    • I/O optimized instance.

    • Supports only SSD cloud disks and ultra disks.

  • Network:

    • Supports IPv6.

    • Network performance improves with larger instance types.

  • Use cases:

    • Real-time rendering for cloud gaming.

    • Real-time cloud rendering for augmented reality (AR) and virtual reality (VR).

    • Deep learning (DL) and machine learning (ML) inference, ideal for elastically scaling internet applications.

    • Deep learning training and educational environments.

    • Deep learning model experimentation.

The following table lists the instance types and specifications of the vgn6i family.

Instance type

vCPU

Memory (GiB)

GPU

GPU memory

Baseline bandwidth (Gbit/s)

Packet rate (pps)

Queues (primary/secondary)

ENIs

Private IPv4 addresses/ENI

ecs.vgn6i-m4.xlarge

4

23

NVIDIA T4 * 1/4

16 GB * 1/4

2

500,000

4/2

3

10

ecs.vgn6i-m8.2xlarge

10

46

NVIDIA T4 * 1/2

16 GB * 1/2

4

800,000

8/2

4

10

gn5 GPU compute instance family

  • Scenarios:

    • Deep learning.

    • Scientific computing, such as computational fluid dynamics, computational finance, genomics research, and environmental analysis.

    • High-performance computing (HPC), rendering, multimedia encoding and decoding, and other server-side GPU compute workloads.

  • Compute:

    • Features NVIDIA P100 GPUs.

    • Offers multiple processor-to-memory ratios.

    • Processor: 2.5 GHz Intel ® Xeon ® E5-2682 v4 (Broadwell).

  • Storage:

    • Equipped with high-performance NVMe SSD local disks.

    • I/O optimized instances.

    • Supported cloud disk types: SSD cloud disks and Ultra Disks.

  • Network:

    • Supports IPv4 only.

    • Network performance scales with the instance type.

The gn5 family includes the following instance types:

Instance type

vCPUs

Memory (GiB)

GPU

GPU memory

Local storage (GiB)

Baseline bandwidth (Gbit/s)

Network PPS

Multi-queue

ENIs

Private IPv4s per ENI

ecs.gn5-c4g1.xlarge

4

30

NVIDIA P100 × 1

16 GB × 1

440

3

300,000

1

3

10

ecs.gn5-c8g1.2xlarge

8

60

NVIDIA P100 × 1

16 GB × 1

440

3

400,000

1

4

10

ecs.gn5-c4g1.2xlarge

8

60

NVIDIA P100 × 2

16 GB × 2

880

5

1,000,000

4

4

10

ecs.gn5-c8g1.4xlarge

16

120

NVIDIA P100 × 2

16 GB × 2

880

5

1,000,000

4

8

20

ecs.gn5-c28g1.7xlarge

28

112

NVIDIA P100 × 1

16 GB × 1

440

5

2,250,000

7

8

10

ecs.gn5-c8g1.8xlarge

32

240

NVIDIA P100 × 4

16 GB × 4

1,760

10

2,000,000

8

8

20

ecs.gn5-c28g1.14xlarge

56

224

NVIDIA P100 × 2

16 GB × 2

880

10

4,500,000

14

8

20

ecs.gn5-c8g1.14xlarge

54

480

NVIDIA P100 × 8

16 GB × 8

3,520

25

4,000,000

14

8

10

GPU compute instance family gn5i

  • Scenarios: Ideal for server-side GPU computing workloads such as deep learning inference and multimedia encoding and decoding.

  • Compute:

    • Powered by NVIDIA P4 GPUs.

    • A vCPU-to-memory ratio of 1:4.

    • Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell).

  • Storage:

    • I/O optimized instances.

    • Supported cloud disk types: SSD cloud disks and ultra cloud disks.

  • Network:

    • Supports IPv4 and IPv6. For more information about IPv6 communication, see IPv6 communication.

    • Network performance scales with the instance type.

The gn5i instance family includes the following instance types and specifications:

Instance type

vCPU

Memory (GiB)

GPU model

GPU memory

Baseline bandwidth (Gbit/s)

Packet forwarding rate (PPS)

Queues per ENI

ENIs

Private IPv4s per ENI

IPv6s per ENI

ecs.gn5i-c2g1.large

2

8

1 × NVIDIA P4

1 × 8 GB

1

100,000

2

2

6

1

ecs.gn5i-c4g1.xlarge

4

16

1 × NVIDIA P4

1 × 8 GB

1.5

200,000

2

3

10

1

ecs.gn5i-c8g1.2xlarge

8

32

1 × NVIDIA P4

1 × 8 GB

2

400,000

4

4

10

1

ecs.gn5i-c16g1.4xlarge

16

64

1 × NVIDIA P4

1 × 8 GB

3

800,000

4

8

20

1

ecs.gn5i-c16g1.8xlarge

32

128

2 × NVIDIA P4

2 × 8 GB

6

1,200,000

8

8

20

1

ecs.gn5i-c28g1.14xlarge

56

224

2 × NVIDIA P4

2 × 8 GB

10

2,000,000

14

8

20

1

vGPU instance family vgn5i

The vgn5i instance family offers the following features:

  • Compute:

    • Powered by NVIDIA P4 GPU accelerators.

    • Instances include vGPUs created through GPU slicing.

      • vGPUs provide 1/8, 1/4, 1/2, or the full computing power of a single NVIDIA Tesla P4 GPU.

      • vGPUs are available with 1 GB, 2 GB, 4 GB, or 8 GB of GPU memory.

    • A vCPU-to-memory ratio of 1:3.

    • Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell).

  • Storage:

    • I/O-optimized instance.

    • Supports only SSD cloud disks and Ultra Disks.

  • Network:

    • Supports IPv6.

    • Network performance scales with the instance size.

  • Use cases:

    • Real-time cloud rendering for gaming.

    • Real-time cloud rendering for augmented reality (AR) and virtual reality (VR).

    • AI inference for deep learning (DL) and machine learning (ML), ideal for elastically scaling internet applications with AI capabilities.

    • Deep learning training and educational environments.

    • Deep learning model experimentation.

The following table lists the specifications for the vgn5i instance family.

Instance type

vCPUs

Memory (GiB)

GPU

GPU memory

Baseline bandwidth (Gbit/s)

Packet rate (pps)

Multi-queue

ENIs

Private IPv4 addresses

ecs.vgn5i-m1.large

2

6

NVIDIA P4 * 1/8

1 GB

1

300,000

2

2

6

ecs.vgn5i-m2.xlarge

4

12

NVIDIA P4 * 1/4

2 GB

2

500,000

2

3

10

ecs.vgn5i-m4.2xlarge

8

24

NVIDIA P4 * 1/2

4 GB

3

800,000

2

4

10

ecs.vgn5i-m8.4xlarge

16

48

NVIDIA P4 * 1

8 GB

5

1,000,000

4

5

20

Note

The GPU column in the table shows the GPU model and GPU slicing information. GPU slicing divides a physical GPU into multiple slices, and each instance is assigned one slice. For example:

In NVIDIA P4 * 1/8, NVIDIA P4 specifies the GPU card model. 1/8 specifies the GPU slice, which means that one GPU card is divided into 8 slices, and each instance uses one slice.

GPU-accelerated compute-optimized SCC instance family sccgn6e

The sccgn6e instance family offers the following features:

  • Includes all the features of an ECS Bare Metal Instance. For more information, see ECS Bare Metal Instance families.

  • Compute:

    • GPU accelerator:

      • Innovative Volta architecture

      • 32 GB of HBM2 GPU memory

      • 5,120 CUDA Cores

      • 640 Tensor Cores

      • 900 GB/s GPU memory bandwidth

      • A single GPU supports six bidirectional NVLink links, each with a unidirectional bandwidth of 25 GB/s, for a total bidirectional bandwidth of 300 GB/s.

    • vCPU-to-memory ratio of 1:8

    • Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake)

  • Storage:

    • I/O optimized instance

    • Supports only ESSDs, ESSD AutoPL disks, standard SSDs, and ultra disks

    • Supports CPFS

  • Network:

    • Supports IPv6

    • Supports VPCs

    • Supports RoCE v2 networks for low-latency RDMA.

  • Use cases:

    • Ultra-large-scale machine learning cluster training

    • Large-scale high-performance scientific computing and simulation

    • Large-scale data analytics, batch computing, and video encoding

The following table lists the specifications of the sccgn6e instance family.

Instance type

vCPU

Memory (GiB)

GPU

GPU memory (GB)

Network bandwidth (Gbit/s)

Packet rate (PPS)

RoCE bandwidth (Gbit/s)

Multi-queue

ENIs

Private IPv4 addresses

ecs.sccgn6e.24xlarge

96

768.0

NVIDIA V100 × 8

32 GB × 8

32

4,800,000

50

8

32

10

GPU compute-optimized SCC instance family sccgn6

The sccgn6 instance family includes the following features:

  • Includes all features of an ECS Bare Metal Instance. For more information, see ECS Bare Metal Instance families.

  • Compute:

    • GPU accelerator: NVIDIA V100 (SXM2-based)

      • Innovative Volta architecture

      • 16 GB HBM2 GPU memory

      • 5,120 CUDA Cores

      • 640 Tensor Cores

      • 900 GB/s GPU memory bandwidth

      • Each GPU supports six bidirectional NVLink links, each providing 25 GB/s of bandwidth in a single direction for a total of 300 GB/s of bidirectional bandwidth.

    • A vCPU-to-memory ratio of 1:4

    • Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake)

  • Storage:

    • I/O optimized instance

    • Supports only ESSD cloud disk, ESSD AutoPL cloud disk, Standard SSD, and Ultra Disk.

    • Supports high-performance CPFS.

  • Network:

    • Supports IPv6.

    • Supports VPCs.

    • Enables low-latency RDMA communication through RoCE v2 networks.

  • Use cases:

    • Training for ultra-large-scale machine learning clusters

    • Large-scale high-performance scientific computing and simulation

    • Large-scale data analytics, batch computing, and video encoding

The following table describes the instance types and key metrics for the sccgn6 instance family.

Instance type

vCPU

Memory (GiB)

GPU

Network baseline bandwidth (Gbit/s)

Packet forwarding rate (PPS)

RoCE network bandwidth (Gbit/s)

Multi-queue

ENIs

Private IPv4 addresses

ecs.sccgn6.24xlarge

96

384.0

NVIDIA V100 * 8

30

4,500,000

50

8

32

10

GPU compute-optimized instance family sccgn6ne

The sccgn6ne instance family offers the following features:

  • Provides all the features of an ECS Bare Metal Instance.

  • Compute:

    • GPU accelerator: V100 (SXM2 form factor)

      • Innovative Volta architecture

      • 32 GB of HBM2 GPU memory

      • 5,120 CUDA Cores

      • 640 Tensor Cores

      • 900 GB/s GPU memory bandwidth

      • Six NVLink links at 25 GB/s each, for a total bandwidth of 300 GB/s.

    • A vCPU-to-memory ratio of 1:4

    • Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake).

  • Storage:

    • I/O optimized instance

    • Support for ESSDs, Standard SSDs, and Ultra Disks.

    • Support for the high-performance Cloud Paralleled File System (CPFS).

  • Network:

    • Support for IPv6.

    • Support for Virtual Private Cloud (VPC).

    • Support for RoCE v2.

  • Use cases:

    • Ultra-large-scale machine learning cluster training

    • Large-scale high-performance scientific and simulation computing

    • Large-scale data analytics, batch computing, and video encoding

The following table lists the specifications of the sccgn6ne instance family.

Instance type

vCPUs

Memory (GiB)

GPU

GPU memory

Baseline bandwidth (Gbit/s)

Packet forwarding rate (pps)

RoCE bandwidth (Gbit/s)

Queues per ENI

ENIs

Private IPv4 addresses per ENI

ecs.sccgn6ne.24xlarge

96

768.0

8 × NVIDIA V100

8 × 32 GB

32.0

4,800,000

100

16

8

20

Gn4 GPU compute instance family

The gn4 instance family provides the following features:

  • Powered by NVIDIA M40 GPUs

  • Compute:

    • Various vCPU-to-memory ratios

    • Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell)

  • Storage:

    • I/O optimized instance

    • Supports only Standard SSD and Ultra Disk

  • Network:

    • Larger instance types provide higher network performance.

  • Use cases:

    • Deep learning

    • Scientific computing, including computational fluid dynamics, computational finance, genomics research, and environmental analysis

    • High-performance computing, rendering, multimedia encoding and decoding, and other server-side GPU computing workloads

The following table describes the instance types and specifications of the gn4 instance family.

Instance type

vCPU

Memory (GiB)

GPU model

GPU memory

Network bandwidth (Gbit/s)

Packet forwarding rate (pps)

Queues

ENIs (max)

Private IPv4s/ENI (max)

ecs.gn4-c4g1.xlarge

4

30.0

NVIDIA M40 * 1

12GB * 1

3.0

300,000

1

3

10

ecs.gn4-c8g1.2xlarge

8

30.0

NVIDIA M40 * 1

12GB * 1

3.0

400,000

1

4

10

ecs.gn4.8xlarge

32

48.0

NVIDIA M40 * 1

12GB * 1

6.0

800,000

3

8

20

ecs.gn4-c4g1.2xlarge

8

60.0

NVIDIA M40 * 2

12GB * 2

5.0

500,000

1

4

10

ecs.gn4-c8g1.4xlarge

16

60.0

NVIDIA M40 * 2

12GB * 2

5.0

500,000

1

8

20

ecs.gn4.14xlarge

56

96.0

NVIDIA M40 * 2

12GB * 2

10.0

1,200,000

4

8

20

Ga1 GPU visual computing instance family

The ga1 instance family provides the following features:

  • Powered by AMD S7150 GPU cards.

  • Features high-performance NVMe SSD local disks.

  • Compute:

    • A vCPU-to-memory ratio of 1:2.5.

    • Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell).

  • Storage:

    • I/O optimized instances.

    • Supports only Standard SSD and Ultra Disk.

  • Network:

    • Network performance scales with the instance type.

  • Use cases:

    • Rendering and multimedia encoding/decoding.

    • Machine learning, high-performance computing, and high-performance databases.

    • Other server-side workloads requiring powerful parallel floating-point computing capabilities.

The following table lists the specifications of the ga1 instance types.

Instance type

vCPU

Memory (GiB)

Local storage (GiB)

GPU

GPU memory

Baseline bandwidth (Gbit/s)

Packet forwarding rate (PPS)

Multi-queue

Elastic network interfaces

Private IPv4 addresses

ecs.ga1.xlarge

4

10.0

1 × 87

AMD S7150 × 1/4

8 GB × 1/4

1.0

200,000

1

3

10

ecs.ga1.2xlarge

8

20.0

1 × 175

AMD S7150 × 1/2

8 GB × 1/2

1.5

300,000

1

4

10

ecs.ga1.4xlarge

16

40.0

1 × 350

AMD S7150 × 1

8 GB × 1

3.0

500,000

2

8

20

ecs.ga1.8xlarge

32

80.0

1 × 700

AMD S7150 × 2

8 GB × 2

6.0

800,000

3

8

20

ecs.ga1.14xlarge

56

160.0

1 × 1400

AMD S7150 × 4

8 GB × 4

10.0

1,200,000

4

8

20