This topic describes the details of all retired GPU-accelerated instance types. You can use recommended instance types based on your business requirements on use scenarios and computing capabilities.
For more information about available instance types, see Overview of instance families.
If you have an instance of a retired instance type, we recommend that you change it to an available one. For more information about which instance types can be changed to which other types, see Instance types and families that support instance type changes.
ebmgn7vx, GPU-accelerated compute-optimized ECS Bare Metal Instance family
ebmgn6ia, GPU-accelerated compute-optimized ECS Bare Metal Instance family
sccgn6e, GPU-accelerated compute-optimized SCC instance family
sccgn6, GPU-accelerated compute-optimized SCC instance family
sccgn6ne, GPU-accelerated compute-optimized SCC instance family
GPU compute-optimized bare metal instance family ebmgn7vx
The ebmgn7vx instance family includes the following features:
The ebmgn7vx instance family builds on the fourth-generation SHENLONG architecture and uses Alibaba Cloud's latest CIPU architecture. Multiple bare metal instances interconnect through an eRDMA network for RDMA communication at 160 Gbit/s. After enabling eRDMA, you can elastically scale your cluster to meet the demands of large-scale AI training.
Compute
Processor: 3rd Generation Intel ® Xeon ® Scalable processors (Ice Lake) with a base frequency of 2.9 GHz, an all-core turbo of 3.5 GHz, and PCIe 4.0 support.
Storage
I/O optimized instance
-
Supports only ESSDs and ESSD AutoPL disks.
Network
Supports IPv4 and IPv6.
Supports physical network cards.
Provides high-throughput networking with a packet forwarding rate of 24,000,000 PPS.
Supports Elastic RDMA Interface (ERI), which enables RDMA passthrough and accelerated interconnection over a VPC network to increase bandwidth to 160 Gbit/s.
NoteFor more information about how to use ERI, see Enable eRDMA for enterprise-level instances or Enable eRDMA for GPU-accelerated instances.
Use cases
Various deep learning training and development workloads.
HPC-accelerated computing and simulation.
NoteWhen you run AI training workloads with high communication demands, such as Transformer models, you must enable NVLink for data communication between GPUs. Otherwise, large-scale data transmission over PCIe links may cause unexpected failures, resulting in data corruption. If you are unsure about the communication topology for your training jobs, submit a ticket for technical support from Alibaba Cloud experts.
The following table lists the instance types and specifications of the ebmgn7vx family.
Instance type | vCPU | Memory (GiB) | GPU memory | Network bandwidth (Gbit/s) | Packet forwarding rate (PPS) | IPv4 addresses per NIC | IPv6 addresses per NIC | Physical NICs | Multi-queue (Primary NIC/Secondary ENI) | ENIs |
ecs.ebmgn7vx.32xlarge | 128 | 1024 | 80 GB × 8 | 160 (80 × 2) | 24,000,000 | 30 | 30 | 2 | 32/32 | 16 |
-
You can go to the Instance Types Available for Each Region page to view the instance types available in each region.
The ebmgn7vx instance family requires images to use the UEFI boot mode. If you use a custom image, ensure that it supports this mode and its boot mode property is set to UEFI. For more information, see Set the boot mode of a custom image to UEFI by using an API call.
For more information about these specifications, see the "Instance type specifications" section in Overview of instance families. Packet forwarding rates vary significantly based on business scenarios. We recommend that you perform business stress tests on instances to choose appropriate instance types.
Currently, you cannot retrieve basic CPU monitoring data for ECS Bare Metal Instances. You can obtain this data by installing the CloudMonitor agent. For more information, see Install the CloudMonitor agent.
ebmgn6ia: GPU compute-optimized bare metal instance family
Instance family overview:
Powered by the third-generation X-Dragon architecture and chip-level fast-path acceleration, this instance family delivers stable, predictable, ultra-high performance in compute, storage, and networking.
These instances use NVIDIA T4 GPUs to accelerate graphics and AI workloads. Combined with container technology, they can host over 60 virtual Android terminals, providing hardware-accelerated video transcoding for each display.
Use cases:
Providing remote app services based on Android, such as always-on cloud applications, cloud gaming, cloud phones, and Android-based web crawlers.
Compute:
vCPU-to-memory ratio of approximately 1:3.
Processor: 2.8 GHz Ampere® Altra® processor with a turbo frequency of 3.0 GHz. The native Arm compute platform provides efficient performance and excellent app compatibility for Android servers.
Storage:
-
These are I/O optimized instances.
-
Supported disk types: elastic ephemeral disk, ESSDs, ESSD AutoPL disks, and Regional ESSDs. For more information, see Block Storage Overview.
-
Network:
These instances support IPv4 and IPv6. For information about IPv6 communication, see IPv6 communication.
The following table describes the instance types and key metrics of the ebmgn6ia instance family.
Instance type | vCPU | Memory (GiB) | GPU | GPU memory | Baseline bandwidth (Gbit/s) | Packet rate (pps) | Multi-queue | Elastic network interfaces | Private IPv4s per ENI | IPv6s per ENI |
ecs.ebmgn6ia.20xlarge | 80 | 256 | NVIDIA T4 × 2 | 16 GB × 2 | 32 | 24,000,000 | 32 | 15 | 10 | 1 |
Ampere® Altra® processors have specific requirements for operating system kernel versions. When you create an ECS instance of this instance type, you can directly use Alibaba Cloud Linux 3 or CentOS 8.4 or later images. We recommend that you use Alibaba Cloud Linux 3 images. If you need to use a different operating system, see the Ampere Altra (TM) Linux Kernel Porting Guide. You must patch the kernel on an ECS instance running the desired operating system, create a custom image from the instance, and then use that image to create a new instance of this type.
vGPU-accelerated instance family vgn6i
The vgn6i instance family provides the following features:
Compute:
Uses NVIDIA T4 GPU accelerators.
Instances include vGPUs created by partitioning a physical GPU.
vGPUs provide 1/4 or 1/2 of the computing power of a physical NVIDIA Tesla T4 GPU.
This provides 4 GB and 8 GB of GPU memory, respectively.
A vCPU-to-memory ratio of approximately 1:5.
Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake).
Storage:
I/O optimized instance.
Supports only SSD cloud disks and ultra disks.
Network:
Supports IPv6.
Network performance improves with larger instance types.
Use cases:
Real-time rendering for cloud gaming.
Real-time cloud rendering for augmented reality (AR) and virtual reality (VR).
Deep learning (DL) and machine learning (ML) inference, ideal for elastically scaling internet applications.
Deep learning training and educational environments.
Deep learning model experimentation.
The following table lists the instance types and specifications of the vgn6i family.
Instance type | vCPU | Memory (GiB) | GPU | GPU memory | Baseline bandwidth (Gbit/s) | Packet rate (pps) | Queues (primary/secondary) | ENIs | Private IPv4 addresses/ENI |
ecs.vgn6i-m4.xlarge | 4 | 23 | NVIDIA T4 * 1/4 | 16 GB * 1/4 | 2 | 500,000 | 4/2 | 3 | 10 |
ecs.vgn6i-m8.2xlarge | 10 | 46 | NVIDIA T4 * 1/2 | 16 GB * 1/2 | 4 | 800,000 | 8/2 | 4 | 10 |
gn5 GPU compute instance family
Scenarios:
Deep learning.
Scientific computing, such as computational fluid dynamics, computational finance, genomics research, and environmental analysis.
High-performance computing (HPC), rendering, multimedia encoding and decoding, and other server-side GPU compute workloads.
Compute:
Features NVIDIA P100 GPUs.
Offers multiple processor-to-memory ratios.
Processor: 2.5 GHz Intel ® Xeon ® E5-2682 v4 (Broadwell).
Storage:
Equipped with high-performance NVMe SSD local disks.
I/O optimized instances.
Supported cloud disk types: SSD cloud disks and Ultra Disks.
Network:
Supports IPv4 only.
Network performance scales with the instance type.
The gn5 family includes the following instance types:
Instance type | vCPUs | Memory (GiB) | GPU | GPU memory | Local storage (GiB) | Baseline bandwidth (Gbit/s) | Network PPS | Multi-queue | ENIs | Private IPv4s per ENI |
ecs.gn5-c4g1.xlarge | 4 | 30 | NVIDIA P100 × 1 | 16 GB × 1 | 440 | 3 | 300,000 | 1 | 3 | 10 |
ecs.gn5-c8g1.2xlarge | 8 | 60 | NVIDIA P100 × 1 | 16 GB × 1 | 440 | 3 | 400,000 | 1 | 4 | 10 |
ecs.gn5-c4g1.2xlarge | 8 | 60 | NVIDIA P100 × 2 | 16 GB × 2 | 880 | 5 | 1,000,000 | 4 | 4 | 10 |
ecs.gn5-c8g1.4xlarge | 16 | 120 | NVIDIA P100 × 2 | 16 GB × 2 | 880 | 5 | 1,000,000 | 4 | 8 | 20 |
ecs.gn5-c28g1.7xlarge | 28 | 112 | NVIDIA P100 × 1 | 16 GB × 1 | 440 | 5 | 2,250,000 | 7 | 8 | 10 |
ecs.gn5-c8g1.8xlarge | 32 | 240 | NVIDIA P100 × 4 | 16 GB × 4 | 1,760 | 10 | 2,000,000 | 8 | 8 | 20 |
ecs.gn5-c28g1.14xlarge | 56 | 224 | NVIDIA P100 × 2 | 16 GB × 2 | 880 | 10 | 4,500,000 | 14 | 8 | 20 |
ecs.gn5-c8g1.14xlarge | 54 | 480 | NVIDIA P100 × 8 | 16 GB × 8 | 3,520 | 25 | 4,000,000 | 14 | 8 | 10 |
GPU compute instance family gn5i
Scenarios: Ideal for server-side GPU computing workloads such as deep learning inference and multimedia encoding and decoding.
Compute:
Powered by NVIDIA P4 GPUs.
A vCPU-to-memory ratio of 1:4.
Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell).
Storage:
I/O optimized instances.
Supported cloud disk types: SSD cloud disks and ultra cloud disks.
Network:
Supports IPv4 and IPv6. For more information about IPv6 communication, see IPv6 communication.
Network performance scales with the instance type.
The gn5i instance family includes the following instance types and specifications:
Instance type | vCPU | Memory (GiB) | GPU model | GPU memory | Baseline bandwidth (Gbit/s) | Packet forwarding rate (PPS) | Queues per ENI | ENIs | Private IPv4s per ENI | IPv6s per ENI |
ecs.gn5i-c2g1.large | 2 | 8 | 1 × NVIDIA P4 | 1 × 8 GB | 1 | 100,000 | 2 | 2 | 6 | 1 |
ecs.gn5i-c4g1.xlarge | 4 | 16 | 1 × NVIDIA P4 | 1 × 8 GB | 1.5 | 200,000 | 2 | 3 | 10 | 1 |
ecs.gn5i-c8g1.2xlarge | 8 | 32 | 1 × NVIDIA P4 | 1 × 8 GB | 2 | 400,000 | 4 | 4 | 10 | 1 |
ecs.gn5i-c16g1.4xlarge | 16 | 64 | 1 × NVIDIA P4 | 1 × 8 GB | 3 | 800,000 | 4 | 8 | 20 | 1 |
ecs.gn5i-c16g1.8xlarge | 32 | 128 | 2 × NVIDIA P4 | 2 × 8 GB | 6 | 1,200,000 | 8 | 8 | 20 | 1 |
ecs.gn5i-c28g1.14xlarge | 56 | 224 | 2 × NVIDIA P4 | 2 × 8 GB | 10 | 2,000,000 | 14 | 8 | 20 | 1 |
vGPU instance family vgn5i
The vgn5i instance family offers the following features:
Compute:
Powered by NVIDIA P4 GPU accelerators.
Instances include vGPUs created through GPU slicing.
vGPUs provide 1/8, 1/4, 1/2, or the full computing power of a single NVIDIA Tesla P4 GPU.
vGPUs are available with 1 GB, 2 GB, 4 GB, or 8 GB of GPU memory.
A vCPU-to-memory ratio of 1:3.
Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell).
Storage:
I/O-optimized instance.
Supports only SSD cloud disks and Ultra Disks.
Network:
Supports IPv6.
Network performance scales with the instance size.
Use cases:
Real-time cloud rendering for gaming.
Real-time cloud rendering for augmented reality (AR) and virtual reality (VR).
AI inference for deep learning (DL) and machine learning (ML), ideal for elastically scaling internet applications with AI capabilities.
Deep learning training and educational environments.
Deep learning model experimentation.
The following table lists the specifications for the vgn5i instance family.
Instance type | vCPUs | Memory (GiB) | GPU | GPU memory | Baseline bandwidth (Gbit/s) | Packet rate (pps) | Multi-queue | ENIs | Private IPv4 addresses |
ecs.vgn5i-m1.large | 2 | 6 | NVIDIA P4 * 1/8 | 1 GB | 1 | 300,000 | 2 | 2 | 6 |
ecs.vgn5i-m2.xlarge | 4 | 12 | NVIDIA P4 * 1/4 | 2 GB | 2 | 500,000 | 2 | 3 | 10 |
ecs.vgn5i-m4.2xlarge | 8 | 24 | NVIDIA P4 * 1/2 | 4 GB | 3 | 800,000 | 2 | 4 | 10 |
ecs.vgn5i-m8.4xlarge | 16 | 48 | NVIDIA P4 * 1 | 8 GB | 5 | 1,000,000 | 4 | 5 | 20 |
The GPU column in the table shows the GPU model and GPU slicing information. GPU slicing divides a physical GPU into multiple slices, and each instance is assigned one slice. For example:
In NVIDIA P4 * 1/8, NVIDIA P4 specifies the GPU card model. 1/8 specifies the GPU slice, which means that one GPU card is divided into 8 slices, and each instance uses one slice.
GPU-accelerated compute-optimized SCC instance family sccgn6e
The sccgn6e instance family offers the following features:
Includes all the features of an ECS Bare Metal Instance. For more information, see ECS Bare Metal Instance families.
Compute:
GPU accelerator:
Innovative Volta architecture
32 GB of HBM2 GPU memory
5,120 CUDA Cores
640 Tensor Cores
900 GB/s GPU memory bandwidth
A single GPU supports six bidirectional NVLink links, each with a unidirectional bandwidth of 25 GB/s, for a total bidirectional bandwidth of 300 GB/s.
vCPU-to-memory ratio of 1:8
Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake)
Storage:
I/O optimized instance
Supports only ESSDs, ESSD AutoPL disks, standard SSDs, and ultra disks
Supports CPFS
Network:
Supports IPv6
Supports VPCs
Supports RoCE v2 networks for low-latency RDMA.
Use cases:
Ultra-large-scale machine learning cluster training
Large-scale high-performance scientific computing and simulation
Large-scale data analytics, batch computing, and video encoding
The following table lists the specifications of the sccgn6e instance family.
Instance type | vCPU | Memory (GiB) | GPU | GPU memory (GB) | Network bandwidth (Gbit/s) | Packet rate (PPS) | RoCE bandwidth (Gbit/s) | Multi-queue | ENIs | Private IPv4 addresses |
ecs.sccgn6e.24xlarge | 96 | 768.0 | NVIDIA V100 × 8 | 32 GB × 8 | 32 | 4,800,000 | 50 | 8 | 32 | 10 |
GPU compute-optimized SCC instance family sccgn6
The sccgn6 instance family includes the following features:
Includes all features of an ECS Bare Metal Instance. For more information, see ECS Bare Metal Instance families.
Compute:
GPU accelerator: NVIDIA V100 (SXM2-based)
Innovative Volta architecture
16 GB HBM2 GPU memory
5,120 CUDA Cores
640 Tensor Cores
900 GB/s GPU memory bandwidth
Each GPU supports six bidirectional NVLink links, each providing 25 GB/s of bandwidth in a single direction for a total of 300 GB/s of bidirectional bandwidth.
A vCPU-to-memory ratio of 1:4
Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake)
Storage:
I/O optimized instance
Supports only ESSD cloud disk, ESSD AutoPL cloud disk, Standard SSD, and Ultra Disk.
Supports high-performance CPFS.
Network:
Supports IPv6.
Supports VPCs.
Enables low-latency RDMA communication through RoCE v2 networks.
Use cases:
Training for ultra-large-scale machine learning clusters
Large-scale high-performance scientific computing and simulation
Large-scale data analytics, batch computing, and video encoding
The following table describes the instance types and key metrics for the sccgn6 instance family.
Instance type | vCPU | Memory (GiB) | GPU | Network baseline bandwidth (Gbit/s) | Packet forwarding rate (PPS) | RoCE network bandwidth (Gbit/s) | Multi-queue | ENIs | Private IPv4 addresses |
ecs.sccgn6.24xlarge | 96 | 384.0 | NVIDIA V100 * 8 | 30 | 4,500,000 | 50 | 8 | 32 | 10 |
GPU compute-optimized instance family sccgn6ne
The sccgn6ne instance family offers the following features:
Provides all the features of an ECS Bare Metal Instance.
Compute:
GPU accelerator: V100 (SXM2 form factor)
Innovative Volta architecture
32 GB of HBM2 GPU memory
5,120 CUDA Cores
640 Tensor Cores
900 GB/s GPU memory bandwidth
Six NVLink links at 25 GB/s each, for a total bandwidth of 300 GB/s.
A vCPU-to-memory ratio of 1:4
Processor: 2.5 GHz Intel® Xeon® Platinum 8163 (Skylake).
Storage:
I/O optimized instance
Support for ESSDs, Standard SSDs, and Ultra Disks.
Support for the high-performance Cloud Paralleled File System (CPFS).
Network:
Support for IPv6.
Support for Virtual Private Cloud (VPC).
Support for RoCE v2.
Use cases:
Ultra-large-scale machine learning cluster training
Large-scale high-performance scientific and simulation computing
Large-scale data analytics, batch computing, and video encoding
The following table lists the specifications of the sccgn6ne instance family.
Instance type | vCPUs | Memory (GiB) | GPU | GPU memory | Baseline bandwidth (Gbit/s) | Packet forwarding rate (pps) | RoCE bandwidth (Gbit/s) | Queues per ENI | ENIs | Private IPv4 addresses per ENI |
ecs.sccgn6ne.24xlarge | 96 | 768.0 | 8 × NVIDIA V100 | 8 × 32 GB | 32.0 | 4,800,000 | 100 | 16 | 8 | 20 |
Gn4 GPU compute instance family
The gn4 instance family provides the following features:
Powered by NVIDIA M40 GPUs
Compute:
Various vCPU-to-memory ratios
Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell)
Storage:
I/O optimized instance
Supports only Standard SSD and Ultra Disk
Network:
Larger instance types provide higher network performance.
Use cases:
Deep learning
Scientific computing, including computational fluid dynamics, computational finance, genomics research, and environmental analysis
High-performance computing, rendering, multimedia encoding and decoding, and other server-side GPU computing workloads
The following table describes the instance types and specifications of the gn4 instance family.
Instance type | vCPU | Memory (GiB) | GPU model | GPU memory | Network bandwidth (Gbit/s) | Packet forwarding rate (pps) | Queues | ENIs (max) | Private IPv4s/ENI (max) |
ecs.gn4-c4g1.xlarge | 4 | 30.0 | NVIDIA M40 * 1 | 12GB * 1 | 3.0 | 300,000 | 1 | 3 | 10 |
ecs.gn4-c8g1.2xlarge | 8 | 30.0 | NVIDIA M40 * 1 | 12GB * 1 | 3.0 | 400,000 | 1 | 4 | 10 |
ecs.gn4.8xlarge | 32 | 48.0 | NVIDIA M40 * 1 | 12GB * 1 | 6.0 | 800,000 | 3 | 8 | 20 |
ecs.gn4-c4g1.2xlarge | 8 | 60.0 | NVIDIA M40 * 2 | 12GB * 2 | 5.0 | 500,000 | 1 | 4 | 10 |
ecs.gn4-c8g1.4xlarge | 16 | 60.0 | NVIDIA M40 * 2 | 12GB * 2 | 5.0 | 500,000 | 1 | 8 | 20 |
ecs.gn4.14xlarge | 56 | 96.0 | NVIDIA M40 * 2 | 12GB * 2 | 10.0 | 1,200,000 | 4 | 8 | 20 |
Ga1 GPU visual computing instance family
The ga1 instance family provides the following features:
Powered by AMD S7150 GPU cards.
Features high-performance NVMe SSD local disks.
Compute:
A vCPU-to-memory ratio of 1:2.5.
Processor: 2.5 GHz Intel® Xeon® E5-2682 v4 (Broadwell).
Storage:
I/O optimized instances.
Supports only Standard SSD and Ultra Disk.
Network:
Network performance scales with the instance type.
Use cases:
Rendering and multimedia encoding/decoding.
Machine learning, high-performance computing, and high-performance databases.
Other server-side workloads requiring powerful parallel floating-point computing capabilities.
The following table lists the specifications of the ga1 instance types.
Instance type | vCPU | Memory (GiB) | Local storage (GiB) | GPU | GPU memory | Baseline bandwidth (Gbit/s) | Packet forwarding rate (PPS) | Multi-queue | Elastic network interfaces | Private IPv4 addresses |
ecs.ga1.xlarge | 4 | 10.0 | 1 × 87 | AMD S7150 × 1/4 | 8 GB × 1/4 | 1.0 | 200,000 | 1 | 3 | 10 |
ecs.ga1.2xlarge | 8 | 20.0 | 1 × 175 | AMD S7150 × 1/2 | 8 GB × 1/2 | 1.5 | 300,000 | 1 | 4 | 10 |
ecs.ga1.4xlarge | 16 | 40.0 | 1 × 350 | AMD S7150 × 1 | 8 GB × 1 | 3.0 | 500,000 | 2 | 8 | 20 |
ecs.ga1.8xlarge | 32 | 80.0 | 1 × 700 | AMD S7150 × 2 | 8 GB × 2 | 6.0 | 800,000 | 3 | 8 | 20 |
ecs.ga1.14xlarge | 56 | 160.0 | 1 × 1400 | AMD S7150 × 4 | 8 GB × 4 | 10.0 | 1,200,000 | 4 | 8 | 20 |