Cloud-native database PolarDB

PolarDB is a cloud-native relational database from Alibaba Cloud. Built on a storage-compute decoupled architecture, it combines software and hardware optimization to deliver high performance, high availability, security, massive storage capacity, and elastic scaling within seconds. PolarDB is 100% compatible with MySQL and PostgreSQL, highly compatible with Oracle syntax, and supports both centralized and distributed deployments. Compared to self-managed databases, PolarDB delivers up to 6x the transaction performance and up to 400x the analytical performance at 50% of the total cost of ownership (TCO).

Choose the PolarDB engine that fits your application. Each engine maintains full compatibility with its ecosystem.

PolarDB for MySQL

  • Ecosystem compatibility: 100% compatible with MySQL

  • Service architecture: Shared storage, compute-storage decoupled

  • Product form: Public cloud, Apsara Stack Enterprise Edition, DBStack

PolarDB for PostgreSQL

  • Ecosystem compatibility: 100% compatible with PostgreSQL, highly compatible with Oracle syntax

  • Service architecture: Shared storage, compute-storage decoupled

PolarDB Distributed Edition

  • Service architecture: Shared-nothing, integrated centralized and distributed architecture

Benefits

Ecosystem compatibility

  • 100% compatible with MySQL and PostgreSQL ecosystems.

  • Highly compatible with Oracle syntax, with end-to-end migration solutions. Services remain available during migration, an approach proven by over 500 customers.

  • PolarDB integrates with existing systems for smooth upgrades in both centralized and distributed deployments.

High performance

  • Up to 6x the transactional processing (TP) performance of open-source databases.

  • Up to 400x online analytical processing (OLAP) performance, powered by parallel query and In-Memory Column Index (IMCI).

High availability

  • Single-zone, dual-zone, three-zone (RPO = 0), and cross-region multi-level high availability configurations.

  • Up to 99.995% SLA.
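To make the availability figure concrete, the following sketch converts an SLA percentage into a downtime budget. It is plain arithmetic, not an official calculation: how downtime is measured and which billing period applies are defined by the SLA terms, and the 30-day month and the function name `downtime_budget_minutes` are assumptions made for illustration.

```python
def downtime_budget_minutes(sla_percent: float, days: int = 30) -> float:
    """Maximum downtime, in minutes, that still meets the SLA over `days` days."""
    total_minutes = days * 24 * 60
    return total_minutes * (1 - sla_percent / 100)


# Assumption: a 30-day month and a 365-day year as measurement windows.
monthly = downtime_budget_minutes(99.995)
yearly = downtime_budget_minutes(99.995, days=365)
print(f"{monthly:.2f} min/month, {yearly:.2f} min/year")
```

Under these assumptions, 99.995% corresponds to roughly 2.16 minutes of downtime per 30-day month, or about 26.28 minutes per year.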

Mass storage

  • PolarDB for MySQL and PolarDB for PostgreSQL support up to 500 TB of storage.

  • PolarDB Distributed Edition supports petabyte-scale storage.

Easy to scale

  • The intelligent proxy supports multiple read consistency levels.

  • Serverless dynamic and elastic scaling, and distributed linear scaling.

  • Low-latency physical replication for efficient, stable inter-node data transfer.

Security

  • Access control: Resource Access Management (RAM) users, IP address whitelists, security groups, and virtual private clouds (VPCs).

  • Data security: Transparent data encryption (TDE), backup and recovery, and flashback query.

  • Transmission security: Secure Sockets Layer (SSL) encryption.

Service architecture

PolarDB for MySQL

PolarDB for MySQL uses a cloud-native, storage-compute decoupled architecture that combines commercial-grade stability, performance, and scalability with open-source simplicity and rapid iteration.

PolarDB for MySQL architecture

  • Database proxy (Proxy)

    A network proxy between applications and the database. Provides security authentication, automatic read/write splitting, load balancing, consistency levels, connection pools, persistent connections, and overload protection.

  • Database compute nodes

    • PolarDB uses a multi-node cluster architecture. A Cluster Edition cluster contains one read/write node and multiple read-only nodes. A Multi-master Cluster contains multiple read/write nodes and multiple read-only nodes.

    • Read/write nodes and read-only nodes use active-active failover to provide high database availability.

    • Compute nodes run the SQL engine and are available in General-purpose and Dedicated specifications.

  • Shared distributed storage

    Multiple compute nodes share a single data copy, reducing storage costs. Built on a distributed block storage and file system, capacity scales online to hundreds of terabytes.
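The read/write splitting that the proxy performs can be pictured with a toy router. This is a minimal sketch of the general technique, not PolarDB's proxy logic: the class `ReadWriteSplitter`, the node names, and the statement-prefix rules are all hypothetical, and consistency levels, connection pooling, and overload protection are not modeled.

```python
import itertools


class ReadWriteSplitter:
    """Toy router: writes go to the read/write node, reads rotate across read-only nodes."""

    READ_PREFIXES = ("select", "show")

    def __init__(self, primary, replicas):
        self.primary = primary
        # Fall back to the primary when no read-only node is configured.
        self._replicas = itertools.cycle(replicas or [primary])
        self.in_transaction = False

    def route(self, sql):
        stmt = sql.strip().rstrip(";").lower()
        if stmt.startswith(("begin", "start transaction")):
            self.in_transaction = True
            return self.primary
        if stmt.startswith(("commit", "rollback")):
            self.in_transaction = False
            return self.primary
        # Inside a transaction, or for locking reads, stay on the primary.
        if self.in_transaction or stmt.endswith("for update"):
            return self.primary
        if stmt.startswith(self.READ_PREFIXES):
            return next(self._replicas)
        return self.primary


# Hypothetical node names, used only for illustration.
router = ReadWriteSplitter("rw-node", ["ro-node-1", "ro-node-2"])
targets = [router.route(q) for q in (
    "SELECT * FROM orders",
    "UPDATE orders SET paid = 1 WHERE id = 7",
    "SELECT * FROM orders WHERE id = 7",
)]
print(targets)
```

Because the application talks only to the router, adding or removing read-only nodes changes where reads land without any change to application code, which is the main reason to put this logic in a proxy.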

PolarDB for PostgreSQL

PolarDB for PostgreSQL clusters support centralized and distributed deployments:

  • Centralized

    Built on a cloud-native, storage-compute decoupled architecture, it combines commercial-grade stability and performance with open-source simplicity and rapid iteration.

  • Distributed

    A distributed database built on centralized PolarDB for PostgreSQL clusters. Uses a two-layer CN/DN architecture for distributed scalability with storage-compute decoupling, while retaining all centralized cluster features.

Centralized (Enterprise and Standard Editions)

PolarDB for PostgreSQL centralized architecture

  • Database proxy (Proxy)

    A network proxy between applications and the database. Provides security authentication, automatic read/write splitting, load balancing, consistency levels, connection pools, persistent connections, and overload protection.

  • Database compute nodes

    • PolarDB uses a multi-node cluster architecture. A Cluster Edition cluster contains one read/write node and multiple read-only nodes.

    • Read/write nodes and read-only nodes use active-active failover to provide high database availability.

    • Compute nodes run the SQL engine and are available in General-purpose and Dedicated specifications.

  • Shared distributed storage

    Multiple compute nodes share a single data copy, reducing storage costs. Built on a distributed block storage and file system, capacity scales online to hundreds of terabytes.

Distributed

PolarDB for PostgreSQL distributed architecture

  • Database nodes

    • Composed of compute nodes (CNs) and data nodes (DNs). CNs manage cluster metadata and generate distributed plans. DNs store the actual data shards.

    • Each CN and DN is itself a centralized cluster with storage-compute decoupling, and supports a database proxy and a one-primary, multiple-read-only topology. Compute nodes within a CN or DN fail over for high availability, and you can add read-only (RO) nodes to scale the read capacity of a single CN or DN.

  • Distributed features

    • Supports manual sharding and horizontal scaling for business scenarios at the petabyte level.

    • Ensures consistency for distributed transactions.

    • Supports dual-zone deployment. The primary zone is the working cluster, and the secondary zone is the Hot Standby Cluster.

    • Provides 24/7 non-disruptive upgrade, downgrade, and maintenance capabilities. You can also add heterogeneous CN and DN nodes to the cluster.
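The division of labor between CNs and DNs can be illustrated with a small routing sketch. It assumes hash distribution on a single distribution key, which the text above does not specify. The functions `shard_for` and `route_rows` and the node names are hypothetical, and this is not PolarDB's placement algorithm.

```python
import hashlib


def shard_for(key: str, num_shards: int) -> int:
    """Map a distribution-key value to a shard index with a stable hash."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


def route_rows(rows, key_column, data_nodes):
    """Group rows by the data node that owns each row's shard (the CN's routing job)."""
    placement = {dn: [] for dn in data_nodes}
    for row in rows:
        dn = data_nodes[shard_for(str(row[key_column]), len(data_nodes))]
        placement[dn].append(row)
    return placement


# Hypothetical data: 100 rows spread over three data nodes.
rows = [{"id": i, "amount": i * 10} for i in range(100)]
data_nodes = ["dn-0", "dn-1", "dn-2"]
placement = route_rows(rows, "id", data_nodes)
for dn, owned in placement.items():
    print(dn, len(owned))
```

Taking the hash modulo the node count keeps the sketch short, but it moves most rows when a node is added. Systems that scale horizontally usually hash to a fixed number of shards and remap shards to nodes instead.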

PolarDB Distributed Edition

In a PolarDB Distributed Edition cluster, storage nodes (DNs) start as a centralized deployment that is 100% compatible with MySQL 5.7 and 8.0. When you need distributed scaling, you can upgrade in place: distributed components connect to the existing storage nodes without data migration or application changes.

Centralized (Standard Edition)

PolarDB Distributed Edition centralized architecture

  • Storage node (DN)

    Handles data persistence using the Paxos consensus protocol for high reliability and strong consistency. The self-developed Lizard distributed transaction engine system provides more reliable high availability and ~35% better performance than the native MySQL distributed engine.

  • Multi-replica data architecture

    The Paxos consensus protocol ensures strong consistency (RPO = 0) between replicas — each write requires majority confirmation. If one node fails, the cluster continues operating. Replica roles:

    • Leader

      Processes client requests, makes decisions, and maintains logs for data consistency and recoverability.

    • Follower

      Executes instructions from the Leader. Can be elected as the new Leader if the current Leader fails.

    • Logger

      Provides Paxos protocol services only (no data services). Participates in Leader elections and may temporarily assume the Leader role until Followers catch up, then yields.

    • Learner

      Passively receives system status without voting or decision-making rights.
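The majority rule and the replica roles above can be sketched as follows. This is a simplified model, not the actual consensus implementation: it assumes that Leader, Follower, and Logger replicas all count toward the majority and that a live replica always acknowledges a write. The names `Replica`, `majority`, and `can_commit` are hypothetical.

```python
from dataclasses import dataclass

# Assumption: Leader, Follower, and Logger vote; Learners never do.
VOTING_ROLES = {"leader", "follower", "logger"}


@dataclass
class Replica:
    name: str
    role: str
    alive: bool = True


def majority(replicas):
    """Smallest number of voting replicas that forms a majority."""
    voters = [r for r in replicas if r.role in VOTING_ROLES]
    return len(voters) // 2 + 1


def can_commit(replicas):
    """A write commits only if a majority of voting replicas acknowledge it."""
    acks = sum(1 for r in replicas if r.role in VOTING_ROLES and r.alive)
    return acks >= majority(replicas)


cluster = [
    Replica("node-a", "leader"),
    Replica("node-b", "follower"),
    Replica("node-c", "logger"),
    Replica("node-d", "learner"),
]
print(majority(cluster), can_commit(cluster))

cluster[1].alive = False  # one voting replica fails
print(can_commit(cluster))

cluster[2].alive = False  # a second voting replica fails
print(can_commit(cluster))
```

With three voting replicas the majority is two, so the cluster tolerates one voting failure and stops accepting writes after a second. The Learner does not change the outcome either way.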

Distributed (Enterprise Edition)

PolarDB Distributed Edition distributed architecture

  • Global meta service (GMS)

    Maintains globally consistent metadata (tables, schemas, statistics), security information (accounts, permissions), and the global timestamp service (TSO).

  • Compute node (CN)

    Stateless entry point with SQL parser, optimizer, and executor. Handles distributed routing, computation, two-phase commit (2PC) coordination, distributed DDL, and global index maintenance.

  • Storage node (DN)

    Persists row store data using Paxos for strong consistency and MVCC for distributed transaction visibility. Supports computation pushdown (Project, Filter, Join, Aggregation).

  • Column store node (Columnar)

    Persists columnstore indexes on Object Storage Service (OSS), consuming distributed transaction binary logs in real time. Combined with compute nodes, provides snapshot-consistent column store queries.

  • Change data capture (CDC) node

    Provides incremental subscriptions compatible with MySQL binary log format and primary/secondary replication compatible with MySQL replication protocol.
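The interaction between the global timestamp service, a coordinating compute node, and the storage nodes can be sketched as a textbook two-phase commit. This is a simplified illustration under stated assumptions, not PolarDB's protocol: the classes `TimestampOracle` and `StorageNode` and the function `two_phase_commit` are hypothetical stand-ins, and durable logging, timeouts, and crash recovery are left out.

```python
import itertools


class TimestampOracle:
    """Stand-in for the GMS timestamp service: hands out strictly increasing integers."""

    def __init__(self):
        self._counter = itertools.count(1)

    def next(self):
        return next(self._counter)


class StorageNode:
    """Stand-in for a DN that can prepare, commit, or roll back one transaction branch."""

    def __init__(self, name, will_prepare=True):
        self.name = name
        self.will_prepare = will_prepare
        self.state = "idle"
        self.commit_ts = None

    def prepare(self):
        self.state = "prepared" if self.will_prepare else "aborted"
        return self.will_prepare

    def commit(self, ts):
        self.state, self.commit_ts = "committed", ts

    def rollback(self):
        self.state = "aborted"


def two_phase_commit(tso, participants):
    """Coordinator logic (the CN role): commit only if every participant prepares."""
    # Phase 1: ask every participant to prepare; stop at the first refusal.
    prepared = []
    for node in participants:
        if node.prepare():
            prepared.append(node)
        else:
            for p in prepared:
                p.rollback()
            return None
    # Phase 2: stamp the whole transaction with one global commit timestamp.
    commit_ts = tso.next()
    for node in participants:
        node.commit(commit_ts)
    return commit_ts


tso = TimestampOracle()
ok_nodes = [StorageNode("dn-0"), StorageNode("dn-1")]
print(two_phase_commit(tso, ok_nodes), [n.state for n in ok_nodes])

bad_nodes = [StorageNode("dn-0"), StorageNode("dn-1", will_prepare=False)]
print(two_phase_commit(tso, bad_nodes), [n.state for n in bad_nodes])
```

Giving every branch of a transaction the same commit timestamp is what lets a reader with a snapshot timestamp see the transaction either on all storage nodes or on none of them.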

How to use PolarDB

Manage PolarDB clusters, databases, and accounts through:

  • Console: A graphical web interface for easy operations.

  • API: Programmatic access to all console operations.

  • SDK: Language-specific wrappers for all console operations.

  • CLI: Command-line access to all console operations.

Quick start

  • PolarDB for MySQL: Billing, User Guide, Performance Whitepaper

  • PolarDB for PostgreSQL: Billing, Performance Whitepaper

  • PolarDB for PostgreSQL (Compatible with Oracle): Billing

  • PolarDB Distributed Edition: Billing overview, User Guide