PolarDB for PostgreSQL (Compatible with Oracle) inter-cluster migration

更新时间:
复制 MD 格式

Data Transmission Service (DTS) enables data migration between PolarDB for PostgreSQL (Compatible with Oracle) clusters.

Prerequisites

  • The source and destination PolarDB for PostgreSQL (Compatible with Oracle) clusters must exist. For more information, see Create a PolarDB for PostgreSQL (Compatible with Oracle) cluster.

  • In the source PolarDB for PostgreSQL (Compatible with Oracle) cluster, the wal_level parameter must be set to logical. This setting adds the information required for logical decoding to the write-ahead logging (WAL). For more information, see Configure cluster parameters.

Usage notes

Note
  • During schema migration, DTS migrates foreign keys from the source database to the target database.

  • During full migration and incremental migration, DTS temporarily disables constraint checks and foreign key cascading operations at the session level. If cascading update or delete operations occur on the source database while the task runs, data inconsistency may occur.

Type

Description

Source database limits

  • Bandwidth: The source database server must have sufficient egress bandwidth. Otherwise, the data migration speed is affected.

  • Tables to be migrated must have a primary key or a UNIQUE constraint, and the fields in the constraint must be unique. Otherwise, duplicate data may occur in the destination database.

  • If you migrate objects at the table level and need to edit them, such as mapping column names, a single data migration task can migrate a maximum of 1,000 tables. If you exceed this limit, an error is reported when you submit the task. In this case, split the tables into smaller batches and configure multiple tasks, or configure a task to migrate the entire database.

  • If you perform incremental migration, for the write-ahead log (WAL):

    • Enable it.

    • For incremental migration tasks, DTS requires the source database to retain WAL logs for more than 24 hours. For tasks that include both full and incremental migration, DTS requires the source database to retain WAL logs for at least 7 days. After the full migration is complete, you can change the log retention period to more than 24 hours. If the retention period is too short, the DTS task may fail because DTS cannot obtain the required WAL logs. In extreme cases, this can lead to data inconsistency or data loss. Issues caused by a log retention period that is shorter than the DTS requirement are not covered by the DTS Service-level agreement (SLA).

  • Source database operation limits:

    • During the schema migration and full migration phases, do not perform DDL operations that change the database or table structure. Otherwise, the data migration task fails.

    • If you perform only full data migration, do not write new data to the source instance. Otherwise, data inconsistency occurs between the source and destination. To maintain real-time data consistency, select schema migration, full data migration, and incremental data migration.

    • To ensure the migration task runs properly and to prevent logical subscription interruptions caused by a primary/secondary switchover, your PolarDB for PostgreSQL (Compatible with Oracle) instance must support and have Logical Replication Slot Failover enabled. For more information, see Enable Logical Replication Slot Failover.

      Note

      If the source PolarDB for PostgreSQL (Compatible with Oracle) cluster does not support the logical replication slot failover feature (for example, when the cluster's Database Engine is Oracle Syntax Compatible 2.0), the migration instance may fail and cannot be recovered when the source database triggers an HA failover.

    • Due to the limits of logical replication in the source database, if a single piece of data to be migrated exceeds 256 MB after an incremental change, the migration instance may fail and cannot be recovered. You must reconfigure the migration instance.

  • If the source database has long-running transactions and the instance includes an incremental migration task, the write-ahead log (WAL) before the transaction commit may accumulate and cannot be cleared. This can lead to insufficient disk space in the source database.

Other limits

  • A single data migration task can migrate only one database. To migrate multiple databases, configure a separate data migration task for each database.

  • DTS does not support the migration of tables created by the TimescaleDB extension, tables with cross-schema inheritance, or tables with unique indexes based on expressions.

  • If a table to be migrated contains a SERIAL column, the source database automatically creates a sequence for that column. Therefore, when you configure the Source Objects, if you select Schema Migration for Migration Types, select Sequence or migrate the entire schema. Otherwise, the migration instance may fail.

  • If the migration instance performs incremental data migration, you must run the ALTER TABLE schema.table REPLICA IDENTITY FULL; command on the tables to be migrated in the source database before you write data to them. This ensures data consistency. This requirement applies in the following two scenarios. During the execution of this command, do not perform table lock operations to avoid deadlocks. If you skip the related check in the precheck, DTS automatically runs this command during the instance initialization.

    • When the instance runs for the first time.

    • When the migration object granularity is Schema, and a new table is created in the schema to be migrated or a table to be migrated is rebuilt using the RENAME command.

    Note
    • In the command, replace schema and table with the schema name and table name of the data to be migrated.

    • We recommend that you perform this operation during off-peak hours.

  • DTS creates the following temporary tables in the source database to obtain DDL statements for incremental data, the structure of incremental tables, and heartbeat information. Do not delete these temporary tables during the migration. Otherwise, the DTS task becomes abnormal. The temporary tables are automatically deleted after the DTS instance is released.

    public.dts_pg_class, public.dts_pg_attribute, public.dts_pg_type, public.dts_pg_enum, public.dts_postgres_heartbeat, public.dts_ddl_command, public.dts_args_session, and public.aliyun_dts_instance.

  • To ensure the accuracy of the displayed latency for incremental data migration, DTS adds a heartbeat table named dts_postgres_heartbeat to the source database.

  • During incremental data migration, DTS creates a replication slot with the dts_sync_ prefix in the source database to replicate data. Using this replication slot, DTS can obtain incremental logs from the source database within the last 15 minutes. When the data migration fails or the migration instance is released, DTS attempts to automatically clear this replication slot.

    Note
    • If you change the password of the source database account used by the task or delete the DTS IP address whitelist from the source database during data migration, the replication slot cannot be automatically cleared. In this case, you must manually clear the replication slot in the source database to prevent it from accumulating and occupying disk space, which can make the source database unavailable.

    • If a failover occurs in the source database, you must log on to the secondary database to manually clear the slot.

  • Evaluate the performance of the source and destination databases before data migration. Perform data migration during off-peak hours. During full data migration, DTS consumes some read and write resources on both databases, which can increase the database load.

  • Because full data migration performs concurrent INSERT operations, it causes table fragmentation in the destination database. Therefore, after the full migration is complete, the table storage space in the destination database is larger than that in the source instance.

  • Confirm that the migration precision of DTS for columns of the FLOAT or DOUBLE data type meets your business requirements. DTS reads the values of these two types of columns using ROUND(COLUMN,PRECISION). If the precision is not explicitly defined, DTS migrates FLOAT with a precision of 38 digits and DOUBLE with a precision of 308 digits.

  • DTS attempts to resume failed migration tasks within seven days. Therefore, before you switch your business to the destination instance, you must end or release the task, or use the revoke command to revoke the write permissions of the account that DTS uses to access the destination instance. This prevents the source data from overwriting the data in the destination instance if the task is automatically resumed.

  • DTS validates data content but does not validate metadata such as Sequences. You must validate metadata yourself.

  • After you switch your business to the destination instance, new sequences do not start incrementing from the maximum value of the source sequences. You must update the sequence values in the destination database before the switchover. For more information, see Update the sequence values in the destination database.

  • For a full or incremental migration task, if the tables to be migrated in the source database contain foreign keys, triggers, or event triggers, DTS temporarily sets the session_replication_role parameter to replica at the session level if the destination database account is a privileged account or has superuser permissions. If the destination database account does not have these permissions, you must manually set the session_replication_role parameter to replica in the destination database. During this period (when session_replication_role is replica), if cascade update or delete operations occur in the source database, data inconsistency may occur. After the DTS migration task is released, you can change the session_replication_role parameter back to origin.

  • If a task fails, DTS support staff will attempt to restore it within eight hours. During restoration, they may restart the task or adjust its parameters.

    Note

    Only DTS task parameters are modified—not database parameters. Parameters that may be adjusted include those listed in Modify instance parameters.

  • When migrating partitioned tables, include both the child partitions and the parent table as synchronization objects. Otherwise, data inconsistency may occur for the partitioned table.

    Note

    The parent table of a partitioned table in PolarDB for PostgreSQL (Compatible with Oracle) does not directly store data. All data is stored in the child partitions. The sync task must include both the parent table and all its child partitions. Otherwise, data from the child partitions may be missed, which leads to data inconsistency between the source and destination.

Billing

Migration type

Instance configuration fee

Internet traffic fee

Schema migration and full data migration

Free of charge.

When the Access Method parameter of the destination database is set to Public IP Address, you are charged for Internet traffic. For more information, see Billing overview.

Incremental data migration

Charged. For more information, see Billing overview.

Migration types

Migration type

Description

Schema migration

DTS migrates object schemas to the destination database. DTS supports schema migration for the following object types: tables, views, synonyms, stored procedures, stored functions, packages, and user-defined types.

Note

DTS does not support schema migration for triggers. We recommend deleting triggers from the source database to prevent data inconsistency. For more information, see Configure a data synchronization or migration task for a source database that contains a trigger.

Full data migration

DTS migrates historical data from the source database to the destination database.

Note

Do not perform DDL operations on the migration objects before the schema migration and full data migration are complete. Otherwise, the migration task may fail.

Incremental data migration

After the full data migration, DTS captures incremental data updates from the source database's redo log and migrates them to the destination database.

Incremental data migration enables a smooth migration with minimal application downtime.

Supported objects

  • schema and table

    Note

    This includes primary key, unique key, foreign key, built-in data type, and default constraint.

  • Support for other objects, such as view and procedure (requires PostgreSQL version 11 or later), depends on the destination database type. For specific details, refer to the console.

Supported SQL for incremental migration

Operation type

SQL statement

DML

INSERT, UPDATE, and DELETE

DDL

  • DTS migrates DDL operations only for data migration tasks created after October 1, 2020.

    Important
    • For data migration tasks that were created before September 9, 2022, you must create triggers and functions in the source database to capture DDL information before you configure the migration task. For more information, see Use triggers and functions to perform DDL incremental migration for PostgreSQL.

    • DTS does not support migrating data of the BIT type during incremental data migration.

  • When the source database account is a privileged account, DTS supports the following DDL statements:

    • CREATE TABLE and DROP TABLE

    • ALTER TABLE (including RENAME TABLE, ADD COLUMN, ADD COLUMN DEFAULT, ALTER COLUMN TYPE, DROP COLUMN, ADD CONSTRAINT, ADD CONSTRAINT CHECK, and ALTER COLUMN DROP DEFAULT)

    • TRUNCATE TABLE

    • CREATE INDEX ON TABLE

    Important
    • DTS does not migrate additional clauses in DDL statements, such as CASCADE or RESTRICT.

    • DTS does not migrate DDL statements from a session in which the SET session_replication_role = replica command is executed.

    • DTS does not migrate DDL statements executed by invoking a function.

    • If a single commit in the source database contains both DML and DDL statements, DTS does not migrate the DDL statements.

    • If a single commit in the source database includes DDL statements for unselected objects, DTS does not migrate these statements.

Permission requirements for database accounts

Database

Permission requirements

Source PolarDB for PostgreSQL (Compatible with Oracle) cluster

A privileged account.

Destination PolarDB for PostgreSQL (Compatible with Oracle) cluster

Database owner permission.

Important

You specify the database owner when you create the database.

To create a database account and grant permissions:

For a PolarDB for PostgreSQL (Compatible with Oracle) cluster, see Create a database account.

Procedure

  1. Navigate to the migration task list page for the destination region using one of the following methods.

    From the DTS console

    1. Log on to the Data Transmission Service (DTS) console.

    2. In the navigation pane on the left, click Data Migration.

    3. In the upper-left corner of the page, select the region where the migration instance is located.

    From the DMS console

    Note

    The actual operations may vary based on the mode and layout of the DMS console. For more information, see Simple mode console and Customize the layout and style of the DMS console.

    1. Log on to the Data Management (DMS) console.

    2. In the top menu bar, choose Data + AI > Data Transmission (DTS) > Data Migration.

    3. To the right of Data Migration Tasks, select the region where the migration instance is located.

  2. Click Create Task to navigate to the task configuration page.

  3. Configure the source and destination databases.

    Warning

    After you select the source and destination instances, we recommend that you carefully read the limits displayed at the top of the page. Otherwise, the task may fail or data inconsistency may occur.

    Section

    Parameter

    Description

    N/A

    Task Name

    DTS automatically generates a task name. We recommend that you specify a descriptive name for easy identification. The name does not need to be unique.

    Source Database

    Select Existing Connection

    • To use a database instance that has been added to the system (created or saved), select the desired database instance from the drop-down list. The database information below will be automatically configured.

      Note

      In the DMS console, this parameter is named Select a DMS database instance..

    • If you have not registered the database instance with the system, or do not need to use a registered instance, manually configure the database information below.

    Database Type

    Select PolarDB (Compatible with Oracle).

    Connection Type

    Select Public IP Address.

    Instance Region

    Select the region where the source PolarDB for PostgreSQL (Compatible with Oracle) cluster is located.

    Data Sources

    • Single Data Source: This option requires you to specify the Domain Name or IP Address and Port Number of the primary node in the source PolarDB for PostgreSQL (Compatible with Oracle) cluster.

    • Multiple Data Sources: This option requires you to also enter the IP addresses and port numbers of multiple nodes, including the primary node, in the source PolarDB for PostgreSQL (Compatible with Oracle) cluster in the IP:Port of Data Sources field.

      Note

      If you select Multiple Data Sources and configure the IP:Port of Data Sources parameter, DTS can automatically fail over to a new primary node if a primary/secondary switchover occurs in the PolarDB for PostgreSQL (Compatible with Oracle) cluster. This feature supports disaster recovery scenarios.

    In this example, Single Data Source is selected.

    Domain Name or IP Address

    Enter the endpoint of the primary node in the source PolarDB for PostgreSQL (Compatible with Oracle) cluster.

    Note

    You can run the ping command on your computer to obtain the IP address of the direct connection endpoint of the target PolarDB for PostgreSQL (Compatible with Oracle) cluster.

    Port Number

    Enter the service port of the source database. The default value is 1521.

    Note

    In this example, this service port must be accessible from the internet.

    IP:Port of Data Sources

    Enter the IP addresses and port numbers of multiple nodes in the source PolarDB for PostgreSQL (Compatible with Oracle) cluster. Separate multiple entries with commas (,).

    Oracle Type

    • Non-RAC Instance: This option requires you to specify the SID.

    • RAC or PDB Instance: This option requires you to specify the Service Name.

    In this example, Non-RAC Instance is selected.

    Database Name

    Enter the name of the database that contains the migration object in the source PolarDB for PostgreSQL (Oracle Compatible) cluster.

    Database Account

    Enter the database account for the source PolarDB for PostgreSQL (Compatible with Oracle) cluster. For information about the required permissions, see Required permissions for database accounts.

    Database Password

    Enter the password for the database account.

    Destination Database

    Select Existing Connection

    • To use a database instance that has been added to the system (created or saved), select the desired database instance from the drop-down list. The database information below will be automatically configured.

      Note

      In the DMS console, this parameter is named Select a DMS database instance..

    • If you have not registered the database instance with the system, or do not need to use a registered instance, manually configure the database information below.

    Database Type

    Select PolarDB (Compatible with Oracle).

    Connection Type

    Select Public IP Address.

    Instance Region

    Select the region where the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster is located.

    Data Sources

    • Single Data Source: This option requires you to specify the Domain Name or IP Address and Port Number of the primary node in the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster.

    • Multiple Data Sources: This option requires you to also enter the IP addresses and port numbers of multiple nodes, including the primary node, in the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster in the IP:Port of Data Sources field.

      Note

      If you select Multiple Data Sources and configure the IP:Port of Data Sources parameter, DTS can automatically fail over to a new primary node if a primary/secondary switchover occurs in the PolarDB for PostgreSQL (Compatible with Oracle) cluster. This feature supports disaster recovery scenarios.

    In this example, Single Data Source is selected.

    Domain Name or IP Address

    Enter the endpoint of the primary node in the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster.

    Note

    You can run the ping command on your computer to obtain the IP address of the direct connection endpoint of the target PolarDB for PostgreSQL (Compatible with Oracle) cluster.

    Port Number

    Enter the service port of the destination database. The default value is 1521.

    IP:Port of Data Sources

    Enter the IP addresses and port numbers of multiple nodes in the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster. Separate multiple entries with commas (,).

    Database Name

    Enter the name of the database in the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster to which the objects will be migrated.

    Database Account

    Enter the database account for the destination PolarDB for PostgreSQL (Compatible with Oracle) cluster. For information about the required permissions, see Required permissions for database accounts.

    Database Password

    Enter the password for the database account.

  4. After completing the configuration, click Test Connectivity and Proceed at the bottom of the page.

    Note
    • Ensure that you add the CIDR blocks of the DTS servers (either automatically or manually) to the security settings of both the source and destination databases to allow access. For more information, see Add the IP address whitelist of DTS servers.

    • If the source or destination is a self-managed database (i.e., the Access Method is not Alibaba Cloud Instance), you must also click Test Connectivity in the CIDR Blocks of DTS Servers dialog box.

  5. Configure the task objects.

    1. On the Configure Objects page, configure the objects that you want to migrate.

      Parameter

      Description

      Migration Types

      • If you only need to perform a full migration, select both Schema Migration and Full Data Migration.

      • To perform a migration with no downtime, select Schema Migration, Full Data Migration, and Incremental Data Migration.

      Note
      • If you do not select Schema Migration, you must ensure that a database and tables to receive the data exist in the destination database. You can also use the object name mapping feature in the Selected Objects box as needed.

      • If you do not select Incremental Data Migration, do not write new data to the source instance during data migration to ensure data consistency.

      Processing Mode of Conflicting Tables

      • Precheck and Report Errors: Checks whether tables with the same names exist in the destination database. If no tables with the same names exist, the precheck is passed. If tables with the same names exist, an error is reported during the precheck, and the data migration task does not start.

        Note

        If a table in the destination database has the same name but cannot be easily deleted or renamed, you can change the name of the table in the destination database. For more information, see Object name mapping.

      • Ignore Errors and Proceed: Skips the check for tables with the same names.

        Warning

        Selecting Ignore Errors and Proceed may cause data inconsistency and business risks. For example:

        • If the table schemas are consistent and a record in the destination database has the same primary key value as a record in the source database:

          • During full migration, DTS keeps the record in the destination database. The record from the source database is not migrated.

          • During incremental migration, DTS does not keep the record in the destination database. The record from the source database overwrites the record in the destination database.

        • If the table schemas are inconsistent, only some columns of data may be migrated, or the migration may fail. Proceed with caution.

      Source Objects

      In the Source Objects box, click the objects to migrate, and then click Right arrow to move them to the Selected Objects box.

      Note
      • You can select migration objects at the schema or table level. If you select tables, other objects such as views, triggers, and stored procedures are not migrated to the destination database.

      • If a table to be migrated contains a field of the SERIAL type and you have selected Schema Migration for Migration Types, also select Sequence or migrate the entire schema.

      Selected Objects

      • To change the name of a single migration object in the destination instance, right-click the object in the Selected Objects box. For more information, see Map object names individually.

      • To change the names of multiple migration objects in the destination instance, click Batch Edit in the upper-right corner of the Selected Objects box. For more information, see Map object names in batches.

      Note
      • If you use the object name mapping feature, the migration of other objects that depend on the renamed object might fail.

      • To set WHERE conditions to filter data, right-click a table in the Selected Objects box and specify the filter conditions in the dialog box that appears. For more information, see Set filter conditions.

      • To select which SQL operations to migrate at the database or table level, right-click a migration object in the Selected Objects box and select the desired SQL operations in the dialog box that appears.

    2. Click Next: Advanced Settings to configure advanced parameters.

      Parameter

      Description

      Dedicated Cluster for Task Scheduling

      By default, DTS schedules tasks on a shared cluster. You do not need to select one. If you want more stable tasks, you can purchase a dedicated cluster to run DTS migration tasks.

      Retry Time for Failed Connections

      After the migration task starts, if the connection to the source or destination database fails, DTS reports an error and immediately begins to retry the connection. The default retry duration is 720 minutes. You can customize the retry time to a value from 10 to 1440 minutes. We recommend that you set the duration to more than 30 minutes. If DTS reconnects to the source and destination databases within the specified duration, the migration task automatically resumes. Otherwise, the task fails.

      Note
      • For multiple DTS instances that share the same source or destination, the network retry time is determined by the setting of the last created task.

      • Because you are charged for the task during the connection retry period, we recommend that you customize the retry time based on your business needs, or release the DTS instance as soon as possible after the source and destination database instances are released.

      Retry Time for Other Issues

      After the migration task starts, if a non-connectivity issue, such as a DDL or DML execution exception, occurs in the source or destination database, DTS reports an error and immediately begins to retry the operation. The default retry duration is 10 minutes. You can customize the retry time to a value from 1 to 1440 minutes. We recommend that you set the duration to more than 10 minutes. If the related operations succeed within the specified retry duration, the migration task automatically resumes. Otherwise, the task fails.

      Important

      The value of Retry Time for Other Issues must be less than the value of Retry Time for Failed Connections.

      Enable Throttling for Full Data Migration

      During full migration, DTS consumes read and write resources on the source and destination databases, which may increase the database load. If required, you can enable throttling for the full migration task. You can set Queries per second (QPS) to the source database, RPS of Full Data Migration, and Data migration speed for full migration (MB/s) to reduce the load on the destination database.

      Note
      • This configuration item is available only if you select Full Data Migration for Migration Types.

      • You can also adjust the full migration speed after the migration instance is running.

      Enable Throttling for Incremental Data Migration

      If required, you can also choose to set speed limits for the incremental migration task. You can set RPS of Incremental Data Migration and Data migration speed for incremental migration (MB/s) to reduce the load on the destination database.

      Note
      • This configuration item is available only if you select Incremental Data Migration for Migration Types.

      • You can also adjust the incremental migration speed after the migration instance is running.

      Environment Tag

      You can select an environment tag to identify the instance based on your business needs. In this example, you do not need to configure this parameter.

      Configure ETL

      Choose whether to enable the extract, transform, and load (ETL) feature. For more information, see What is ETL? Valid values:

      Monitoring and Alerting

      Select whether to set alerts and receive alert notifications based on your business needs.

      • No: Does not set an alert.

      • Yes: Configure alerts by setting an alert threshold and an alert contact. If a migration fails or the latency exceeds the threshold, the system sends an alert notification.

    3. Click Next: Data Validation to configure a data validation task.

      For more information about the data validation feature, see Configure data validation.

  6. Save the task and run a precheck.

    • To view the parameters for configuring this instance when you call the API operation, move the pointer over the Next: Save Task Settings and Precheck button and click Preview OpenAPI parameters in the bubble that appears.

    • If you do not need to view or have finished viewing the API parameters, click Next: Save Task Settings and Precheck at the bottom of the page.

    Note
    • Before the migration task starts, DTS performs a precheck. The task starts only after it passes the precheck.

    • If the precheck fails, click View Details next to the failed check item, fix the issue based on the prompt, and then run the precheck again.

    • If a warning is reported during the precheck:

      • For check items that cannot be ignored, click View Details next to the failed item, fix the issue based on the prompt, and then run the precheck again.

      • For check items that can be ignored, you can click Confirm Alert Details, Ignore, OK, and Precheck Again to skip the alert item and run the precheck again. If you choose to ignore a warning, it may cause issues such as data inconsistency and pose risks to your business.

  7. Purchase the instance.

    1. When the Success Rate is 100%, click Next: Purchase Instance.

    2. On the Purchase page, select the link specification for the data migration instance. For more information, see the following table.

      Category

      Parameter

      Description

      New Instance Class

      Resource Group Settings

      Select the resource group to which the instance belongs. The default value is default resource group. For more information, see What is Resource Management?

      Instance Class

      DTS provides migration specifications with different performance levels. The link specification affects the migration speed. You can select a specification based on your business scenario. For more information, see Data migration link specifications.

    3. After the configuration is complete, read and select Data Transmission Service (Pay-as-you-go) Service Terms.

    4. Click Buy and Start. In the OK dialog box that appears, click OK.

      You can view the progress of the migration task on the Data Migration Tasks list page.

      Note
      • If the migration task does not include incremental migration, it stops automatically after the full migration is complete. After the task stops, its Status changes to Completed.

      • If the migration task includes incremental migration, it does not stop automatically. The incremental migration task continues to run. While the incremental migration task is running, the Status of the task is Running.