Migrate data between AnalyticDB for MySQL V3.0 clusters

更新时间:
复制 MD 格式

This topic describes how to use Data Transmission Service (DTS) to migrate data between AnalyticDB for MySQL V3.0 clusters.

Prerequisites

  • A destination AnalyticDB for MySQL V3.0 cluster must be available with more storage space than is used by the source AnalyticDB for MySQL V3.0 cluster.

  • To perform incremental data migration, you must enable the change data capture (CDC) feature for the source AnalyticDB for MySQL V3.0 cluster and enable the binary log for the tables to be migrated.

    Note

    If the AnalyticDB for MySQL V3.0 cluster runs an engine version earlier than 3.2.1.0, you must upgrade the engine version.

Limitations

Note
  • During schema migration, DTS does not migrate foreign keys from the source database to the destination database.

  • During full and incremental migration, DTS temporarily disables constraint checks and foreign key cascade operations at the session level. If cascade update or delete operations occur in the source database while the task is running, data inconsistency may occur.

Type

Description

Source database limitations

  • Bandwidth requirement: The server that hosts the source database must have sufficient outbound bandwidth. Otherwise, the data migration speed slows.

  • To migrate incremental data, the AnalyticDB for MySQL V3.0 cluster must run engine version 3.2.1.0 or later.

  • Each table to be migrated must have a custom primary key with unique values. Otherwise, duplicate data may be written to the destination database.

    Note

    If the primary key of a table to be migrated is the auto-generated __adb_auto_id__, DTS cannot migrate data from that table.

  • During schema migration and full data migration, do not perform DDL operations to change the schemas of databases or tables. Otherwise, the migration instance fails.

  • If you perform only full data migration, do not write new data to the source instance during the migration. Otherwise, data will become inconsistent between the source and destination databases. To ensure real-time data consistency, we recommend that you select schema migration, full data migration, and incremental data migration.

Other limitations

  • An incremental data migration instance can migrate data from only one table. To migrate multiple tables incrementally, create a separate migration instance for each table.

  • The destination database must have a custom primary key. Or, in the Configurations for Databases, Tables, and Columns step, set the Primary Key Column. Otherwise, migration may fail.

  • The data verification feature is not supported.

  • If a migration instance is paused for more than one day, the binary log of the source AnalyticDB for MySQL V3.0 cluster may expire, and the instance may become unrecoverable.

  • Do not disable the binary log of the source AnalyticDB for MySQL V3.0 cluster while the migration instance is running. Otherwise, the instance fails and cannot be resumed. If an instance fails because you disabled the binary log on the source, you must create a new instance to resolve the issue.

  • Due to the usage limits of an AnalyticDB for MySQL V3.0 cluster, if the disk space usage on a node in the cluster exceeds 80%, the DTS instance may experience latency. Therefore, evaluate the required storage space based on the objects to be migrated to ensure that the destination cluster has sufficient storage space.

  • If the destination AnalyticDB for MySQL V3.0 cluster is being backed up while the DTS instance is running, the instance fails.

  • Full data migration involves concurrent INSERT operations, which cause fragmentation in the destination database tables. After the full data migration is complete, the destination tables occupy more storage space than the source tables.

  • During full data migration, DTS consumes read and write resources on both the source and destination databases, which can increase the database load. Evaluate the performance of the source and destination databases before the data migration. Perform the data migration during off-peak hours, for example, when the CPU usage of both databases is below 30%.

  • When the destination database is an AnalyticDB for MySQL cluster, DTS supports writing only the data types that are supported by the cluster. These include basic data types and complex data types such as ARRAY, MAP, and JSON. Other types, such as MULTIVALUE, are not supported.

  • If a task fails, DTS support staff will attempt to restore it within eight hours. During restoration, they may restart the task or adjust its parameters.

    Note

    Only DTS task parameters are modified—not database parameters. Parameters that may be adjusted include those listed in Modify instance parameters.

Billing

Migration type

Instance configuration fee

Internet traffic fee

Schema migration and full data migration

Free of charge.

When the Access Method parameter of the destination database is set to Public IP Address, you are charged for Internet traffic. For more information, see Billing overview.

Incremental data migration

Charged. For more information, see Billing overview.

SQL operations supported for incremental migration

Operation type

SQL statement

DML

INSERT, UPDATE, DELETE

Note

When data is written to an AnalyticDB for MySQL V3.0 cluster, UPDATE statements are automatically converted to REPLACE INTO statements. If the primary key is updated, the statement is converted to a DELETE and INSERT operation.

DDL

ADD COLUMN, DROP COLUMN

Required permissions for database accounts

Database

Required permissions

Creation and authorization

Source AnalyticDB for MySQL V3.0 cluster

Read permissions on the objects to be migrated.

Create a database account

Destination AnalyticDB for MySQL V3.0 cluster

Read and write permissions on the destination database.

Procedure

  1. Navigate to the migration task list page for the destination region using one of the following methods.

    From the DTS console

    1. Log on to the Data Transmission Service (DTS) console.

    2. In the navigation pane on the left, click Data Migration.

    3. In the upper-left corner of the page, select the region where the migration instance is located.

    From the DMS console

    Note

    The actual operations may vary based on the mode and layout of the DMS console. For more information, see Simple mode console and Customize the layout and style of the DMS console.

    1. Log on to the Data Management (DMS) console.

    2. In the top menu bar, choose Data + AI > Data Transmission (DTS) > Data Migration.

    3. To the right of Data Migration Tasks, select the region where the migration instance is located.

  2. Click Create Task to navigate to the task configuration page.

  3. Configure the source and destination databases.

    Category

    Parameter

    Description

    N/A

    Task Name

    DTS automatically generates a task name. We recommend that you specify a descriptive name for easy identification. The name does not need to be unique.

    Source Database

    Select Existing Connection

    • To use a database instance that has been added to the system (created or saved), select the desired database instance from the drop-down list. The database information below will be automatically configured.

      Note

      In the DMS console, this parameter is named Select a DMS database instance..

    • If you have not registered the database instance with the system, or do not need to use a registered instance, manually configure the database information below.

    Database Type

    Select AnalyticDB for MySQL 3.0.

    Access Method

    Select Alibaba Cloud Instance.

    Instance Region

    Select the region where the source AnalyticDB for MySQL V3.0 cluster resides.

    Replicate Data Across Alibaba Cloud Accounts

    In this example, a database instance under the current Alibaba Cloud account is used. Select No.

    Instance ID

    Select the ID of the source AnalyticDB for MySQL V3.0 cluster.

    Database Account

    Enter the database account of the source AnalyticDB for MySQL V3.0 cluster. For information about the required permissions, see Required permissions for database accounts.

    Database Password

    Enter the password for the database account.

    Destination Database

    Select Existing Connection

    • To use a database instance that has been added to the system (created or saved), select the desired database instance from the drop-down list. The database information below will be automatically configured.

      Note

      In the DMS console, this parameter is named Select a DMS database instance..

    • If you have not registered the database instance with the system, or do not need to use a registered instance, manually configure the database information below.

    Database Type

    Select AnalyticDB for MySQL 3.0.

    Access Method

    Select Alibaba Cloud Instance.

    Instance Region

    Select the region where the destination AnalyticDB for MySQL V3.0 cluster resides.

    Instance ID

    Select the ID of the destination AnalyticDB for MySQL V3.0 cluster.

    Database Account

    Enter the database account of the destination AnalyticDB for MySQL V3.0 cluster. For information about the required permissions, see Required permissions for database accounts.

    Database Password

    Enter the password for the database account.

  4. After you complete the configuration, click Test Connectivity and Proceed at the bottom of the page.

    Note

    Ensure that the DTS service IP address segments are automatically or manually added to the security settings of the source and destination databases to allow access from DTS servers. For more information, see Add DTS server IP addresses to a whitelist.

  5. Configure the task objects.

    1. On the Configure Objects page, configure the objects that you want to migrate.

      Parameter

      Description

      Migration Types

      • If you only need to perform a full migration, select both Schema Migration and Full Data Migration.

      • To perform a migration with no downtime, select Schema Migration, Full Data Migration, and Incremental Data Migration.

      Note
      • If you do not select Schema Migration, you must ensure that a database and tables to receive the data exist in the destination database. You can also use the object name mapping feature in the Selected Objects box as needed.

      • If you do not select Incremental Data Migration, do not write new data to the source instance during data migration to ensure data consistency.

      DDL and DML Operations to Be Synchronized

      When you select Migration Types for Incremental Data Migration, you can also select the operations for incremental migration at the instance level.

      Note

      To select operations for incremental migration at the table level, right-click a migration object in Selected Objects and select the desired operations in the dialog box that appears.

      Merge Tables

      When you do not select Migration Types for Incremental Data Migration, you can also configure whether to enable the table merging feature.

      • If you select Yes, DTS adds the __dts_data_source column to each table to record data sources. For more information, see Enable multi-table merge.

      • If you select No, this is the default option.

      Note

      The table merging feature is configured at the task level, not the table level. To merge some tables but not others, you must create two separate data migration tasks.

      Warning

      Do not perform DDL operations to change the schema of the source database or tables. Otherwise, data inconsistency or task failure may occur.

      Processing Mode of Conflicting Tables

      • Precheck and Report Errors: Checks whether tables with the same names exist in the destination database. If no tables with the same names exist, the precheck is passed. If tables with the same names exist, an error is reported during the precheck, and the data migration task does not start.

        Note

        If a table in the destination database has the same name but cannot be easily deleted or renamed, you can change the name of the table in the destination database. For more information, see Object name mapping.

      • Ignore Errors and Proceed: Skips the check for tables with the same names.

        Warning

        Selecting Ignore Errors and Proceed may cause data inconsistency and business risks. For example:

        • If the table schemas are consistent and a record in the destination database has the same primary key value as a record in the source database:

          • During full migration, DTS keeps the record in the destination database. The record from the source database is not migrated.

          • During incremental migration, DTS does not keep the record in the destination database. The record from the source database overwrites the record in the destination database.

        • If the table schemas are inconsistent, only some columns of data may be migrated, or the migration may fail. Proceed with caution.

      Source Objects

      In the Source Objects box, click the objects to migrate, and then click Right arrow to move them to the Selected Objects box.

      Important
      • If you select Migration Types for Incremental Data Migration, you can select only one table as the migration object.

      • If you do not select Migration Types for Incremental Data Migration, you can select a database, table, or column as the migration object.

      • If you select an entire database as the migration object, the following rules apply by default:

        • If a table to be migrated in the source database has a primary key (single-column or multi-column), the primary key column is used as the distribution key.

        • If a table to be migrated in the source database does not have a primary key, an auto-increment primary key column is automatically generated. This may cause data inconsistency between the source and destination databases.

      Selected Objects

      • To set the name of a migration object in the destination instance, or to specify the object that receives data in the destination instance, right-click the migration object in the Selected Objects box to make changes. For more information, see Object name mapping.

      • To remove a selected migration object, click the object in the Selected Objects box, and then click image to move it to the Source Objects box.

      Note
      • If you use the object name mapping feature, other objects that depend on the mapped object may fail to migrate.

      • To set a WHERE clause to filter data, right-click the table to migrate in the Selected Objects box and set the filter condition in the dialog box that appears. For more information about how to set the condition, see Set filter conditions.

      • To select the SQL operations for incremental migration, right-click the migration object in the Selected Objects box and select the desired SQL operations in the dialog box that appears.

    2. Click Next: Advanced Settings to configure advanced parameters.

      Parameter

      Description

      Dedicated Cluster for Task Scheduling

      By default, DTS schedules tasks on a shared cluster. You do not need to select one. If you want more stable tasks, you can purchase a dedicated cluster to run DTS migration tasks.

      Retry Time for Failed Connections

      After the migration task starts, if the connection to the source or destination database fails, DTS reports an error and immediately begins to retry the connection. The default retry duration is 720 minutes. You can customize the retry time to a value from 10 to 1440 minutes. We recommend that you set the duration to more than 30 minutes. If DTS reconnects to the source and destination databases within the specified duration, the migration task automatically resumes. Otherwise, the task fails.

      Note
      • For multiple DTS instances that share the same source or destination, the network retry time is determined by the setting of the last created task.

      • Because you are charged for the task during the connection retry period, we recommend that you customize the retry time based on your business needs, or release the DTS instance as soon as possible after the source and destination database instances are released.

      Retry Time for Other Issues

      After the migration task starts, if a non-connectivity issue, such as a DDL or DML execution exception, occurs in the source or destination database, DTS reports an error and immediately begins to retry the operation. The default retry duration is 10 minutes. You can customize the retry time to a value from 1 to 1440 minutes. We recommend that you set the duration to more than 10 minutes. If the related operations succeed within the specified retry duration, the migration task automatically resumes. Otherwise, the task fails.

      Important

      The value of Retry Time for Other Issues must be less than the value of Retry Time for Failed Connections.

      Enable Throttling for Full Data Migration

      During full migration, DTS consumes read and write resources on the source and destination databases, which may increase the database load. If required, you can enable throttling for the full migration task. You can set Queries per second (QPS) to the source database, RPS of Full Data Migration, and Data migration speed for full migration (MB/s) to reduce the load on the destination database.

      Note
      • This configuration item is available only if you select Full Data Migration for Migration Types.

      • You can also adjust the full migration speed after the migration instance is running.

      Enable Throttling for Incremental Data Migration

      If required, you can also choose to set speed limits for the incremental migration task. You can set RPS of Incremental Data Migration and Data migration speed for incremental migration (MB/s) to reduce the load on the destination database.

      Note
      • This configuration item is available only if you select Incremental Data Migration for Migration Types.

      • You can also adjust the incremental migration speed after the migration instance is running.

      Environment Tag

      You can select an environment tag to identify the instance based on your business needs. In this example, no tag is needed.

      Configure ETL

      Choose whether to enable the extract, transform, and load (ETL) feature. For more information, see What is ETL? Valid values:

      Monitoring and Alerting

      Select whether to set alerts and receive alert notifications based on your business needs.

      • No: Does not set an alert.

      • Yes: Configure alerts by setting an alert threshold and an alert contact. If a migration fails or the latency exceeds the threshold, the system sends an alert notification.

    3. Click Next: Data Validation to configure a data validation task.

      For more information about the data validation feature, see Configure data validation.

    4. Optional: After you complete the preceding configurations, click Next: Configure Database and Table Fields to set the Type, Primary Key Column, Distribution Key, and partition key information (Partition Key, Partitioning Rules, and Partition Lifecycle) for the tables to be migrated in the destination database.

      Note
      • This step is available only if you select Migration Types for Schema Migration when configuring migration objects. You can set Definition Status to All to make modifications.

      • You can select multiple columns to form a composite primary key for the Primary Key Column. You must select one or more columns from the Primary Key Column to serve as the Distribution Key and Partition Key. For more information, see CREATE TABLE.

  6. Save the task and run a precheck.

    • To view the parameters for configuring this instance when you call the API operation, move the pointer over the Next: Save Task Settings and Precheck button and click Preview OpenAPI parameters in the bubble that appears.

    • If you do not need to view or have finished viewing the API parameters, click Next: Save Task Settings and Precheck at the bottom of the page.

    Note
    • Before the migration task starts, DTS performs a precheck. The task starts only after it passes the precheck.

    • If the precheck fails, click View Details next to the failed check item, fix the issue based on the prompt, and then run the precheck again.

    • If a warning is reported during the precheck:

      • For check items that cannot be ignored, click View Details next to the failed item, fix the issue based on the prompt, and then run the precheck again.

      • For check items that can be ignored, you can click Confirm Alert Details, Ignore, OK, and Precheck Again to skip the alert item and run the precheck again. If you choose to ignore a warning, it may cause issues such as data inconsistency and pose risks to your business.

  7. Purchase the instance.

    1. When the Success Rate is 100%, click Next: Purchase Instance.

    2. On the Purchase page, select the link specification for the data migration instance. For more information, see the following table.

      Category

      Parameter

      Description

      New Instance Class

      Resource Group Settings

      Select the resource group to which the instance belongs. The default value is default resource group. For more information, see What is Resource Management?

      Instance Class

      DTS provides migration specifications with different performance levels. The link specification affects the migration speed. You can select a specification based on your business scenario. For more information, see Data migration link specifications.

    3. After the configuration is complete, read and select Data Transmission Service (Pay-as-you-go) Service Terms.

    4. Click Buy and Start. In the OK dialog box that appears, click OK.

      You can view the progress of the migration task on the Data Migration Tasks list page.

      Note
      • If the migration task does not include incremental migration, it stops automatically after the full migration is complete. After the task stops, its Status changes to Completed.

      • If the migration task includes incremental migration, it does not stop automatically. The incremental migration task continues to run. While the incremental migration task is running, the Status of the task is Running.