Synchronize RDS SQL Server to SelectDB

Alibaba Cloud SelectDB supports sub-second queries on massive datasets, tens of thousands of concurrent point queries, and high-throughput complex analysis. You can use Data Transmission Service (DTS) to synchronize data from SQL Server databases, such as ApsaraDB RDS for SQL Server, to Alibaba Cloud SelectDB for large-scale data analysis. This topic uses an ApsaraDB RDS for SQL Server instance as an example to illustrate the process.

Note

This feature is in beta and is available only to selected users.

Prerequisites

  • A target Alibaba Cloud SelectDB instance has been created. The storage space of the target instance is greater than the storage space that is used by the source RDS SQL Server instance. For more information, see Create an instance.

  • If the source instance meets any of the following conditions, we recommend splitting the process into multiple synchronization tasks.

    • The source instance has more than 10 databases.

    • Log backups are performed more than once per hour for a single database.

    • More than 100 Data Definition Language (DDL) operations are performed per hour on a single database.

    • The log generation rate for a single database exceeds 20 MB/s.

    • You must enable change data capture (CDC) on more than 1,000 tables.
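The thresholds above can be collected into a quick pre-flight check. The following is a minimal Python sketch and is not part of DTS: the `SourceProfile` fields and the `reasons_to_split` helper are hypothetical names, and you would need to measure the input values on your own source instance.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SourceProfile:
    """Hypothetical summary of a source instance; you supply the measurements."""
    database_count: int
    log_backups_per_hour: float   # highest value for any single database
    ddl_ops_per_hour: float       # highest value for any single database
    log_rate_mb_per_s: float      # highest value for any single database
    cdc_table_count: int          # tables that need CDC enabled


def reasons_to_split(p: SourceProfile) -> List[str]:
    """Return the conditions from the list above that this profile meets."""
    reasons = []
    if p.database_count > 10:
        reasons.append("more than 10 databases")
    if p.log_backups_per_hour > 1:
        reasons.append("log backups more than once per hour")
    if p.ddl_ops_per_hour > 100:
        reasons.append("more than 100 DDL operations per hour")
    if p.log_rate_mb_per_s > 20:
        reasons.append("log generation rate above 20 MB/s")
    if p.cdc_table_count > 1000:
        reasons.append("CDC required on more than 1,000 tables")
    return reasons
```

An empty result means none of the conditions apply. Any entry suggests splitting the work into multiple synchronization tasks.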

Limitations

Note

During schema synchronization, DTS does not synchronize foreign keys from the source database to the target database.

Type

Description

Source database limitations

  • Tables for synchronization must have a primary key or unique constraint. Otherwise, duplicate data may occur in the destination database.

  • If a data synchronization task synchronizes more than 5,000 tables at the table level and requires object edits, such as mapping column names, split the tables into multiple tasks.

  • A single data synchronization task supports a maximum of 10 databases. If you exceed this limit, you risk stability and performance issues. In this case, split the tables into multiple tasks.

  • If you configure a task to synchronize specific objects instead of an entire database, you cannot synchronize objects that have the same table name but different schema names to the same destination database within that task.

  • Data Transmission Service (DTS) uses the fn_log function to obtain logs from the source database. This function has performance bottlenecks. Do not clean up source database logs prematurely. Otherwise, the task may fail.

  • Transaction logs:

    • Enable transaction logs, set the recovery model to Full, and make sure that a full physical backup has been completed.

    • For an incremental data synchronization task, DTS requires that transaction logs in the source database are retained for at least 24 hours. For a task that includes both full data synchronization and incremental data synchronization, transaction logs must be retained for at least 7 days. After the full data synchronization is complete, you can change the retention period to 24 hours or more. If DTS cannot obtain the transaction logs because the retention period is shorter than required, the task may fail. In extreme cases, this can lead to data inconsistency or loss. Issues that arise from a log retention period shorter than the DTS requirement are not covered by the service level agreement (SLA).

  • To enable Change Data Capture (CDC) for tables in the source database, you must meet the following conditions. Otherwise, the precheck fails.

    • The value of the srvname field in the sys.sysservers view must be the same as the return value of the SERVERPROPERTY function.

    • If the source database is a self-managed SQL Server, the database owner must be sa. If the source database is an ApsaraDB RDS for SQL Server instance, the database owner must be sqlsa.

    • If the source database is Enterprise Edition, it must be SQL Server 2008 or later.

    • If the source database is Standard Edition, it must be SQL Server 2016 SP1 or later.

    • If the source database is SQL Server 2017 (Standard or Enterprise Edition), upgrade it to a later version.

  • If the source database is a read-only instance, DDL operations are not synchronized.

  • If the source database is an Azure SQL Database, a single synchronization instance can synchronize data from only one database.

  • If the source database is an ApsaraDB RDS for SQL Server instance, disable the Transparent Data Encryption (TDE) feature to ensure the stability of the synchronization instance. For more information, see Disable TDE.

  • In hybrid log parsing mode, you cannot perform multiple operations to add or drop columns in quick succession (less than 10 minutes apart). For example, running the following SQL statements consecutively will cause the task to fail.

    ALTER TABLE test_table DROP COLUMN Flag;
    ALTER TABLE test_table ADD Remark nvarchar(50) not null default('');
  • If you use the sp_rename command to modify the names of objects, such as stored procedures, in the source database before the schema synchronization task runs, the task may produce unexpected results or fail.

    Note

    We recommend that you use the ALTER command to rename objects in the database.

  • Do not run DDL operations that change database or table schemas during schema synchronization or full synchronization. Otherwise, the synchronization task fails.

    Note

    During full synchronization, DTS queries the source database. This creates metadata locks that may block DDL operations on the source database.

  • If the source database is a Web Edition of ApsaraDB RDS for SQL Server, you must select Incremental Synchronization Based on Logs of Source Database (Heap tables are not supported) for SQL Server Incremental Synchronization Mode when you configure the task.

  • We recommend that you keep the READ_COMMITTED_SNAPSHOT transaction processing mode parameter for the source database enabled during a full data synchronization task to prevent shared locks from affecting data writes. Otherwise, issues such as data inconsistency and instance failures may occur. Any resulting exceptions are not covered by the DTS SLA.
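The transaction log retention rule in the list above (at least 24 hours for incremental-only tasks, at least 7 days when full data synchronization is included) can be written as a small check. This is a sketch under that reading of the rule. The function names are hypothetical, and the configured retention value is something you look up yourself.

```python
HOURS_PER_DAY = 24


def required_log_retention_hours(includes_full_sync: bool) -> int:
    """Minimum transaction log retention, in hours, as stated above."""
    # 7 days when the task includes full data synchronization, else 24 hours.
    return 7 * HOURS_PER_DAY if includes_full_sync else 24


def retention_is_sufficient(configured_hours: float, includes_full_sync: bool) -> bool:
    """True if the configured retention meets the stated minimum."""
    return configured_hours >= required_log_retention_hours(includes_full_sync)
```

For example, a 48-hour retention period would pass for an incremental-only task but not for a task that also runs full data synchronization.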

Other limitations

  • DTS currently supports synchronizing data only to Unique engine tables in a SelectDB instance. Make sure that all unique keys in the target table also exist in the source table and are included in the synchronization object. Otherwise, data inconsistency may occur.

  • Requirements for synchronization objects:

    • Schema initialization is supported for database, schema, and table objects.

    • DTS does not support synchronizing the following data types: CURSOR, ROWVERSION, SQL_VARIANT, HIERARCHYID, POLYGON, GEOMETRY, GEOGRAPHY, and user-defined types created by using the CREATE TYPE statement.

    • DTS does not support synchronizing the following object types: PROCEDURE, FUNCTION, TRIGGER, DATATYPE, SYNONYM, CATALOG, PLAN_GUIDE, or SEQUENCE.

  • If you select Incremental Synchronization Based on Logs of Source Database (Heap tables are not supported) for SQL Server Incremental Synchronization Mode in the Configure Objects step, the synchronized tables must have a clustered index that includes the primary key columns. Heap tables, tables without primary keys, compressed tables, tables with computed columns, and tables with sparse columns are not supported. These limitations do not apply in hybrid log parsing mode.

  • If you select Log-based Parsing for Non-heap Tables and CDC-based Incremental Synchronization for Heap Tables (Hybrid Log-based Parsing) for SQL Server Incremental Synchronization Mode in the Configure Objects step, the following limitations also apply:

    • Incremental data synchronization in DTS depends on the CDC component. Ensure that the CDC jobs in the source database are running correctly. Otherwise, the DTS task fails.

    • By default, the CDC component stores incremental data for 3 days. You can use the exec console.sys.sp_cdc_change_job @job_type = 'cleanup', @retention= <time>; command to adjust the retention period as needed.

      Note
      • <time> specifies the time in minutes.

      • If a single table in the source database generates more than 10 million incremental change SQL statements per day on average, we recommend that you set <time> to 1440.

    • The DTS precheck for an incremental data synchronization task enables CDC on the source database and tables. Due to limitations in the SQL Server database engine, this process can cause brief locking on the source database.

    • In a single data synchronization task, enable CDC on no more than 1,000 tables. Exceeding this limit may cause task latency or instability.

  • If you select Polling and querying CDC instances for incremental synchronization for SQL Server Incremental Synchronization Mode in the Configure Objects step, the following limitations also apply:

    • The source database account used by the DTS instance must have permissions to enable CDC. An account with the sysadmin role is required to enable database-level CDC. A high-privilege account is required to enable table-level CDC.

      Note
      • The highest-privilege account (server administrator) provided in the Azure SQL Database console meets this requirement. For databases using the vCore-based purchasing model, all service tiers support CDC. For databases using the DTU-based purchasing model, the service tier must be S3 or higher to support CDC.

      • The high-privilege account for Amazon RDS for SQL Server meets this requirement and supports enabling database-level CDC by using stored procedures.

      • CDC is not supported for tables with clustered columnstore indexes.

      • The DTS precheck for an incremental data synchronization task enables CDC on the source database and tables. Due to limitations in the SQL Server database engine, this process can cause brief locking on the source database.

    • DTS polls the CDC instance for each table in the source database to obtain incremental data. Therefore, synchronize no more than 1,000 tables. Otherwise, the task may experience latency or instability.

    • Incremental data stored by the CDC component is retained for 3 days by default. We recommend that you run the exec console.sys.sp_cdc_change_job @job_type = 'cleanup', @retention= <time>; command to set a proper retention period.

      Note
      • <time> specifies the time in minutes.

      • If the average daily number of incremental change SQL statements for a single table in the source database exceeds 10 million, we recommend setting <time> to 1440.

    • Performing consecutive DDL operations to add or drop columns (more than two such operations within one minute) is not supported and may cause the task to fail.

    • Modifying the CDC instance in the source database is not supported and may cause task failure or data loss.

  • Complex DDL operations are not supported for DDL synchronization.

  • To ensure accurate latency monitoring for incremental data synchronization, DTS creates specific objects in the source database based on the selected mode. In log parsing mode, DTS creates the dts_cdc_sync_ddl trigger, the dts_sync_progress heartbeat table, and the dts_cdc_ddl_history DDL history table. In hybrid mode, DTS creates the dts_cdc_sync_ddl trigger, the dts_sync_progress heartbeat table, and the dts_cdc_ddl_history DDL history table, and also enables CDC at the database level and for some tables. The rate of data changes on tables with CDC enabled should not exceed 1,000 RPS.

  • When you set parameters in the Selected Objects section, you can currently set only the bucket_count (number of buckets) parameter.

    Note

    The value of bucket_count must be a positive integer. The default value is auto.

  • During data synchronization, do not create a new cluster in the target SelectDB instance. Otherwise, the task fails. You can try to restart the synchronization instance to recover the task.

  • SelectDB supports only database and table names that start with a letter. If the name of a database or table to be synchronized does not start with a letter, you must use the mapping feature to change it.

  • If the name of a synchronization object (database, table, or column) contains Chinese characters, you must use the mapping feature to change it (for example, to an English name). Otherwise, the task may fail.

  • DTS does not support DDL operations that modify multiple columns at once or perform consecutive modifications on the same table.

  • During data synchronization, do not add BE (Backend) nodes to the SelectDB database. Otherwise, the task fails. You can try to restart the synchronization instance to recover the task.

  • Before you start a data synchronization task, evaluate the performance of the source and destination databases. Also, run the task during off-peak hours. Full data initialization consumes read and write resources on both the source and destination databases, which can increase the database load.

  • Full data initialization performs concurrent INSERT operations, which can cause fragmentation in the tables of the destination database. As a result, the table space in the destination instance may be larger than in the source instance after initialization.

  • During DTS synchronization, do not allow any data writes to the target database from sources other than DTS. This can cause data inconsistency between the source and target databases.

  • Do not rebuild indexes during synchronization. This can cause task failure or even data loss.

    Note

    You cannot perform DDL operations related to the primary key on a table for which CDC is enabled.

  • If the number of tables with CDC enabled in a single data synchronization task exceeds the value of the "The maximum number of tables for which CDC is enabled that DTS supports" parameter, the precheck fails.

  • If you need to write more than 64 KB of data to a single field in a CDC-enabled table, you must use the exec sp_configure 'max text repl size', -1; command to adjust the configuration of the source database in advance.

    Note

    By default, the maximum size of a single field that a CDC job can process is 64 KB.

  • If a DDL write to the target database fails, the DTS task continues to run. You need to check the task log for the failed DDL statements. For more information about how to view task logs, see View Task Logs.

  • If you use the Modify Objects to be Synchronized feature, you cannot remove databases from the task.

  • Multiple synchronization instances that use the same SQL Server database as a source have independent incremental data capture modules.

  • During incremental synchronization, DTS uses a batch-and-flush strategy, writing data at most every 5 seconds by default. In normal cases, synchronization latency is usually within 10 seconds. You can adjust this by modifying the selectdb.reservoir.timeout.milliseconds parameter. The value can range from 1,000 to 10,000 milliseconds.

  • If a task fails, DTS support staff will attempt to restore it within eight hours. During restoration, they may restart the task or adjust its parameters.

    Note

    Only DTS task parameters are modified, not database parameters. Parameters that may be adjusted include those listed in Modify instance parameters.

  • SQL Server is a commercial, closed-source database. Due to known or unknown characteristics of its log format, issues can arise during incremental change data capture (CDC) and parsing by DTS. Before using DTS for incremental synchronization or migration from a SQL Server source in a production environment, perform comprehensive proof-of-concept (POC) testing. Your tests should cover all business change scenarios, schema modifications, and peak-load stress tests. Ensuring that your production business logic is identical to what was tested in the POC phase is critical for the stable and efficient operation of DTS.

  • During incremental synchronization, partial transaction rollbacks on the source are not supported. Rollback operations may be lost.
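Two of the naming rules in the list above (database and table names must start with a letter, and names that contain Chinese characters must be mapped) can be screened before you configure the task. The sketch below is illustrative only. The `needs_mapping` helper is hypothetical, "letter" is assumed to mean an ASCII letter, and Chinese characters are approximated by the CJK Unified Ideographs block.

```python
import re

# Assumption: "starts with a letter" means an ASCII letter A-Z or a-z.
_STARTS_WITH_LETTER = re.compile(r"^[A-Za-z]")
# Assumption: Chinese characters are approximated by U+4E00 to U+9FFF.
_CHINESE = re.compile(r"[\u4e00-\u9fff]")


def needs_mapping(name: str, kind: str) -> bool:
    """Return True if the object name should be changed with the mapping feature.

    kind is 'database', 'table', or 'column'.
    """
    # Rule: names of databases, tables, or columns with Chinese characters.
    if _CHINESE.search(name):
        return True
    # Rule: database and table names must start with a letter.
    if kind in ("database", "table") and not _STARTS_WITH_LETTER.match(name):
        return True
    return False
```

Running this over the objects you plan to select gives a list of names to rename in the Selected Objects pane.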

Special case

When the source instance is an ApsaraDB RDS for SQL Server, DTS creates an rdsdt_dtsacct account in the source instance for data synchronization. Do not delete this account or change its password while the task is running, or the task may fail. For more information, see System account descriptions.

Billing

Synchronization type

Pricing

Schema synchronization and full data synchronization

Free of charge.

Incremental data synchronization

Charged. For more information, see Billing overview.

Supported SQL for incremental synchronization

Operation type

SQL statement

DML

INSERT, UPDATE, DELETE

Note

DTS does not synchronize UPDATE statements that only update a large field.

DDL

ALTER TABLE supports only ADD COLUMN and DROP COLUMN.

Database account permissions

Database

Permissions

Actions

Source RDS SQL Server

Owner permissions on the source objects

Create standard, high-privilege, and global read-only accounts and Modify the permissions of an account

Target SelectDB

Cluster access permissions (Usage_priv) and database permissions (Select_priv, Load_priv, Alter_priv, Create_priv, and Drop_priv)

Manage cluster permissions and Manage basic permissions

Procedure

  1. Go to the data synchronization task list page in the destination region. You can do this in one of two ways.

    DTS console

    1. Log on to the DTS console.

    2. In the navigation pane on the left, click Data Synchronization.

    3. In the upper-left corner of the page, select the region where the synchronization instance is located.

    DMS console

    Note

    The actual steps may vary depending on the mode and layout of the DMS console. For more information, see Simple mode console and Customize DMS console layout and style.

    1. Log on to the DMS console.

    2. In the top menu bar, choose Data + AI > DTS (DTS) > Data Synchronization.

    3. To the right of Data Synchronization Tasks, select the region of the synchronization instance.

  2. Click Create Task to navigate to the task configuration page.

  3. Configure the source and destination databases.

    Category

    Parameter

    Description

    None

    Task Name

    DTS automatically generates a task name. We recommend that you specify a descriptive name for easy identification. The name does not need to be unique.

    Source Database

    Select Existing Connection

    • Select a database instance that is registered with DTS from the drop-down list. The database information below is then filled in automatically.

      Note

      In the DMS console, this configuration item is Select a DMS database instance.

    • If you have not registered the database instance or do not need to use a registered instance, manually configure the database information below.

    Database Type

    Select SQL Server.

    Access Method

    Select Alibaba Cloud Instance.

    Instance Region

    Select the region where the source ApsaraDB RDS for SQL Server instance resides.

    Replicate Data Across Alibaba Cloud Accounts

    This example demonstrates synchronization within the same Alibaba Cloud account, so select No.

    Instance ID

    Select the instance ID of the source ApsaraDB RDS for SQL Server instance.

    Database Account

    Enter the database account of the source ApsaraDB RDS for SQL Server instance. For permission requirements, see Required permissions for database accounts.

    Database Password

    Enter the password for the specified database account.

    Encryption

    Select Non-encrypted or SSL-encrypted.

    • If SSL encryption is not enabled for the source database, select Non-encrypted.

    • If SSL encryption is enabled for the source database, select SSL-encrypted. DTS trusts the server certificate by default.

    Destination Database

    Select Existing Connection

    • Select a database instance that is registered with DTS from the drop-down list. The database information below is then filled in automatically.

      Note

      In the DMS console, this configuration item is Select a DMS database instance.

    • If you have not registered the database instance or do not need to use a registered instance, manually configure the database information below.

    Database Type

    Select SelectDB.

    Access Method

    Select Alibaba Cloud Instance.

    Instance Region

    Select the region where the target SelectDB instance resides.

    Replicate Data Across Alibaba Cloud Accounts

    This example demonstrates synchronization within the same Alibaba Cloud account, so select No.

    Instance ID

    Select the ID of the target SelectDB instance.

    Database Account

    Enter the database account of the target SelectDB instance. For permission requirements, see Required permissions for database accounts.

    Database Password

    Enter the password for the specified database account.

  4. After completing the configuration, click Test Connectivity and Proceed at the bottom of the page.

    Note
    • Ensure that you add the CIDR blocks of the DTS servers (either automatically or manually) to the security settings of both the source and destination databases to allow access. For more information, see Add the IP address whitelist of DTS servers.

    • If the source or destination is a self-managed database (i.e., the Access Method is not Alibaba Cloud Instance), you must also click Test Connectivity in the CIDR Blocks of DTS Servers dialog box.

  5. Configure the task objects.

    1. On the Configure Objects page, specify the objects to synchronize.

      Parameter

      Description

      Synchronization Types

      DTS always selects Incremental Data Synchronization. By default, you must also select Schema Synchronization and Full Data Synchronization. After the precheck, DTS initializes the destination cluster with the full data of the selected source objects, which serves as the baseline for subsequent incremental synchronization.

      Important

      Data types are converted when data is synchronized from SQL Server to SelectDB. If you do not select Schema Synchronization, you must create a Unique model table with the corresponding structure in the target SelectDB instance in advance. For more information, see Data type mappings.

      Schema Mapping Mode of Source and Destination Databases

      SQL Server uses a three-level hierarchy (database > schema > table), while SelectDB uses a two-level hierarchy (database > table). Therefore, you must map the source and target schemas. This task supports only the Do not use source schema name mode. In this mode, the source database name is used as the target database name, and the source table name is used as the target table name.

      Important

      In this mode, tables with the same name are not allowed in different schemas of the source database. Otherwise, the DTS task may fail or data inconsistency may occur.

      SQL Server Incremental Synchronization Mode

      • Log-based Parsing for Non-heap Tables and CDC-based Incremental Synchronization for Heap Tables (Hybrid Log-based Parsing):

        • Advantages:

          • Supports scenarios with source database heap tables, tables without primary keys, compressed tables, and tables with computed columns.

          • High link stability. This mode can obtain complete DDL statements and supports a wide range of DDL scenarios.

        • Disadvantages:

          • DTS creates the trigger `dts_cdc_sync_ddl`, the heartbeat table `dts_sync_progress`, and the DDL storage table `dts_cdc_ddl_history` in the source database. It also enables database-level Change Data Capture (CDC) and partial table CDC.

          • You cannot execute SELECT INTO, TRUNCATE, and RENAME COLUMN statements on tables for which CDC is enabled in the source database. Triggers created by DTS in the source database cannot be manually deleted.

      • Incremental Synchronization Based on Logs of Source Database (Heap tables are not supported):

        • Advantages:

          This mode is non-intrusive to the source database.

        • Disadvantages:

          Does not support scenarios with source database heap tables, tables without primary keys, compressed tables, or tables with computed columns.

      • Polling and querying CDC instances for incremental synchronization:

        • Advantages:

          • Supports full and incremental synchronization when the source database is Amazon RDS for SQL Server, Azure SQL Database, Azure SQL Managed Instance, Azure SQL Server on Virtual Machine, or Google Cloud SQL for SQL Server.

          • Uses the native SQL Server CDC component to obtain incremental data, which makes incremental synchronization more stable and uses less network bandwidth.

        • Disadvantages:

          • The source database account used by the DTS instance must have the permission to enable CDC. Incremental data synchronization has a latency of about 10 seconds.

          • In scenarios involving synchronization of multiple databases and tables, there may be risks of stability and performance issues.

      The maximum number of tables for which CDC is enabled that DTS supports.

      This setting limits the number of tables on which CDC can be enabled within the current synchronization instance. The default value is 1,000.

      Note

      This parameter is not available if you set SQL Server Incremental Synchronization Mode to Incremental Synchronization Based on Logs of Source Database (Heap tables are not supported).

      Processing Mode of Conflicting Tables

      • Precheck and Report Errors: Checks if tables with the same name exist in the target database. If no tables with the same name exist, the check passes. If tables with the same name exist, an error is reported during the precheck phase, and the data synchronization task does not start.

        Note

        If you cannot easily delete or rename the conflicting table in the target database, you can map the source table to a different table name in the target database. For more information, see Map table and column names.

      • Ignore Errors and Proceed: Skips the check for tables with the same name in the target database.

        Warning

        Selecting Ignore Errors and Proceed may cause data inconsistency and pose risks to your business. For example:

        • If table structures are consistent, a source record with a primary key or unique key that already exists in the target database overwrites the target record.

        • If the table structures are inconsistent, data initialization may fail, only partial columns may be synchronized, or the synchronization may fail. Use this option with caution.

      Capitalization of Object Names in Destination Instance

      Configure the case-sensitivity policy for database, table, and column names in the destination instance. By default, the DTS default policy is selected. You can also choose to use the default policy of the source or destination database. For more information, see Case policy for destination object names.

      Source Objects

      In the Source Objects box, click the objects, and then click the right-arrow icon to move them to the Selected Objects box.

      Note

      You can select objects at the database or table level.

      Selected Objects

      • To change the name of a synchronization object in the target instance, right-click the object in the Selected Objects pane. For instructions, see Map table and column names.

      • If you selected Schema Synchronization for Synchronization Types, selected objects at the table level, and need to set the number of buckets (the bucket_count parameter), right-click the table in the Selected Objects pane. In the Parameter Settings area, set Enable Parameter Settings to Yes, specify a Value, and then click OK.

      Note
      • To select the SQL operations for incremental synchronization at the database or table level, right-click the object in the Selected Objects pane and select the desired SQL operations in the dialog box that appears.

      • To filter data by using WHERE conditions, right-click the table in the Selected Objects pane and set the filter conditions in the dialog box that appears. For instructions, see Set filter conditions.

      • If you use the object name mapping feature, other objects that depend on the mapped object may fail to synchronize.

    2. Click Next: Advanced Settings.

      Parameter

      Description

      Dedicated Cluster for Task Scheduling

      By default, DTS uses a shared cluster for tasks, so you do not need to make a selection. For greater task stability, you can purchase a dedicated cluster to run the DTS synchronization task. For more information, see What is a DTS dedicated cluster?.

      Retry Time for Failed Connections

      If the connection to the source or destination database fails after the synchronization task starts, DTS reports an error and immediately begins to retry the connection. The default retry duration is 720 minutes. You can customize the retry time to a value from 10 to 1,440 minutes. We recommend a duration of 30 minutes or more. If the connection is restored within this period, the task resumes automatically. Otherwise, the task fails.

      Note
      • If multiple DTS instances share a source or destination database, DTS applies the shortest configured retry duration to all of them. For example, if Instance A is set to 30 minutes and Instance B is set to 60 minutes, 30 minutes is used for both.

      • DTS charges for task runtime during connection retries. Set a custom duration based on your business needs, or release the DTS instance promptly after you release the source/destination instances.

      Retry Time for Other Issues

      If a non-connection issue (e.g., a DDL or DML execution error) occurs, DTS reports an error and immediately retries the operation. The default retry duration is 10 minutes. You can also customize the retry time to a value from 1 to 1,440 minutes. We recommend a duration of 10 minutes or more. If the related operations succeed within the set retry time, the synchronization task automatically resumes. Otherwise, the task fails.

      Important

      The value of Retry Time for Other Issues must be less than that of Retry Time for Failed Connections.

      Enable Throttling for Full Data Synchronization

      During full data synchronization, DTS consumes read and write resources from the source and destination databases, which can increase their load. To mitigate pressure on the destination database, you can limit the migration rate by setting Queries per second (QPS) to the source database, RPS of Full Data Migration, and Data migration speed for full migration (MB/s).

      Enable Throttling for Incremental Data Synchronization

      You can also limit the incremental synchronization rate to reduce pressure on the destination database by setting RPS of Incremental Data Synchronization and Data synchronization speed for incremental synchronization (MB/s).

      Environment Tag

      You can select an environment tag to identify the instance. This parameter is optional.

      Configure ETL

      Choose whether to enable the extract, transform, and load (ETL) feature. For more information, see What is ETL? Valid values:

      Monitoring and Alerting

      Choose whether to set up alerts. If the synchronization fails or the latency exceeds the specified threshold, DTS sends a notification to the alert contacts.
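The retry settings described in the table above have three constraints: Retry Time for Failed Connections accepts 10 to 1,440 minutes, Retry Time for Other Issues accepts 1 to 1,440 minutes, and the second value must be less than the first. The following sketch checks a pair of values against those constraints and models the shortest-duration rule for instances that share a database. Both function names are hypothetical and nothing here calls a DTS API.

```python
from typing import List


def validate_retry_settings(connection_minutes: int, other_minutes: int) -> List[str]:
    """Return a list of violated constraints; an empty list means the pair is valid."""
    errors = []
    if not 10 <= connection_minutes <= 1440:
        errors.append("connection retry time must be 10 to 1440 minutes")
    if not 1 <= other_minutes <= 1440:
        errors.append("other-issue retry time must be 1 to 1440 minutes")
    if other_minutes >= connection_minutes:
        errors.append("other-issue retry time must be less than connection retry time")
    return errors


def effective_connection_retry(minutes_per_instance: List[int]) -> int:
    """Instances sharing a source or destination use the shortest configured value."""
    return min(minutes_per_instance)
```

The defaults stated above (720 minutes and 10 minutes) pass all three checks.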

    3. Optional: After completing the preceding configurations, click Next: Configure Database and Table Fields to set the Primary Key Column, Distribution Key, and Engine for the tables to be synchronized in the target. For this synchronization link, the only supported Engine is unique.

      Note
      • This step is available only if you selected Schema Synchronization for Synchronization Types when you configured the task objects. You can set Definition Status to All to make modifications.

      • The Primary Key Column can be a composite key that consists of multiple columns, and you must select one or more columns from the Primary Key Column to serve as the Distribution Key.

  6. Save the task and perform a precheck.

    • To view the parameters for configuring this instance via an API operation, hover over the Next: Save Task Settings and Precheck button and click Preview OpenAPI parameters in the tooltip.

    • If you have finished viewing the API parameters, click Next: Save Task Settings and Precheck at the bottom of the page.

    Note
    • Before a synchronization task starts, DTS performs a precheck. You can start the task only if the precheck passes.

    • If the precheck fails, click View Details next to the failed item, fix the issue as prompted, and then rerun the precheck.

    • If the precheck generates warnings:

      • For non-ignorable warnings, click View Details next to the item, fix the issue as prompted, and run the precheck again.

      • For ignorable warnings, you can bypass them by clicking Confirm Alert Details, then Ignore, and then OK. Finally, click Precheck Again to skip the warning and run the precheck again. Ignoring precheck warnings may lead to data inconsistencies and other business risks. Proceed with caution.

  7. Purchase the instance.

    1. When the Success Rate reaches 100%, click Next: Purchase Instance.

    2. On the Purchase page, select the billing method and link specifications for the data synchronization instance. For more information, see the following table.

      Category

      Parameter

      Description

      New Instance Class

      Billing Method

      • Subscription: You pay upfront for a specific duration. This is cost-effective for long-term, continuous tasks.

      • Pay-as-you-go: You are billed hourly for actual usage. This is ideal for short-term or test tasks, as you can release the instance at any time to save costs.

      Resource Group Settings

      The resource group to which the instance belongs. The default is the default resource group. For more information, see What is Resource Management?

      Instance Class

      DTS offers synchronization specifications at different performance levels that affect the synchronization rate. Select a specification based on your business requirements. For more information, see Data synchronization link specifications.

      Subscription Duration

      In subscription mode, select the duration and quantity of the instance. Monthly options range from 1 to 9 months. Yearly options include 1, 2, 3, or 5 years.

      Note

      This option appears only when the billing method is Subscription.

    3. Read and select the checkbox for Data Transmission Service (Pay-as-you-go) Service Terms.

    4. Click Buy and Start, and then click OK in the confirmation dialog box.

      You can monitor the task progress on the data synchronization page.
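The constraints stated in the configuration steps above can be collected into a small check that runs before you submit the task. The following is a minimal Python sketch and is not part of DTS: the function name `validate_task_settings` and its parameter names are hypothetical. It encodes only the rules quoted in this topic: Retry Time for Other Issues must be between 1 and 1,440 minutes and less than Retry Time for Failed Connections, and the Distribution Key must consist of one or more columns taken from the Primary Key Column.

```python
from typing import List, Sequence


def validate_task_settings(
    retry_failed_connections_min: int,
    retry_other_issues_min: int,
    primary_key_columns: Sequence[str],
    distribution_key_columns: Sequence[str],
) -> List[str]:
    """Return a list of problems; an empty list means no rule was violated.

    All names here are hypothetical and chosen for illustration only.
    """
    problems: List[str] = []

    # Retry Time for Other Issues accepts a value from 1 to 1,440 minutes.
    if not 1 <= retry_other_issues_min <= 1440:
        problems.append(
            "Retry Time for Other Issues must be between 1 and 1440 minutes."
        )

    # It must also be less than Retry Time for Failed Connections.
    if retry_other_issues_min >= retry_failed_connections_min:
        problems.append(
            "Retry Time for Other Issues must be less than "
            "Retry Time for Failed Connections."
        )

    # The Distribution Key needs one or more columns...
    if not distribution_key_columns:
        problems.append("Select at least one Distribution Key column.")

    # ...and every one of them must come from the Primary Key Column.
    extra = [c for c in distribution_key_columns if c not in primary_key_columns]
    if extra:
        problems.append(
            "Distribution Key columns not in the Primary Key Column: "
            + ", ".join(extra)
        )

    return problems


# Example: a composite primary key with one of its columns as the distribution key.
print(validate_task_settings(720, 10, ["order_id", "region"], ["order_id"]))  # []
```

Returning a list of messages instead of raising on the first failure lets you report every misconfigured field at once, which is similar in spirit to how the precheck lists all failed items.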

Data type mapping

| Category | SQL Server type | SelectDB type |
| --- | --- | --- |
| Numeric | BIT | INT |
| | TINYINT | SMALLINT |
| | SMALLINT | SMALLINT |
| | INT | INT |
| | BIGINT | BIGINT |
| | DECIMAL, NUMERIC | DECIMAL |
| | FLOAT | DOUBLE |
| | REAL | DOUBLE |
| | MONEY | STRING |
| | SMALLMONEY | STRING |
| Date and Time | DATE | DATEV2 |
| | DATETIME | DATETIMEV2 |
| | DATETIME2 | DATETIMEV2 |
| | SMALLDATETIME | DATETIMEV2 |
| | TIME | STRING |
| | DATETIMEOFFSET | DATETIMEV2 |
| | TIMESTAMP | STRING |
| String | CHAR, VARCHAR | VARCHAR(4*N) |
| | NCHAR, NVARCHAR | VARCHAR(4*N) |
| | VARCHAR(MAX), NVARCHAR(MAX), TEXT, NTEXT | STRING |
| | BINARY, VARBINARY, IMAGE | STRING |
| | XML | STRING |
| | UNIQUEIDENTIFIER | STRING |
| Other | GEOGRAPHY | STRING |
| | GEOMETRY | STRING |
| | SQL_VARIANT, HIERARCHYID, SYSNAME | STRING |

Important

When migrating to SelectDB, CHAR, VARCHAR(N), NCHAR, and NVARCHAR(N) convert to VARCHAR(4*N).

  • If no data length is specified, the default is VARCHAR(65533).

  • If the data length exceeds 65533, the type converts to STRING.
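
The mapping above can be expressed as a lookup plus one rule for sized character types. The following is a minimal Python sketch for checking expected target types by hand; it is not DTS code, and the function name `map_sqlserver_type` is hypothetical. Two points are assumptions rather than statements from this topic: the 65533 limit is compared against the converted length 4*N, and precision or scale arguments such as those in DECIMAL(10, 2) are dropped because the table does not say how they are carried over.

```python
import re

MAX_VARCHAR_LEN = 65533

# Fixed mappings copied from the data type mapping table.
FIXED_TYPES = {
    "BIT": "INT", "TINYINT": "SMALLINT", "SMALLINT": "SMALLINT",
    "INT": "INT", "BIGINT": "BIGINT",
    "DECIMAL": "DECIMAL", "NUMERIC": "DECIMAL",
    "FLOAT": "DOUBLE", "REAL": "DOUBLE",
    "MONEY": "STRING", "SMALLMONEY": "STRING",
    "DATE": "DATEV2", "DATETIME": "DATETIMEV2", "DATETIME2": "DATETIMEV2",
    "SMALLDATETIME": "DATETIMEV2", "TIME": "STRING",
    "DATETIMEOFFSET": "DATETIMEV2", "TIMESTAMP": "STRING",
    "TEXT": "STRING", "NTEXT": "STRING",
    "BINARY": "STRING", "VARBINARY": "STRING", "IMAGE": "STRING",
    "XML": "STRING", "UNIQUEIDENTIFIER": "STRING",
    "GEOGRAPHY": "STRING", "GEOMETRY": "STRING",
    "SQL_VARIANT": "STRING", "HIERARCHYID": "STRING", "SYSNAME": "STRING",
}

SIZED_STRING_TYPES = {"CHAR", "VARCHAR", "NCHAR", "NVARCHAR"}

# Splits "NVARCHAR(50)" into the type name and the text inside the parentheses.
_DECLARATION = re.compile(r"^\s*(\w+)\s*(?:\(\s*(.*?)\s*\))?\s*$")


def map_sqlserver_type(declaration: str) -> str:
    """Return the SelectDB type that the table lists for a SQL Server type."""
    match = _DECLARATION.match(declaration)
    if match is None:
        raise ValueError(f"Cannot parse type declaration: {declaration!r}")
    name = match.group(1).upper()
    arg = match.group(2)

    if name in SIZED_STRING_TYPES:
        if not arg:
            # No data length specified: the default is VARCHAR(65533).
            return f"VARCHAR({MAX_VARCHAR_LEN})"
        if arg.upper() == "MAX":
            # VARCHAR(MAX) and NVARCHAR(MAX) map to STRING.
            return "STRING"
        target_len = 4 * int(arg)
        # Assumption: the limit applies to the converted length 4*N.
        if target_len > MAX_VARCHAR_LEN:
            return "STRING"
        return f"VARCHAR({target_len})"

    if name in FIXED_TYPES:
        # Assumption: any precision/scale argument is ignored here.
        return FIXED_TYPES[name]

    raise ValueError(f"Type not listed in the mapping table: {name}")


print(map_sqlserver_type("varchar(50)"))    # VARCHAR(200)
print(map_sqlserver_type("NVARCHAR(MAX)"))  # STRING
print(map_sqlserver_type("datetime2"))      # DATETIMEV2
```

Types that are not in the table raise `ValueError` rather than falling back to a guess, so an unexpected source column type is surfaced instead of being silently mapped.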