DataWorks real-time sync keeps a destination database consistent with the source through continuous single-table or full-database synchronization.
Core capabilities
Real-time sync capabilities:
|
Capability |
Description |
|
Multi-source data sync |
Combine different source and destination data sources into sync pipelines. Supported data sources and sync solutions. |
|
Complex network environments |
Supports Alibaba Cloud databases, on-premises databases, ECS self-managed databases, and databases from other cloud providers. Ensure network connectivity between your resource group and both the source and destination. Network connectivity solutions. |
|
Sync scenarios |
Two primary scenarios: single source table to single destination table, and incremental data from sharded databases and tables to a single destination table.
|
|
Task configuration |
Configure code-free, real-time ETL pipelines for single tables. Configure a real-time sync task for a single table. Single-table real-time sync:
|
|
Task O&M |
Monitor and alert on sync task status.
|
-
Real-time sync tasks cannot run from Data Studio. Save and submit the node, then run it in Operation Center in the production environment.
-
Real-time sync does not support synchronizing views.
Supported data sources
Source: Kafka, Hologres, Oracle, LogHub, and DataHub.
Destination: ApsaraDB for OceanBase, Data Lake Formation (DLF), Doris, Hologres, Kafka, MaxCompute, OSS, OSS-HDFS, StarRocks, Tablestore, and Lindorm.
Data processing: data filtering, string replacement, data masking, JSON parsing, and field editing and assignment.
Get started
Create your first task: Configure a real-time sync task for a single table.