This topic explains how to use the data integration and development features of DataWorks to migrate data from MaxCompute to Tablestore.
Procedure
-
Activate services
-
Create a table in DataWorks
-
Log on to the DataWorks console and select a region in the upper-left corner.
-
In the left navigation pane, click Workspace.
-
On the Workspaces page, in the Actions column for the target workspace, choose .
-
On the Data Studio page, create a MaxCompute SQL node. Run the following statement to create a table named
transs.CREATE TABLE transs (name STRING, id STRING, gender STRING);
-
-
Import data into the
transstable-
Download the sample CSV file to your computer: demo_data.csv.
-
In the left navigation pane, choose .
-
Click Go to Data Upload and Download.
-
In the left navigation bar, click the upload icon
, and click Upload Data.Upload the local data to the table named
transsin MaxCompute.
-
-
Create a table in the Tablestore console
-
Log on to the Tablestore console and create an instance.
-
Create a data table named trans. For more information, see Create a data table.
Set name and id as the primary keys.
-
-
Add a MaxCompute data source in DataWorks
-
Log on to the DataWorks console and select a region in the upper-left corner.
-
In the left navigation pane, click Workspace.
-
On the Workspaces page, click the name of the target workspace.
-
On the Workspace Details page, click Data Sources in the left navigation pane.
-
On the Data Source tab, click Add Data Source, and select MaxCompute.
-
On the Add MaxCompute Data Source page, fill in the Basic Information.
For more information, see Configure a MaxCompute data source.
-
-
Add a Tablestore data source. For more information, see Configure a Tablestore data source.
-
Configure MaxCompute Reader and Tablestore Writer
-
Log on to the DataWorks console and select a region in the upper-left corner.
-
In the left navigation pane, choose .
-
Select Workspace and click Go to Data Studio.
-
On the left side of Data Studio, click
, and select .-
For Source, select MaxCompute.
-
For Destination, select Tablestore.
-
-
On the node configuration page, configure the following parameters.
-
Data Source-Source: Select the MaxCompute data source that you added.
-
Data Source-Destination: Select the Tablestore data source that you added.
-
Source-Table: Select the MaxCompute table that you created.
-
Destination-Table: Select the Tablestore table that you created.
-
Runtime Resource Group: Select the exclusive resource group that you created.
-
Keep the default values for the other parameters.
Alternatively, click the
icon above the configuration area to switch to script mode. The following code provides an example:{ "type": "job", "steps": [ { "stepType": "odps", "parameter": { "partition": [], "datasource": "odps_first", "column": [ "name", "id", "gender" ], "table": "transs" }, "name": "Reader", "category": "reader" }, { "stepType": "ots", "parameter": { "datasource": "transs", "column": [ { "name": "gender", "type": "STRING" } ], "writeMode": "UpdateRow", "table": "trans", "primaryKey": [ { "name": "name", "type": "STRING" }, { "name": "id", "type": "STRING" } ] }, "name": "Writer", "category": "writer" } ], "version": "2.0", "order": { "hops": [ { "from": "Reader", "to": "Writer" } ] }, "setting": { "errorLimit": { "record": "0" }, "speed": { "throttle": false, "concurrent": 1, "dmu": 1 } } } -
-
-
Verify the data in the Tablestore console.
-
Log on to the Tablestore console. In the upper-left corner, select a region.
-
In the navigation pane on the left, select All Instances.
-
On the All Instances page, click the instance name to go to the Instance Management page.
-
On the Instance Management page, click the Instance Details tab.
-
On the Instance Details tab, in the Tables area, click the name of the data table that you want to view.
-
On the data table management page, click the Query Data tab to view the data in the table.
-