Configure MaxCompute input widget

The MaxCompute input widget reads data from MaxCompute data sources. To sync data from MaxCompute to other data sources, configure the MaxCompute input widget as the source and specify the target data source.

Prerequisites

  • A MaxCompute data source is created. For more information, see Create a MaxCompute data source.

  • To configure MaxCompute input widget properties, your account must have read-through permission for the data source. If you do not have this permission, request it. For more information, see Request data source permission.

Procedure

  1. On the Dataphin home page, select Development > Data Integration from the top menu bar.

  2. In the top menu bar of the Data Integration page, select a project. In Dev-Prod mode, you must also select an environment.

  3. In the left-side navigation pane, click Batch Pipeline. Then, from the Batch Pipeline list, select the offline pipeline you want to develop to access its configuration page.

  4. Click Component Library in the upper-right corner of the page to open the Component Library panel.

  5. In the left-side navigation pane of the Component Library panel, select Input. In the input widget list on the right, find the MaxCompute component and drag it onto the canvas.

  6. Click the icon on the MaxCompute input widget card to open the MaxCompute Input Configuration dialog box.

  7. In the MaxCompute Input Configuration dialog box, configure the following parameters.

    Parameter

    Description

    Step Name

    The name of the MaxCompute input widget. Dataphin automatically generates a step name, which you can modify as needed. Naming rules:

    • Can only contain Chinese characters, letters, underscores (_), and numbers.

    • Cannot exceed 64 characters.

    Datasource

    Lists all MaxCompute-type data sources, including those with and without read-through permission. Click the copy icon to copy the data source name.

    Note

    When the compute engine is MaxCompute, you can select a logical table and switch to the logical table input widget for integration.

    For data sources without read-through permission, click the request button next to the data source to request permission. For more information, see Request data source permission.

    If no MaxCompute-type data source exists, click Create to create one. For more information, see Create a MaxCompute data source.

    Table

    Select the source table for data synchronization. You can search by keyword, or enter the exact table name and click Exact Search. After you select a table, the system automatically checks the table status. Click the copy icon to copy the selected table name.

    Partition

    If the selected table is a partitioned table, enter the partition information. Both static partitions and range partitions are supported. Static partition examples: ds=20230101 or ds1=2023,ds2=01. Range partition example: /*query*/ds >=20230101 and ds <= 20230107.

    When Partition Does Not Exist

    Select a policy to handle cases where the specified partition does not exist:

    • Set Task To Failed: Terminates the task and sets it to failed.

    • Set Task To Successful, No Data Written: The task runs successfully with no data written.

    • Use The Latest Non-empty Partition: Uses the latest non-empty partition (max_pt) of the table as the partition to be synchronized. If the table has no partitions that contain data, the task reports an error and is set to failed.

    Output Fields

    Displays all fields of the selected table. To exclude fields from downstream widgets, delete the corresponding fields:

    Note

    Field classification and grading is available only when the compute engine is MaxCompute.

    • Single field deletion: Click the delete icon in the Operation column to delete a single field.

    • Batch field deletion: Click Field Management, select multiple fields in the Field Management dialog box, click the left-shift icon to move them to the unselected list, and then click Confirm.

  8. Click Confirm to save the MaxCompute input widget configuration.
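
The Step Name rules, the Partition formats, and the When Partition Does Not Exist policies described in step 7 can be sketched in Python as follows. This is only an illustrative sketch and not how Dataphin implements these checks. All function and class names are hypothetical, and three points are assumptions: "Chinese characters" is taken to mean the range U+4E00 to U+9FFF, "letters" is taken to mean ASCII letters, and "latest" partition is taken to mean the largest value in string order.

```python
import re
from enum import Enum

# Assumption: Chinese characters are U+4E00..U+9FFF, letters are ASCII.
# An empty name is rejected here, which the rules above do not state.
STEP_NAME_RE = re.compile(r"[\u4e00-\u9fffA-Za-z0-9_]{1,64}")

# Marker that precedes a range partition filter in the example above.
RANGE_PREFIX = "/*query*/"


def is_valid_step_name(name: str) -> bool:
    """Return True if the name follows the Step Name naming rules."""
    return STEP_NAME_RE.fullmatch(name) is not None


def parse_partition(expr: str):
    """Classify a Partition value as ("static", dict) or ("range", filter)."""
    text = expr.strip()
    if text.startswith(RANGE_PREFIX):
        return "range", text[len(RANGE_PREFIX):].strip()
    spec = {}
    for part in text.split(","):
        key, sep, value = part.partition("=")
        key, value = key.strip(), value.strip()
        if not sep or not key.isidentifier() or not value:
            raise ValueError(f"not a key=value partition: {part!r}")
        spec[key] = value
    return "static", spec


class MissingPartitionPolicy(Enum):
    FAIL = "Set Task To Failed"
    SUCCEED_EMPTY = "Set Task To Successful, No Data Written"
    LATEST_NON_EMPTY = "Use The Latest Non-empty Partition"


def resolve_partition(requested, existing, non_empty, policy):
    """Return the partition to read, or None when no data should be written.

    Raises RuntimeError in the cases where the task would be set to failed.
    """
    if requested in existing:
        return requested
    if policy is MissingPartitionPolicy.FAIL:
        raise RuntimeError(f"partition {requested} does not exist")
    if policy is MissingPartitionPolicy.SUCCEED_EMPTY:
        return None
    if not non_empty:
        raise RuntimeError("table has no partition that contains data")
    # Assumption: "latest" is the largest partition value in string order.
    return max(non_empty)


if __name__ == "__main__":
    print(is_valid_step_name("maxcompute_input_1"))  # True
    print(parse_partition("ds1=2023,ds2=01"))
    # ('static', {'ds1': '2023', 'ds2': '01'})
    print(parse_partition("/*query*/ds >=20230101 and ds <= 20230107"))
    # ('range', 'ds >=20230101 and ds <= 20230107')
```

Checking these values before you open the configuration dialog box can catch a malformed step name or partition value early, but the dialog box itself remains the authority on what is accepted.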