Configure the Data Lake Formation Output Component

Last updated: 2026-03-14 04:36:33

The Data Lake Formation output component writes data to a Data Lake Formation data source. When you sync data from other data sources to a Data Lake Formation data source, you must configure the Data Lake Formation output component after you configure the source data. This topic describes how to configure the Data Lake Formation output component.

Prerequisites

Procedure

  1. In the top menu bar on the Dataphin homepage, choose Develop > Data Integration.

  2. In the top menu bar on the Integration page, select a Project. In Dev-Prod mode, also select an environment.

  3. In the left navigation pane, click Offline Integration. In the Offline Integration list, click the offline pipeline that you want to develop to open its configuration page.

  4. In the upper-right corner of the page, click Component Library to open the Component Library panel.

  5. In the navigation pane on the left of the Component Library panel, click Output. In the list of output components on the right, find the Data Lake Formation component and drag it onto the canvas.

  6. Drag the connection icon of the upstream component to the Data Lake Formation output component to connect the two.

  7. Click the configuration icon in the Data Lake Formation output component card to open the Data Lake Formation Output Configuration dialog box.

  8. In the Data Lake Formation Output Configuration dialog box, configure the following parameters.

    Parameter

    Description

    Basic Settings

    Step Name

    The name of the Data Lake Formation output component. Dataphin generates a step name automatically. You can change it based on your business scenario. Naming rules:

    • Use only Chinese characters, letters, underscores (_), and digits.

    • Keep the name no longer than 64 characters.
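The naming rules above can be checked before you enter a step name. The following is a minimal sketch, not Dataphin's own validation: the function name is_valid_step_name is hypothetical, and it assumes that "Chinese characters" means the basic CJK block (U+4E00 to U+9FFF), that "letters" means ASCII letters, and that the 64-character limit counts characters rather than bytes.

```python
import re

# Assumption: Chinese characters are U+4E00..U+9FFF and letters are ASCII only.
# Dataphin's actual check may accept a different character range.
_STEP_NAME_RE = re.compile(r"[\u4e00-\u9fffA-Za-z0-9_]{1,64}")


def is_valid_step_name(name: str) -> bool:
    """Return True if name uses only allowed characters and has 1 to 64 characters."""
    return _STEP_NAME_RE.fullmatch(name) is not None
```

Under these assumptions, a name such as dlf_output_1 passes, while dlf-output fails because the hyphen is not an allowed character.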

    Datasource

    The drop-down list shows all Data Lake Formation data sources, both those for which you have write-through permission and those for which you do not. Click the copy icon to copy the current data source name.

    Table

    Select the target table for output data. Enter a keyword to search for tables, or enter the exact table name and click Exact Search. After you select a table, the system checks its status automatically. Click the copy icon to copy the selected table name.

    Partition

    If the selected target table is a partitioned table, enter partition information, such as state_date='20190101'. You can also use parameters to enable daily incremental writes, such as state_date=${bizdate}.
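To illustrate how a parameterized partition expression turns into a concrete value on each run, here is a rough sketch. It is not how Dataphin resolves parameters internally. The function render_partition is hypothetical, and the sketch assumes that bizdate resolves to the day before the run date in yyyyMMdd format.

```python
from datetime import date, timedelta
from string import Template


def render_partition(expr: str, run_date: date) -> str:
    """Replace ${bizdate} in a partition expression with a concrete date string."""
    # Assumption: bizdate is the day before the run date, formatted as yyyyMMdd.
    bizdate = (run_date - timedelta(days=1)).strftime("%Y%m%d")
    # safe_substitute leaves any other, unknown placeholders untouched.
    return Template(expr).safe_substitute(bizdate=bizdate)
```

With this assumption, a run on 2019-01-02 renders state_date=${bizdate} as state_date=20190101, so each daily run writes to its own partition. A fixed expression such as state_date='20190101' is returned unchanged.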

    Loading Policy

    Select the policy for writing data to the target table. Loading Policy options:

    • Append Data: Write data directly to the target table or partition.

    • Overwrite Data: Overwrite data in the target table or partition.
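The difference between the two loading policies can be pictured with an in-memory model. This is only an illustration of the semantics described above, not the component's implementation: the load function, the policy strings "append" and "overwrite", and the dictionary that stands in for a partitioned table are all assumptions made for the example.

```python
from typing import Dict, List

Row = Dict[str, object]


def load(table: Dict[str, List[Row]], partition: str, rows: List[Row], policy: str) -> None:
    """Write rows into one partition of an in-memory stand-in for the target table."""
    if policy == "append":
        # Append Data: existing rows stay, new rows are added after them.
        table.setdefault(partition, []).extend(rows)
    elif policy == "overwrite":
        # Overwrite Data: existing rows in the partition are replaced.
        table[partition] = list(rows)
    else:
        raise ValueError(f"unknown loading policy: {policy}")
```

In this model, rerunning the same load with append duplicates rows, while rerunning it with overwrite leaves the partition in the same state, which is worth keeping in mind when a pipeline may be retried.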

    Field Mapping

    Input Fields

    Lists input fields from the upstream component.

    Output Fields

    Lists output fields. You can perform the following actions:

    • Field Management: Click Field Management to select output fields.

      • To deselect fields, click the remove icon to move them from Selected Input Fields to Unselected Input Fields.

      • To select fields, click the add icon to move them from Unselected Input Fields to Selected Input Fields.

    • Batch Add: Click Batch Add to configure output fields in JSON, TEXT, or DDL format.

      • JSON format example:

        // Example:
        [{
          "name": "user_id",
          "type": "String"
        },
        {
          "name": "user_name",
          "type": "String"
        }]

        Note

        name specifies the name of the imported field, and type specifies its data type after import. For example, "name":"user_id","type":"String" imports the field named user_id and sets its data type to String.

      • TEXT format example:

        // Example:
        user_id,String
        user_name,String

        • The row delimiter separates one field definition from the next. The default is a line feed (\n). Semicolons (;) and periods (.) are also supported.

        • The column delimiter separates the field name from the field type. The default is a comma (,).

      • DDL format example:

        CREATE TABLE tablename (
            id INT PRIMARY KEY,
            name VARCHAR(50),
            age INT
        );
    • Create Output Field: Click + Create Output Field. Enter a value for Column and select a value for Type. Click the save icon to save the configuration for the current row.
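The JSON and TEXT batch formats described under Batch Add both reduce to a list of (field name, field type) pairs. The sketch below shows one way to read them. It is not the parser Dataphin uses: the function names are hypothetical, and it assumes that lines starting with // (such as the // Example: line) are comments that should be skipped.

```python
import json
from typing import List, Tuple

Field = Tuple[str, str]  # (field name, field type)


def _strip_comment_lines(text: str) -> str:
    # Assumption: lines starting with "//" are comments, not field definitions.
    return "\n".join(
        line for line in text.splitlines() if not line.lstrip().startswith("//")
    )


def parse_json_fields(text: str) -> List[Field]:
    """Read fields from a JSON array of {"name": ..., "type": ...} objects."""
    items = json.loads(_strip_comment_lines(text))
    return [(item["name"], item["type"]) for item in items]


def parse_text_fields(text: str, row_delim: str = "\n", col_delim: str = ",") -> List[Field]:
    """Read fields from TEXT input: one 'name<col_delim>type' entry per row."""
    fields = []
    for row in _strip_comment_lines(text).split(row_delim):
        row = row.strip()
        if not row:
            continue  # ignore blank rows, for example a trailing delimiter
        name, field_type = row.split(col_delim, 1)
        fields.append((name.strip(), field_type.strip()))
    return fields
```

Both parsers return the same pairs for the user_id and user_name examples above, which makes it easy to check a batch definition before pasting it into the dialog box.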

    Mapping

    Manually map fields between upstream input fields and target table fields. Quick Mapping options include Row Mapping and Name Mapping.

    • Name Mapping: Map fields with identical names.

    • Row Mapping: Map fields by position when field names differ between the source and target tables. Only fields in the same row are mapped.
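The two quick mapping options can be sketched as follows. This is an illustration of the behavior described above under stated assumptions, not the product's code: the function names are hypothetical, and name matching is assumed to be case-sensitive.

```python
from typing import List, Tuple

Mapping = List[Tuple[str, str]]  # (input field, output field)


def name_mapping(inputs: List[str], outputs: List[str]) -> Mapping:
    """Map each input field to the output field with the identical name."""
    # Assumption: names must match exactly, including letter case.
    output_names = set(outputs)
    return [(field, field) for field in inputs if field in output_names]


def row_mapping(inputs: List[str], outputs: List[str]) -> Mapping:
    """Map fields by position; fields without a counterpart in the same row stay unmapped."""
    return list(zip(inputs, outputs))
```

For example, with inputs uid, uname, extra and outputs user_id, user_name, name mapping finds no pairs, while row mapping pairs uid with user_id and uname with user_name and leaves extra unmapped.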

  9. Click Confirm to complete the configuration of the Data Lake Formation output component.
