The Export MaxCompute Table to OSS component exports a MaxCompute table to Object Storage Service (OSS). Each row in the table becomes a JSON object in the output file, and the output uses the JSON Lines format — one JSON object per line. The file as a whole is not a valid JSON object. Download an example output file to see the structure before configuring the component.
If the table contains a large amount of data, the output is split across multiple files, all stored in the OSS directory you specify.
Supported computing resources
Configure the component
-
In the Platform for AI (PAI) console, open Machine Learning Designer and go to your pipeline.
-
Add the Export MaxCompute Table to OSS component to your pipeline and connect it to the upstream component that provides the data to export.
-
Select the component to open its settings panel. Configure the parameters described in the following table.
-
Submit the pipeline.
Parameters
| Tab | Parameter | Required | Description | Default |
|---|---|---|---|---|
| Fields Setting | The OSS directory. | Yes | The OSS directory where the exported data is stored. If the data volume is large, the output is split into multiple files under this directory. | — |
| Tuning | Number of CPUs per instance of map task | No | Number of CPUs allocated to each map task instance. Valid values: 50–800. | 100 |
| The memory size per instance of map task | No | Memory allocated to each map task instance, in MB. Valid values: 256–12288. | 1024 | |
| The maximum size of input data for a map | No | Maximum data volume each map task instance processes, in MB. You can use this parameter to manage the input of a map. Valid values: 1–Integer.MAX_VALUE. | 256 |
该文章对您有帮助吗?