Once offline datasets are created, you can manage them from the offline datasets list. This topic outlines the management operations for these datasets.
At the top menu bar of Dataphin, select Tag > Tag Workbench.
In the left-side navigation pane, select Data Preparation > Offline Datasets.
The Offline Datasets pages respectively display the offline datasets list. The list includes information such as Dataset Name, Processing Method, Update Method, Owner, Dataset Status, Running Status, Downstream Tag, and Last Modified Time.
Running Status: If the running status shows Task Error, click the
icon in the running status column to view the error details.Downstream Tag: To view specific information about downstream tags in the dataset, click the
icon in the downstream tag column.
(Optional) Filter the desired dataset by selecting the processing method, dataset status, update method (only for offline datasets), owner, running status, or by entering the dataset name/code. Alternatively, select Only My Datasets to quickly find datasets owned by the current user.
In the offline datasets list, the following operations are available:
Operation
Description
Copy
Create a new dataset by copying the current dataset's information.
Edit
Edit the current offline dataset's editable information, such as Basic Information, O&M Configuration (only for offline datasets), and Processing Logic for datasets with statuses like Being Edited, Published, Publish Failed, or Offline.
NoteWhen editing offline datasets processed with Table Mapping or SQL Processing, you have the option to alter the Source Field associated with the metric. Ensure that the new Source Field Type corresponds with the Value Type of the Metric. For datasets processed using Form Processing, you are able to change the metric's Statistical Field and Statistical Function. The chosen Statistical Field and Statistical Function must be compatible with the Value Type of the Metric.
If the source table of an offline mapping dataset or offline form dataset shows Table Structure Information Not Retrieved, verify whether the source table has been deleted or renamed.
Details
View detailed configuration of the current dataset.
View Instance
Access the running instance of the current offline dataset, including details, logs, and rerun options.
NoteThis operation is exclusive to offline datasets.
Offline
Published offline datasets can be unpublished using the unpublish operation.
Run
Execute the run operation on offline datasets with a Manual Update Method. In the Run dialog box, select the Data Timestamp for the source table partition to be read, with the default set to Yesterday (T-1).
NoteThis operation is only available for offline datasets.
After manual execution, both the dataset data and any tag data referencing the dataset will be updated.
Go to O&M
Jump to the O&M page of the current offline dataset. For more information, see View and manage script tasks.
NoteThis function is limited to offline datasets.
Data Backfill
Carry out the data backfill operation on offline datasets with a Periodic Update Method.
NoteThis operation is exclusive to offline datasets.
Following data backfill, both the dataset data and any tag data referencing the dataset will be updated.
Delete
Editing and unpublishing offline datasets both allow for the delete operation.