Use the intelligent diagnosis feature to analyze tasks end-to-end. When a task does not run as expected, this feature helps you quickly pinpoint the problem.
Background information
The intelligent diagnosis feature analyzes tasks based on the following dimensions:
-
View running details: A scheduled task runs only when its upstream dependencies are successful, it reaches its scheduled time, sufficient scheduling resources are available, and it is not already running. For more information, see Conditions for a task to run.
-
View basic information: You can view the key timestamps for the current task instance.
-
Affected baselines: You can view a list of baselines that monitor the current task and check their operational status. For more information about intelligent baselines, see Intelligent baseline overview.
-
Historical instance running status: You can view the task's execution history for the last 15 days in charts and lists.
Limits
-
The Perform Diagnostics feature is available only in DataWorks Professional Edition and higher. You can currently try this feature for free, but we recommend that you upgrade to the Professional Edition to access more product capabilities. For more information about upgrading your edition, see Features of DataWorks editions.
-
The intelligent diagnosis feature is supported in the following regions: China (Hangzhou), China (Shanghai), China (Beijing), China (Zhangjiakou), China (Shenzhen), China (Hong Kong), Japan (Tokyo), Singapore, Malaysia (Kuala Lumpur), Indonesia (Jakarta), Germany (Frankfurt), US (Silicon Valley), US (Virginia), and UAE (Dubai).
Accessing intelligent diagnosis
Go to the Operation Center page.
Log on to the DataWorks console. In the top navigation bar, select the desired region. In the left-side navigation pane, choose . On the page that appears, select the desired workspace from the drop-down list and click Go to Operation Center.
-
After you open Operation Center, you can access the Intelligent Diagnostics page in one of two ways.
-
Method 1: Access intelligent diagnosis for an instance.
-
In the navigation pane on the left, choose . On the Instance Perspective tab, find the target instance and click Run Diagnostics in the Actions column to open the intelligent diagnosis page.
-
In the navigation pane on the left, choose . On the Instance Perspective tab, find the target instance and click DAG in the Actions column. In the DAG view, right-click the DAG for the instance and select Run Diagnostics from the menu to open the intelligent diagnosis page.
-
-
Method 2: In the navigation pane on the left, choose to open the intelligent diagnosis page.
NoteThe intelligent diagnosis feature can locate a specific instance only by its instance ID.
-
Running details
DataWorks sequentially checks the conditions required for a task to run. It verifies the status of upstream dependencies, the scheduled start time, the availability of scheduling resources, and the execution status of the current task.
-
Upstream Nodes
On the Upstream Nodes tab, intelligent diagnosis checks the status of upstream dependencies. If an upstream dependency has not run successfully, the current task is blocked. You can click Run Diagnostics in the Actions column for the upstream dependency to identify the cause of the failure.
NoteIf an upstream dependency has not run and the dependency chain is deep, first use the upstream analysis feature in the DAG panel to quickly identify the key blocking dependency. Then, use Intelligent Diagnostics to determine why that dependency failed to run. This improves O&M efficiency.
-
Timing Check
The Timing Check verifies whether the current task has reached its scheduled run time. This check is triggered only after the upstream dependency check passes.
NoteWhen you define the scheduling properties for a task in the DataStudio module, you set an expected run time. However, the actual run time might be delayed due to issues such as the failure of an upstream dependency.
-
Resources
The Resources tab of intelligent diagnosis displays resource usage and a list of tasks that are occupying resources while the current task is waiting. If the check fails, this indicates insufficient resources, and the task must wait for them to become available. You can use the resource usage trend to schedule tasks during off-peak hours.
Feature
Description
Scheduling resource information
Displays the name of the scheduling resource group used by the current task, the number of tasks currently running on the resource group, and the number of tasks waiting to run on the resource group.
NoteUse a serverless resource group to alleviate resource contention.
If you use a shared scheduling resource group, the peak hours for DataWorks tasks are from 00:00 to 09:00 every day. During peak hours, scheduling resources can be tight, and tasks may have to wait for resources.
Diagnosis Results
Displays the execution status of the current task.
Resource Usage Trends
If you use a shared scheduling resource group, this section displays the resource utilization rate of the group over different periods and the task's resource wait time.
-
Execution
The Execution feature shows you information about the current task, such as its execution logs, the execution details of associated data quality monitoring rules, and code details. For failed tasks, the intelligent diagnosis feature provides you with intelligent diagnosis suggestions based on log information to help you quickly find the cause of the error.
Feature
Description
Log
Displays the detailed execution process of the task.
In the Operation Details module of the Execution page, you can click the EMR Web UI address printed in the Log to open the corresponding EMR component's web UI. You can also click the Intelligent Diagnostics button in the lower-right corner to analyze the error message.
Intelligent Diagnostics
This feature uses a large language model (LLM) to analyze task run error logs. Tongyi Qianwen, DeepSeek, and the DW Knowledge Base are supported.
You can use the Tongyi Qianwen and DeepSeek LLMs to analyze error logs, generate an error analysis, and receive suggested solutions. You can also view recommended solutions in the DW Knowledge Base.
NoteAfter the LLM completes the analysis, you can perform quick actions such as Modify code, Rerun instance, Set instance to success, Modify instance scheduling resource group, Modify instance data integration resource group, submit a ticket, and apply for table permissions.
DQC
If the task is associated with a data quality monitoring rule, the rule is triggered when the task runs. You can view the rule's detailed execution results here.
Code details
Displays the code of the current task.
Basic information
On the General tab, you can view key timestamps and basic information for the current task. For a detailed description of related properties, see Scheduling configuration.
Affected baselines
On the Influenced Baseline tab, you can view the list of baselines that monitor the current task and check their operational status. For more information about intelligent baselines, see Intelligent baseline overview.
Historical instances
On the Historical instance tab, you can view the following information:
-
Metric trends for the current task: A chart shows the trends in the task's Running time, Start run time, Waiting Time for Resources, and Completed At over the last 15 days.
-
Historical instance runs for the current task: A list displays details of historical instances, including start time, end time, duration, and resource wait time. You can click Run Diagnostics in the Actions column to open the diagnosis details page for that instance.