The Lindorm Ray node in DataWorks lets you develop and periodically schedule Python tasks with the Ray distributed computing framework.
Overview
The Lindorm compute engine provides a computing service based on the Ray distributed computing framework. It is compatible with open-source Ray APIs and supports the Python programming model for distributed computing, machine learning, and data processing. With the Lindorm Ray node in DataWorks, you can write Python code, configure Ray submission commands, and develop, debug, and schedule Ray jobs on a periodic basis.
Limitations
- Resource groups: Lindorm Ray nodes can run only on a Serverless resource group.
- Language: Lindorm Ray nodes support only Python.
- Execution: You cannot run single lines or blocks of code. You must submit the entire script for execution.
Prerequisites
- You have created a Lindorm instance and associated it with a DataWorks workspace. For more information, see Associate a Lindorm computing resource.
- You have created a Ray resource group in the Lindorm console. For more information, see Use a RAY resource group.
- (Required only if you use a RAM account) The RAM account used for development has been added to the workspace and granted the Developer or Workspace Administrator role. The Workspace Administrator role has extensive permissions, so assign it with caution. For instructions on how to add members, see Add workspace members.
  Note: If you are using a primary account, you can skip this step.
Create a Lindorm Ray node
For instructions on how to create a node, see Create a Lindorm Ray node.
Develop a Lindorm Ray node
Write Python code in the code editor and configure the job submission command in the Ray submission command area.
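As a rough illustration of the kind of script the code editor might hold, the following is a minimal sketch that uses only open-source Ray Core calls (ray.init, ray.remote, ray.get, ray.shutdown). The function names square and main are hypothetical, and the sketch assumes that calling ray.init() with no arguments inside a submitted job attaches to the Ray resource group; confirm this behavior and the exact submission command format in the Lindorm documentation.

```python
import importlib.util


def square(x: int) -> int:
    """Plain Python function (hypothetical); Ray wraps it as a remote task in main()."""
    return x * x


def main() -> None:
    # Imported here so the file can still be loaded where Ray is not installed.
    import ray

    # Assumption: with no address given, ray.init() attaches to the cluster
    # that the job was submitted to.
    ray.init()

    # Wrap the plain function as a Ray remote task and fan out four calls.
    remote_square = ray.remote(square)
    futures = [remote_square.remote(i) for i in range(4)]

    # ray.get blocks until all tasks finish and returns their results in order.
    print(sum(ray.get(futures)))

    ray.shutdown()


if __name__ == "__main__":
    if importlib.util.find_spec("ray") is None:
        print("Ray is not installed in this environment; skipping the run.")
    else:
        main()
```

The whole file is the unit of execution, which matches the limitation that a Lindorm Ray node cannot run individual lines or blocks.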
Debug a Lindorm Ray node
- Configure run properties.
  In the Run Configuration pane on the right side of the node, configure the Compute Resource, Lindorm Resource Group, and Resource Group. The following table describes these parameters.
  Parameter | Description
  Compute Resource | Select the associated Lindorm compute resource.
  Lindorm Resource Group | Select the Ray resource group you created in the Lindorm console.
  Resource Group | Select a Serverless resource group that has passed the network connectivity test. Lindorm Ray nodes support only Serverless resource groups.
  Script Parameters | If you define variables in the ${Parameter Name} format when configuring the node, set the Parameter Name and Parameter Value in the Script Parameters section. At runtime, the task replaces each variable with its value. For more information, see Scheduling parameter sources and their expressions.
- Run and debug the node.
  To execute the node task, click Save and then click Run.
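The ${Parameter Name} replacement described for Script Parameters can be pictured with Python's standard string.Template class, which uses the same ${name} placeholder syntax. This is only a local illustration of the idea, not how DataWorks performs the replacement; the parameter name bizdate, its value, and the path are hypothetical.

```python
from string import Template

# Script text as it might be written in the node, with a ${...} variable.
script_text = 'input_path = "/data/${bizdate}/events"'

# Parameter Name -> Parameter Value pairs, as entered under Script Parameters.
script_parameters = {"bizdate": "20240101"}

# substitute() replaces every ${name} and raises KeyError if a name has no value.
rendered = Template(script_text).substitute(script_parameters)
print(rendered)  # input_path = "/data/20240101/events"
```

A variable with no matching Parameter Value fails loudly in this sketch, which is a useful reminder to define every variable before you click Run.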
Next steps
- Configure node scheduling: To run the node on a periodic schedule, configure the scheduling policy in the Scheduling pane on the right.
- Deploy the node: To run the task in the production environment, click the deploy icon to deploy the node. Nodes run periodically only after they are deployed.