Create a workspace

更新时间:
复制 MD 格式

Starting a new data project requires isolating code, resources, and team members to prevent disorganization and security risks. A DataWorks workspace is a project-based container that helps you manage development tasks, compute resources, and member permissions, enabling secure, orderly, and efficient team collaboration.

Note

We recommend reading the Workspace overview first.

Quick start

Experienced users can follow these steps to quickly create a standard workspace suitable for production environments.

  1. Go to the DataWorks workspace list. Ensure the correct region is selected at the top of the page (the region cannot be changed after the workspace is created), and then click Create Workspace.

  2. On the creation page, configure the following key parameters:

    • Workspace Name: Enter a unique name that complies with your team's naming conventions.

    • Isolate Development and Production Environments : Enable this option. This creates a workspace in standard mode, isolating the development and production environments.

    • Use Data Studio (New Version): We recommend enabling this option to use the new version of Data Studio.

      If this option is not visible, it is enabled by default. If the page prompts you to authorize a service-linked role for PAI or perform other actions, follow the on-screen instructions before proceeding.

  3. Click Create Workspace.

After the workspace is created, the next step is to associate compute resources.

Permissions

To create a workspace, your account must meet one of the following requirements:

  • It is an Alibaba Cloud account.

  • It is a RAM user with the AliyunDataWorksFullAccess or CreateWorkspace access policy. For more information, see Grant permissions to a RAM user.

Before you begin

Before you create a workspace, you must plan its configuration and choose a suitable mode.

Action

Description

Reference

Plan workspaces

A workspace is the largest unit of business isolation in DataWorks. Before using workspaces, design a partitioning plan that suits your specific business scenarios.

Plan workspaces

Choose a workspace mode

DataWorks provides two workspace modes: basic mode and standard mode.

  • Standard Mode provides isolated development and production environments and is the best choice for ensuring data security and standardized workflows.

  • Basic Mode provides only a production environment and is suitable for individual testing or quick validation scenarios.

We strongly recommend using standard mode for all production projects.

Workspace mode differences

Create Workspace

Step 1: Select region and time zone

Workspaces are region-specific. You must first select the region where your business data resides before you can create a workspace in that region.

  1. Go to the DataWorks workspace list.

  2. In the top navigation bar of the console, select the desired region.

    Important

    The region cannot be changed after the workspace is created. Before selecting a region, confirm the following information:

    • Region and time zone relationship:

      • Default scheduling time zone: The scheduling time zone of a workspace defaults to the time zone of its region. For example, the default scheduling time zone for the China (Beijing) region is UTC+8. If the default time zone does not meet your requirements, see the next point about multi-time zone support.

      • Multi-time zone support: Some regions allow you to manually switch to other scheduling time zones. Read Switch the scheduling time zone to check whether your selected region supports this feature and which time zones are available.

    • Impact of daylight saving time (DST): If your selected region observes DST, such as Germany (Frankfurt) or US (Virginia), we strongly recommend that you read Scenario: Impact of DST switch on scheduled tasks to prevent the time zone change from affecting your scheduled tasks unexpectedly.

Step 2: Create the workspace

About the default workspace: The system provides a built-in workspace named default_workspace_xxxx. It is intended for quick exploration only and must not be used for production. It is a basic mode workspace with open permissions and no environment isolation. For more information, see Built-in default workspace.
  1. On the Workspaces page, click Create Workspace to go to the creation page.

  2. On the creation page, configure the parameters as described in the following table.

    Parameter

    Description

    Basic Information

    Workspace Name

    The unique identifier for the workspace. This name cannot be changed after creation.

    The name must be unique within the same region or tenant. The exact uniqueness requirement is subject to the validation prompts on the creation page.

    Display Name

    We recommend a name that reflects the business purpose of the workspace for easy identification.

    Core mode and features

    Isolate Development and Production Environments

    Defines the workspace mode, which determines whether to isolate the development and production environments.

    • Enable: Isolates the development and production environments. This creates a Standard Mode workspace.

    • Disable: Does not isolate the development and production environments. This creates a Basic Mode workspace.

    We recommend enabling this option for production environments.

    Use Data Studio (New Version)

    We recommend enabling this option to experience the latest features of the new Data Studio. If you do not enable this option, the legacy Data Studio is used.

    Important

    If this option is not visible, it typically means the new Data Studio is enabled by default. If the switch is unavailable or you cannot proceed to the next steps, follow the error messages and guides on the creation page (including authorizing service-linked roles for PAI) and try again.

    Workspace Template

    A workspace template defines the tools, resources, and features available in a DataWorks workspace.

    After you select a workspace template, you can add compute resources and data sources based on your requirements. For more information, see Introduction to workspace templates.

    Advanced Settings

    Workspace Administrator

    Defines the administrator for this workspace. By default, the currently logged-in account is the administrator. You can add other RAM users as administrators to help manage the workspace. The workspace administrator role has extensive permissions, so grant it with caution. For more information, see [Deprecated] Responsibilities of a workspace administrator.

    Create AI Workspace with Same Name

    Specifies whether to create an AI workspace with the same name. This option is enabled by default. You can schedule algorithm tasks from PAI in the AI workspace.

    Default Resource Group of DataWorks Workspace

    The default DataWorks resource group used for tasks that run in the workspace. You can change this setting later in the workspace configuration.

    If this option is not displayed, you can configure the default DataWorks resource group in the workspace configuration or scheduling settings in Data Studio after the workspace is created.

    Alibaba Cloud Resource Group

    Select a resource group created in Alibaba Cloud Resource Management. By default, the Default Resource Group is selected.

    If you have purchased multiple Alibaba Cloud resources, you can use Resource Management to create resource groups for better organization. You can also assign administrators to each group to manage all resources within it independently.

    Important

    This resource group organizes resources within an Alibaba Cloud account, which helps simplify resource grouping and permission management. It is distinct from the DataWorks resource group that is required to run tasks. Do not confuse the two concepts.

Step 3: Associate compute resources

After you create a workspace, you must associate it with compute resources, such as MaxCompute, to perform big data development tasks.

  • If you enabled Use Data Studio (New Version), you are automatically directed to the page for associating compute resources. For detailed steps, see Associate compute resources.

    If you have not prepared the compute resources yet, you can click Disable after creating the workspace and associate them later.
  • If you do not use Use Data Studio (New Version), you are automatically redirected to the DataWorks workspace list. You can view information about the workspace. For more information, see Configure a workspace. After the workspace is created, you also need to bind compute resources to the workspace. You can perform data development only after the creation or registration is complete.

Best practices for production

  • Mode selection: We recommend using Standard Mode to isolate development and production environments.

  • Naming conventions: Establish a unified standard that uses names with clear business meaning, such as finance_tax_report for a finance tax report project.

  • Time zone confirmation: For teams with cross-border operations, you must confirm the scheduling time zone policy before creating the workspace.

Next steps

Appendix

Workspace templates

Workspace Name

Applicable scenarios

Features

Supported regions

DataWorks workspace

Suitable for a wide range of business scenarios. You can configure compute resources and data sources as needed.

A fully configurable workspace mode based on the features of your DataWorks edition.

All regions where DataWorks is deployed.

OpenLake Workspace

A workspace for the big data/search/AI integrated solution, built on an open and manageable data lakehouse architecture.

  • An integrated big data/search/AI solution built on an open and manageable data lakehouse architecture.

  • Uses Data Lake Formation (DLF) to manage structured, semi-structured, and unstructured data, supporting secure access and I/O acceleration for lakehouse tables and files.

  • Supports multi-engine integration and peer-to-peer collaborative computing, leveraging DataWorks for unified development and large-scale task scheduling.

OpenLake Workspace can be created only in the China (Hangzhou), China (Shanghai), China (Beijing), and China (Shenzhen) regions.

Built-in default workspace

When you use DataWorks for the first time or activate it in a new region, DataWorks automatically performs the following actions to help you get started quickly without extensive preparation:

  1. Automatically creates a default workspace (name starts with default_workspace_).

  2. Automatically creates a default pay-as-you-go resource group (named dataworks_default_resource_group).

  3. Automatically creates a MaxCompute project and associates it as a compute resource (name starts with default_datasource_) .

  4. Automatically associates newly purchased resource groups with the default workspace.

Important

We do not recommend using the default workspace for production tasks.

Default vs. user-created workspace: Basic properties

The following table compares the properties of the default workspace and user-created workspaces.

Property

System default workspace

User-created workspace

Workspace Name

default_workspace_[4-digit random code]. Cannot be changed.

Custom. Cannot be changed after creation.

Display Name

Default Workspace. The display name can be changed.

Custom. Can be changed after creation.

Workspace mode

A basic mode workspace. A workspace with the new Data Studio enabled cannot be upgraded to standard mode. If the new Data Studio is not enabled, the workspace can be upgraded.

You can choose to create a basic mode or standard mode workspace.

Workspace Administrator

The default administrator depends on who activates DataWorks:

  • If an Alibaba Cloud account activates DataWorks, the Alibaba Cloud account is the default workspace administrator.

  • If a RAM user activates DataWorks, both the Alibaba Cloud account and that RAM user are the default workspace administrators.

Same as the system default workspace.

Workspace scheduling time zone

The scheduling time zone defaults to the local time zone. To change it, see Switch the scheduling time zone.

Same as the system default workspace.

Default vs. user-created workspace: Visibility and access

Feature

System default workspace

User-created workspace

Core logic

Open/default access model

Invitation-based/allowlist model

Visible Scope

Globally visible: Visible to all members of the current tenant.

Restricted visibility: Visible only to members who have been explicitly added to the workspace.

Access method

Any member of the tenant can access the workspace.

Members must be manually added by a workspace administrator to gain access.

Role logic

Automatic grant: When a member accesses the workspace for the first time, the system automatically grants them the guest role.

Explicit assignment: When a member is added, their role is explicitly assigned by a workspace administrator.