DataWorks provides two workspace modes: basic mode and standard mode. Basic mode offers simplicity; standard mode adds environment isolation, permission governance, and deployment workflows for production safety.
Background
This topic covers the following aspects:
|
Section |
Description |
|
Architecture of each workspace mode. |
|
|
How each mode affects node development and O&M. |
|
|
Pros and cons of each mode. |
|
|
How standard mode enables role-based workflow governance. |
|
|
Data source mapping for DataWorks modules in different workspace modes |
Basic mode has only a production environment; standard mode has both development and production. This section maps DataWorks modules to their corresponding environments in each mode. |
|
How to isolate development from production in basic mode. |
Notes
-
In standard mode, you must create physically isolated data sources for the development and production environments. Create a data source.
-
Cross-project or cross-database access depends on data source characteristics. If the development and production environments use different data sources, whether the development environment can access production tables, resources, and functions depends on the data source type.
-
In standard mode, development environment tasks are not periodically scheduled by default. Tasks run on a schedule only after deployment to the production environment.
Key concepts
Basic and standard mode workspaces differ in the following ways:
Either mode works for exploration, but standard mode is recommended for production use. It provides code isolation, compute resource isolation, permission isolation, and deployment workflow governance between environments.
To retain existing code while upgrading, see Upgrade a workspace from basic mode to standard mode.
The following table compares both modes in detail.
|
Perspective |
Basic mode |
Standard mode (recommended) |
|
Number of data sources |
A DataWorks workspace corresponds to one data source. |
A DataWorks workspace corresponds to two data sources, isolating the development and production environments. Note
You must create physically isolated data sources for the development and production environments.
|
|
Corresponding DataWorks environments |
The single data source serves as the production environment for DataWorks. |
Of the two data sources, one serves as the development environment and the other serves as the production environment for DataWorks. Note
You can create different data sources for the development environment and the production environment. For example:
|
Impact on development and O&M
|
Comparison |
Basic mode |
Standard mode (recommended) |
|
Development workflow governance |
Submitted tasks are periodically scheduled in the scheduling system without deployment. (submit --> production)
|
Tasks must be submitted in the development environment and deployed to production before scheduling. (submit --> deploy --> production) Note
In standard mode, only tasks in the production environment are automatically scheduled.
|
|
O&M permission governance |
Developers can directly edit the code of production tasks. |
Developers can only edit and submit code in Data Studio, not deploy directly to production. Deploying to production requires O&M permissions (project owner, workspace administrator, or operator role).
|
|
Data access governance |
Developers can directly access production data for testing, posing security risks. |
Developers use test data in the development environment, or request permissions to access production data for verification. Note
|
|
Data access identity |
A unified identity is used to directly operate in the production environment. For MaxCompute, Hologres, EMR, and CDH, access identities include: Alibaba Cloud account, RAM user, RAM role (supported only by MaxCompute), and task owner. Note
For other engines (AnalyticDB for MySQL, AnalyticDB for PostgreSQL, etc.), access depends on the account specified when the data source was created. Permissions match those of the database account. |
Note
MaxCompute, Hologres, EMR, and CDH
For other engines (AnalyticDB for MySQL, AnalyticDB for PostgreSQL, etc.), access depends on the account specified when the data source was created. Permissions match those of the database account. |
Pros and cons
|
Comparison |
Basic mode |
Standard mode |
|
Advantages |
Simple and easy to use. Assign the developer role and data developers can complete all data warehouse tasks. |
Secure and standardized.
|
|
Disadvantages |
Risks of instability and insecurity exist.
|
More complex workflow — typically requires multiple people to complete development and production tasks. |
Standard mode workflow
The development-production isolation in standard mode affects data model design, processing logic, and code deployment workflows.
Appendix: Data source mapping by module
Go to to view compute resources associated with Data Studio. The following table shows which data source each module operates on:
|
DataWorks module |
Standard mode |
Basic mode |
|
Data Studio |
Operates on the development environment data source (instance, project, or database) |
Operates on the production environment data source (instance, project, or database) |
|
Operation Center |
|
Appendix: How to isolate development from production in basic mode
If you use a basic mode workspace and want environment isolation:
Prepare two basic mode workspaces — one for development, one for production. Use cross-workspace deployment to deploy tasks from the development workspace to the production workspace, achieving environment isolation. Cross-workspace deployment.
Drawback: The production workspace code can still be edited directly in Data Studio, so the production code entry point is not unique and may compromise workflow governance.
Recommendation: Upgrade to standard mode for better workflow governance. Upgrade a workspace from basic mode to standard mode.



