What is Alibaba Cloud Model Studio

更新时间:
复制 MD 格式

Alibaba Cloud Model Studio is a one-stop LLM development and application platform. It integrates Qwen and mainstream third-party models, provides OpenAI-compatible APIs, and delivers full-lifecycle model services — from API calls to visual application building.

Key capabilities:

api Call APIs

Generate content and summaries with a few lines of code.

Model Studio is OpenAI-compatible. Update the API key, base URL, and model name to migrate existing OpenAI code.

Python

import os
from openai import OpenAI

# Note: The base_url varies by region. The following example uses the base_url for the China (Beijing) region.
# - China (Beijing): https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1. Replace {WorkspaceId} with your workspace ID.
# - Singapore: https://{WorkspaceId}.ap-southeast-1.maas.aliyuncs.com/compatible-mode/v1
# - Germany (Frankfurt): https://{WorkspaceId}.eu-central-1.maas.aliyuncs.com/compatible-mode/v1
# - US (Virginia): https://dashscope-us.aliyuncs.com/compatible-mode/v1
client = OpenAI(
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    base_url="https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1",
)
completion = client.chat.completions.create(
    model="qwen3.7-plus",
    messages=[{'role': 'user', 'content': 'Who are you?'}]
)
print(completion.choices[0].message.content)

Node.js

import OpenAI from "openai";

// Note: The base_url varies by region. The following example uses the base_url for the China (Beijing) region.
// - China (Beijing): https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1. Replace {WorkspaceId} with your workspace ID.
// - US (Virginia): https://dashscope-us.aliyuncs.com/compatible-mode/v1
// - Singapore: https://{WorkspaceId}.ap-southeast-1.maas.aliyuncs.com/compatible-mode/v1
// - Germany (Frankfurt): https://{WorkspaceId}.eu-central-1.maas.aliyuncs.com/compatible-mode/v1. Replace {WorkspaceId} with your workspace ID.
const openai = new OpenAI(
    {
        apiKey: process.env.DASHSCOPE_API_KEY,
        baseURL: "https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1",
    }
);

async function main() {
    const completion = await openai.chat.completions.create({
        model: "qwen3.7-plus",
        messages: [{ role: "user", content: "Who are you?"}],
    });
    console.log(completion.choices[0].message.content)
}

main()

curl

The base URL varies by region. The following example uses the base URL for the China (Beijing) region.

  • China (Beijing): https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1/chat/completions. Replace {WorkspaceId} with your workspace ID.

  • US (Virginia): https://dashscope-us.aliyuncs.com/compatible-mode/v1/chat/completions

  • Singapore: https://{WorkspaceId}.ap-southeast-1.maas.aliyuncs.com/compatible-mode/v1/chat/completions

  • Germany (Frankfurt): https://{WorkspaceId}.eu-central-1.maas.aliyuncs.com/compatible-mode/v1/chat/completions. Replace {WorkspaceId} with your workspace ID.

curl -X POST https://{WorkspaceId}.cn-beijing.maas.aliyuncs.com/compatible-mode/v1/chat/completions \
-H "Authorization: Bearer $DASHSCOPE_API_KEY" \
-H "Content-Type: application/json" \
-d '{
    "model": "qwen3.7-plus",
    "messages": [
        {
            "role": "user",
            "content": "Who are you?"
        }
    ]
}'

robot Build an intelligent customer service

Build an AI assistant for customer inquiries using visual tools.

Customer service sample 3

process Orchestrate flows

Visual orchestration lets non-technical staff design workflows without writing code.

Workflow diagram 4

tune_8 Fine-tune models

Customize models using visual fine-tuning without writing code.

loss_3

Model service

Models

Model Studio provides ready-to-use model services, including the proprietary Qwen series and third-party models such as DeepSeek, Kimi, and GLM. See Recommended models.

  • Qwen flagship models:

    • Qwen-Max: The highest-performing model in the Qwen series, suited for complex, multi-step tasks.

      The latest qwen3.7-max delivers significant reasoning improvements over its predecessor. Recommended.
    • Qwen-Plus: Balances performance, speed, and cost — recommended for most scenarios.

    • Qwen-Flash: Low-cost and low-latency — suited for simple tasks that require fast responses.

  • Multimodal coverage: Includes text generation, visual understanding, image generation, video generation, speech recognition and synthesis, and embedding.

  • Domain-specific models: Models for long-text processing, translation, data mining, legal, intent recognition, role-playing, and in-depth research.

Model fine-tuning, deployment, and evaluation

  • Model fine-tuning: Supports supervised fine-tuning (SFT), continued pre-training (CPT), and direct preference optimization (DPO).

  • Model deployment: Deploy pre-built or custom models as dedicated inference services for high-concurrency, low-latency scenarios. Billing options include duration-based, monthly subscription, and token volume-based plans.

  • Model evaluation: Compare models, verify fine-tuning results, and identify threats with manual, automatic, and baseline evaluations.

Application building

Billing

Activating Model Studio is free. Costs apply only when you invoke , fine-tune, or deploy models. See Billable items.

Free quota for new users

New users receive a free quota in the China (Beijing) region to try model invocation.

  • Unverified users cannot continue using the service after the free quota is depleted. They must complete identity verification and top up their account to switch to pay-as-you-go billing.

  • Verified users are automatically switched to pay-as-you-go billing after the free quota is depleted. To avoid unexpected charges, enable the Free quota only feature — the service stops when the quota is depleted.

For more information, see Free quota for new users.

Payment methods

Model calls are billed per minute. Make sure your account balance is sufficient — add funds on the Expenses and Costs page.

View bills and usage

  • Billing details: Go to the Billing Details and Cost Analysis pages.

  • Call statistics: About one hour after making a model call, go to the Model Studio console, select your region from the top-right corner, go to the Model Monitoring page, set your query conditions, click Monitor in the Actions column for the target model, and view call volume, token consumption, success rate, and other statistics. See Model monitoring.

  • Coding Plan usage: If you are subscribed to Coding Plan, view quota consumption on the Coding Plan page. Coding Plan uses a fixed monthly fee with a monthly request quota for AI coding tools. See Coding Plan overview.

Getting started

FAQ

Q: Is my data secure? Will Alibaba Cloud Model Studio use my data for training?

A: Alibaba Cloud protects data privacy and will never use your data for model training. All data transmitted when building applications or training models is encrypted. See Security certifications and privacy.

Q: How many regions is Model Studio available in? What are the differences between regions?

A: Model Studio is available in the following regions:

China (Beijing), US (Virginia), Singapore, and Germany (Frankfurt)

Select a nearby region to reduce network latency. Regions differ in endpoints and base URLs, API keys (not interchangeable across regions), supported models, platform features, and pricing. See the Model list.

Q: How can I avoid automatic charges?

A: Model Studio uses pay-as-you-go billing with no automatic deduction. To avoid charges:

  • Delete API keys: Go to the Model Studio console, select your region, go to the API-KEY page, and delete all API keys.

  • Stop all model calls in applications, agents, and workflows — including scheduled tasks and background processes.

  • Clean up billable resources: Delete unused knowledge bases. On the Model Deployment page, unpublish deployed instances billed by computing power usage.

  • Turn on Free quota only (new users only): On the model details page, enable this switch. Service stops when the quota is depleted, preventing paid usage. Applies only to models in the China (Beijing) region (Chinese mainland deployment scope) within the validity period.

  • Set up cost monitoring and alerts: View Billing Details and Monitoring, and set up high-spending alerts to catch abnormal spending.

  • Coding Plan: This plan offers a fixed monthly fee and monthly request quota, replacing pay-as-you-go charges. Use the dedicated base URL and API key for the Coding Plan when making calls — otherwise, model calls are charged pay-as-you-go. See Coding Plan overview.

Q: How to use Qwen3 or DeepSeek?

A:

  1. Try it online: Open the Model Studio console, select your region, go to the Models page, and click a model. (DeepSeek is available only in the China (Beijing) region.)

  2. Call models via API: See Make the first call to a Qwen API.

  3. Use developer tools (such as Claude Code): See Connect to clients or developer tools.

  4. Build LLM applications using the visual interface: See Agent applications or Workflow application.