Quick Start for Communication Intelligence Engine-Artificial Intelligence Cloud Call Service(AICCS)-阿里云帮助中心

Artificial Intelligence Cloud Call Service provides the Communication Intelligence Engine. By integrating with a Large Language Model (LLM), it encapsulates the call process to enable awareness and control of call events. This topic shows you how to get started quickly with the Communication Intelligence Engine.

The Communication Intelligence Engine integrates ASR (automatic speech recognition) and TTS (text-to-speech). It interacts with the LLM using text streams and detects and controls call events such as interruptions, silence, and overlapping speech to create a human-like interaction experience. Artificial Intelligence Cloud Call Service also offers other functional modules for your selection:

If you want an out-of-the-box solution: Choose Voice Agent, which includes Alibaba Cloud's communication LLM. You can connect quickly without configuring an LLM yourself.
If you prefer orchestration on a canvas: Choose Intelligent Call Robot. This option does not use an LLM but requires you to design call scripts in advance.

Integration flow

This topic guides you through configuring an LLM, connecting to the Communication Intelligence Engine, and creating an LLM application. The overall flow is as follows:

After integration is complete and the user answers the call, the Communication Intelligence Engine first plays a greeting. After receiving the user’s response, it sends the content to the LLM and plays audio based on the LLM’s output. During this process, the engine also handles call events such as interruptions and silence, as shown in the following diagram.

Preparations

Register an Alibaba Cloud account and complete identity verification: Enterprise identity verification is required to use Artificial Intelligence Cloud Call Service. Individual accounts are not supported.
Activate Artificial Intelligence Cloud Call Service: Go to the Artificial Intelligence Cloud Call Service console to activate the service.
Activate Alibaba Cloud Model Studio (optional): If you plan to use Alibaba Cloud Model Studio models, log on to the Alibaba Cloud Model Studio console to activate the service.

Request a number

Before connecting to the Communication Intelligence Engine, you must complete applications for voice qualifications and number resources. Voice number qualification applications, resource requests, and bill viewing are all managed in Voice Service.

Apply for voice qualifications: Go to the Voice Service console. In the navigation bar, choose Qualification and Script Management > Qualification Management and click Add New Qualification. Follow the instructions in Enterprise Qualification Management to fill in the required information and submit your application.
Apply for voice scripts: After your qualification is approved, in the navigation bar, choose Qualification and Script Management > Scenario & Script Management and click Add Script. Fill in the requested information and script content as prompted. Select Dedicated Mode for the business mode.
Apply for number resources: After your script is approved, in the navigation bar, choose Voice Numbers > Real Number Application and click Apply for Number. Request fixed-line and mobile phone numbers for the required regions based on your business needs.

Note
The monthly rental fee for a standard number is CNY 35 per month. Charges start in the month the number is activated. Partial months are billed as full months. You can also contact your account manager for fully managed service: Fully managed is a Voice Service feature that automatically applies for and replaces numbers based on your call volume to maintain high answer rates.

Configure an LLM

Before using the Communication Intelligence Engine, configure your own LLM. We recommend using Qwen-series models via Alibaba Cloud Model Studio, or using open source or commercially available third-party models.

If you use your own LLM or integrate a third-party LLM, it must comply with the Communication Engine Gateway Standard Protocol to connect.
If you choose Alibaba Cloud Model Studio, you can connect via a model or an application.
- Connect via model: See Select a model to choose your preferred large language model. You will need this when creating an LLM gateway later.
- Connect via application: Large language models cannot directly answer questions about private knowledge domains. Use Alibaba Cloud Model Studio’s agent application capabilities and private knowledge documents to build a Q&A application that handles domain-specific queries. For call scenarios, LLM applications must use streaming output. Otherwise, slow content generation causes choppy interactions. We currently recommend using agent applications or conversational workflow applications.
  Agent application
  1. Log on to Alibaba Cloud Model Studio and go to the Application Management page. Click Add Application.
  2. Select Agent Application and click Create Now.
  3. Select a Model Studio model and enter instructions, knowledge base details, and other settings. For step-by-step instructions, see Build a Q&A application with no code. When finished, click Publish.
    
    The configuration page also includes sections for Variables, Vision toggle, and Skills (supporting MCP Server, plugins, agents, and workflows). A text conversation test area on the right lets you enter questions to try the application.
  4. Return to the Application Management page and obtain the application ID.
  5. Replace YOUR_APP_ID in https://dashscope.aliyuncs.com/api/v1/apps/YOUR_APP_ID/completion with the application ID you obtained to get the model endpoint URL.
  Conversational workflow application
  1. Log on to Alibaba Cloud Model Studio and go to the Application Management page. Click Add Application.
  2. Select Workflow Application and click Create Conversational Workflow.
  3. Configure nodes as needed. For detailed steps, see Example 2: Intelligent Shopping Guide. When finished, click Publish.
    
    Note
    
    Ensure the output node of the associated LLM has “Return Result” enabled. Otherwise, output is only available after the model completes its full response.
    
    Ensure the workflow end node uses text output and has “Return Result” disabled. Otherwise, duplicate content may occur.
    
    Conversational workflow applications support custom input parameters.
  4. Return to the Application Management page and obtain the application ID.
  5. Replace YOUR_APP_ID in https://dashscope.aliyuncs.com/api/v1/apps/YOUR_APP_ID/completion with the application ID you obtained to get the model endpoint URL.

Step 1: Create and publish an LLM gateway

Create an LLM gateway

Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > LLM Gateway Configuration and click Create LLM Gateway.

In the dialog box, enter model details. For example, if using Alibaba Cloud Model Studio:

Note

You can use Qwen-series models, open source models, or commercially mature third-party models. If building your own LLM, fine-tune it for your business scenario and configure prompts before connecting to the Communication Intelligence Engine. Your custom LLM must comply with the LLM Gateway Integration Parameter Protocol to connect.

Configuration item	Description	Best practice setting
Model name	Enter a custom name to distinguish this LLM gateway.	Test Gateway
Model endpoint	Enter the LLM invocation endpoint. If using an Alibaba Cloud Model Studio model: `https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completions` If using an Alibaba Cloud Model Studio application: Replace `YOUR_APP_ID` in `https://dashscope.aliyuncs.com/api/v1/apps/YOUR_APP_ID/completion` with your application ID	http://****
Authorization code	API key for the LLM service. If using Alibaba Cloud Model Studio, go to the Alibaba Cloud Model Studio console to get your API key. If you don’t have one, click Create My API_KEY in the upper-right corner.	s*********************************1
Model temperature	Sampling temperature that controls the diversity of generated text.	0.7 Note This is a recommended value.
Model topP	Nucleus sampling probability threshold that controls the diversity of generated text.	0.9 Note This is a recommended value.
Model topK	Size of the candidate set during generation.	50 Note This is a recommended value.
Base model version	Model version passed as a parameter to the LLM. For Qwen models, versions include qwen-plus, qwen-turbo, and qwen-max. For more information, see Select a model. Note When using a Model Studio application, do not configure the base model version to avoid test errors.	qwen-plus

Test and publish the LLM gateway

After creating the gateway, test it before publishing for use by LLM applications.

Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose Communication Intelligence Engine > LLM Gateway Configuration. Find your gateway and click Test in the Actions column.
In the dialog box, enter test content and click Test to view results.

Results appear in a table with three columns: Base Model Version, Connectivity Test Result, and Test Result, showing the model version used, whether connectivity passed, and the model’s response.
After a successful test, click Publish in the Actions column.

Step 2: Create and configure an LLM application

Create an LLM application

Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > LLM Application Management and click Create LLM Application.

Configuration item	Description	Example
Application name	Enter a custom name.	Test Application
Model name	Select a published LLM gateway.	-
Base template	Select a base template.	-
Maximum concurrency	Default is 10. Contact operations staff to adjust.	-
Qualification	Select an approved qualification. View qualification details in Voice Service Qualification Management.	-
Script	Select an approved script. View script details in Voice Service Script Management.	-
Greeting	Text: Enter custom greeting text. Use variables in the format ${param}. Recording: Upload a greeting audio file in WAV format, up to 10 MB. Note To ensure natural-sounding voice, if you use a recording for the greeting, select a cloned voice matching that recording when choosing voice tone.	Text: Hello, are you ${param}?
Prompt	Enter custom prompt text. Use variables in the format ${param}. Note A prompt is an instruction given to the LLM to tell it what you want it to do.	You are Xiao Mei, a professional Alibaba Cloud customer service agent handling pre-sales inquiries. Strictly follow these rules: 1. If the customer asks about ${problem}, direct them to view details in ${document}. 2. If the customer asks about ${other}, direct them to view details in ${answer}.
Call voice tone	Choose a preset or custom voice tone. Adjust voice style, speed, and volume. Enter sample text to preview online.	-

Click Submit to finish adding the application.

Configure LLM application properties

Note

If you need a custom voice tone, upload an audio file. The LLM will learn a personalized voice based on its characteristics. For instructions, see Custom voice tone.
To add a hotword library to your LLM, create one yourself. For instructions, see Speech-to-text hotword library.

Select your LLM application and click Conversation Properties in the Actions column.

In the dialog box, configure the following parameters:

Configuration option	Description	Best practice setting
Silence duration	Set the wait time before triggering the silence model when neither party speaks. Range: 3–15 seconds.	5
Silence-triggered model	When the customer does not speak, the model application sends a fixed script to the LLM: User did not speak. Contact technical support for custom content. Values: Yes: Enable silence-triggered model. The model generates content when both parties are silent and pushes the silence event to your LLM. No: Disable silence-triggered model.	Yes
Silence hangup configuration	Set the number of silence events that trigger automatic hangup. Range: 1–5 times.	3
Intelligent Answering Detection	Enabled by default. Automatically detects voice assistants and voicemail. Returns LlmSmartCallReport—call record message and smart status code.	-
Hang up immediately	When smart answering detects a voice assistant or voicemail, choose whether to end the call immediately. Default is No. Enable this option as needed.	-
Maximum call duration	Range: 300–3600 seconds. The call ends automatically when the limit is reached.	-
Morse code playback configuration	When enabled, play an AI-generated Morse code announcement after the conversation ends.	-
Non-Disruptive Opening	When enabled, the LLM blocks all interruptions during the greeting period. Use this for critical announcements or identity verification to prevent early termination due to customer interjections or background noise. Options: Do not allow any interruption Do not allow interruption during greeting period	-
Prevent repeated interruptions	If the number of interruptions exceeds the threshold, the Large Language Model (LLM) ignores interruptions for the configured duration, forces completion of its response, and ensures smooth conversation flow. Configurable: Interruptions are not allowed for X seconds after X consecutive interruptions.	-
Interruption filler words	When enabled, the LLM responds with natural filler words like “Go ahead” or “Hmm” before yielding the floor to the customer. This reduces robotic feel and improves conversational flow.	-

Click OK to finish configuration.

Step 3: Associate Xiaoji Agent configuration (optional)

Prerequisites: You have created a Xiaoji Agent. See Xiaoji Agent.

Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > LLM Application Management.
Find your application and click Xiaoji Agent Configuration in the Actions column.
Set Associate Xiaoji Agent to Yes and select the corresponding Xiaoji Agent.

Associating a Xiaoji Agent incurs additional fees. See Billing Details for pricing. If you don’t have a suitable Xiaoji Agent, click Create Xiaoji Agent to create one.

Step 4: Configure message receipts (optional)

Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > Message Receipt Configuration.
On the Message Receipt Configuration tab, find your desired message type, enable the corresponding toggle, and complete setup.
Message queue consumption mode
1. Enable the toggle next to General Queue.
  
  After enabling, Default Queue shows the queue name (for example, Alicom-Queue-18...-LlmSmartCallReport). Retrieve message receipts from this default queue. Keep Application-Specific Queue disabled.
2. After configuration, pull messages using the queue name. For a consumption demo, see Lightweight Message Queue (formerly MNS) Consumption Demo.
HTTP batch push mode
1. Enable the toggle next to Message Receiving URL.
2. Enter your message receipt receiving URL in the input field. The system sends messages to this URL via HTTP requests. Ensure your endpoint returns HTTP status code 200.
  
  For example, enter https://push.example.com/contextpath/receive.do. On successful test, the response is {"code": 0, "msg": "Success"}. To configure different URLs for different applications, enable the Application-Specific URL toggle.

Step 5: Initiate a call

After configuration, call the LlmSmartCall API to initiate a call. This API call incurs charges. Insufficient balance may cause the call to fail.

You can use the OpenAPI Explorer for online debugging. Enter the following parameters:

CalledNumber: The callee number—the user who receives the call.
ApplicationCode: The LLM application code. Find it on the LLM Application Management page in the Artificial Intelligence Cloud Call Service console.
CallerNumber: The caller number—the number you applied for. Find it on the Real Numbers page in the Voice Service console.
StartWordParam: Optional. If your LLM application greeting includes variables, pass their values in this field.

FAQ

References

Overview of Communication Intelligence Engine: Communication Intelligence Engine
Create, configure, test, and publish an LLM gateway: LLM Gateway Configuration
Create and configure an LLM application: LLM Application Management
Integrate your own LLM with the Communication Engine: LLM Gateway Integration Parameter Protocol
Create a hotword library for LLM applications: Speech-to-text hotword library
Create a custom voice tone for LLM applications: Custom voice tone