Quick Start for Communication Intelligence Engine

更新时间:
复制 MD 格式

Artificial Intelligence Cloud Call Service provides the Communication Intelligence Engine. By integrating with a Large Language Model (LLM), it encapsulates the call process to enable awareness and control of call events. This topic shows you how to get started quickly with the Communication Intelligence Engine.

The Communication Intelligence Engine integrates ASR (automatic speech recognition) and TTS (text-to-speech). It interacts with the LLM using text streams and detects and controls call events such as interruptions, silence, and overlapping speech to create a human-like interaction experience. Artificial Intelligence Cloud Call Service also offers other functional modules for your selection:

  • If you want an out-of-the-box solution: Choose Voice Agent, which includes Alibaba Cloud's communication LLM. You can connect quickly without configuring an LLM yourself.

  • If you prefer orchestration on a canvas: Choose Intelligent Call Robot. This option does not use an LLM but requires you to design call scripts in advance.

Integration flow

This topic guides you through configuring an LLM, connecting to the Communication Intelligence Engine, and creating an LLM application. The overall flow is as follows:

After integration is complete and the user answers the call, the Communication Intelligence Engine first plays a greeting. After receiving the user’s response, it sends the content to the LLM and plays audio based on the LLM’s output. During this process, the engine also handles call events such as interruptions and silence, as shown in the following diagram.

Preparations

  1. Register an Alibaba Cloud account and complete identity verification: Enterprise identity verification is required to use Artificial Intelligence Cloud Call Service. Individual accounts are not supported.

  2. Activate Artificial Intelligence Cloud Call Service: Go to the Artificial Intelligence Cloud Call Service console to activate the service.

  3. Activate Alibaba Cloud Model Studio (optional): If you plan to use Alibaba Cloud Model Studio models, log on to the Alibaba Cloud Model Studio console to activate the service.

Request a number

Before connecting to the Communication Intelligence Engine, you must complete applications for voice qualifications and number resources. Voice number qualification applications, resource requests, and bill viewing are all managed in Voice Service.

  1. Apply for voice qualifications: Go to the Voice Service console. In the navigation bar, choose Qualification and Script Management > Qualification Management and click Add New Qualification. Follow the instructions in Enterprise Qualification Management to fill in the required information and submit your application.

  2. Apply for voice scripts: After your qualification is approved, in the navigation bar, choose Qualification and Script Management > Scenario & Script Management and click Add Script. Fill in the requested information and script content as prompted. Select Dedicated Mode for the business mode.

  3. Apply for number resources: After your script is approved, in the navigation bar, choose Voice Numbers > Real Number Application and click Apply for Number. Request fixed-line and mobile phone numbers for the required regions based on your business needs.

    Note

    The monthly rental fee for a standard number is CNY 35 per month. Charges start in the month the number is activated. Partial months are billed as full months. You can also contact your account manager for fully managed service: Fully managed is a Voice Service feature that automatically applies for and replaces numbers based on your call volume to maintain high answer rates.

Configure an LLM

Before using the Communication Intelligence Engine, configure your own LLM. We recommend using Qwen-series models via Alibaba Cloud Model Studio, or using open source or commercially available third-party models.

  • If you use your own LLM or integrate a third-party LLM, it must comply with the Communication Engine Gateway Standard Protocol to connect.

  • If you choose Alibaba Cloud Model Studio, you can connect via a model or an application.

    • Connect via model: See Select a model to choose your preferred large language model. You will need this when creating an LLM gateway later.

    • Connect via application: Large language models cannot directly answer questions about private knowledge domains. Use Alibaba Cloud Model Studio’s agent application capabilities and private knowledge documents to build a Q&A application that handles domain-specific queries. For call scenarios, LLM applications must use streaming output. Otherwise, slow content generation causes choppy interactions. We currently recommend using agent applications or conversational workflow applications.

      Agent application

      1. Log on to Alibaba Cloud Model Studio and go to the Application Management page. Click Add Application.

      2. Select Agent Application and click Create Now.

      3. Select a Model Studio model and enter instructions, knowledge base details, and other settings. For step-by-step instructions, see Build a Q&A application with no code. When finished, click Publish.

        The configuration page also includes sections for Variables, Vision toggle, and Skills (supporting MCP Server, plugins, agents, and workflows). A text conversation test area on the right lets you enter questions to try the application.

      4. Return to the Application Management page and obtain the application ID.

      5. Replace YOUR_APP_ID in https://dashscope.aliyuncs.com/api/v1/apps/YOUR_APP_ID/completion with the application ID you obtained to get the model endpoint URL.

      Conversational workflow application

      1. Log on to Alibaba Cloud Model Studio and go to the Application Management page. Click Add Application.

      2. Select Workflow Application and click Create Conversational Workflow.

      3. Configure nodes as needed. For detailed steps, see Example 2: Intelligent Shopping Guide. When finished, click Publish.

        Note
        • Ensure the output node of the associated LLM has “Return Result” enabled. Otherwise, output is only available after the model completes its full response.

        • Ensure the workflow end node uses text output and has “Return Result” disabled. Otherwise, duplicate content may occur.

        • Conversational workflow applications support custom input parameters.

      4. Return to the Application Management page and obtain the application ID.

      5. Replace YOUR_APP_ID in https://dashscope.aliyuncs.com/api/v1/apps/YOUR_APP_ID/completion with the application ID you obtained to get the model endpoint URL.

Step 1: Create and publish an LLM gateway

Create an LLM gateway

  1. Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > LLM Gateway Configuration and click Create LLM Gateway.

  2. In the dialog box, enter model details. For example, if using Alibaba Cloud Model Studio:

    Note

    You can use Qwen-series models, open source models, or commercially mature third-party models. If building your own LLM, fine-tune it for your business scenario and configure prompts before connecting to the Communication Intelligence Engine. Your custom LLM must comply with the LLM Gateway Integration Parameter Protocol to connect.

    Configuration item

    Description

    Best practice setting

    Model name

    Enter a custom name to distinguish this LLM gateway.

    Test Gateway

    Model endpoint

    Enter the LLM invocation endpoint.

    • If using an Alibaba Cloud Model Studio model: https://dashscope.aliyuncs.com/compatible-mode/v1/chat/completions

    • If using an Alibaba Cloud Model Studio application: Replace YOUR_APP_ID in https://dashscope.aliyuncs.com/api/v1/apps/YOUR_APP_ID/completion with your application ID

    http://****

    Authorization code

    API key for the LLM service. If using Alibaba Cloud Model Studio, go to the Alibaba Cloud Model Studio console to get your API key. If you don’t have one, click Create My API_KEY in the upper-right corner.

    s*********************************1

    Model temperature

    Sampling temperature that controls the diversity of generated text.

    0.7

    Note

    This is a recommended value.

    Model topP

    Nucleus sampling probability threshold that controls the diversity of generated text.

    0.9

    Note

    This is a recommended value.

    Model topK

    Size of the candidate set during generation.

    50

    Note

    This is a recommended value.

    Base model version

    Model version passed as a parameter to the LLM. For Qwen models, versions include qwen-plus, qwen-turbo, and qwen-max. For more information, see Select a model.

    Note

    When using a Model Studio application, do not configure the base model version to avoid test errors.

    qwen-plus

Test and publish the LLM gateway

After creating the gateway, test it before publishing for use by LLM applications.

  1. Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose Communication Intelligence Engine > LLM Gateway Configuration. Find your gateway and click Test in the Actions column.

  2. In the dialog box, enter test content and click Test to view results.

    Results appear in a table with three columns: Base Model Version, Connectivity Test Result, and Test Result, showing the model version used, whether connectivity passed, and the model’s response.

  3. After a successful test, click Publish in the Actions column.

Step 2: Create and configure an LLM application

Create an LLM application

  1. Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > LLM Application Management and click Create LLM Application.

    Configuration item

    Description

    Example

    Application name

    Enter a custom name.

    Test Application

    Model name

    Select a published LLM gateway.

    -

    Base template

    Select a base template.

    -

    Maximum concurrency

    Default is 10. Contact operations staff to adjust.

    -

    Qualification

    Select an approved qualification. View qualification details in Voice Service Qualification Management.

    -

    Script

    Select an approved script. View script details in Voice Service Script Management.

    -

    Greeting

    • Text: Enter custom greeting text. Use variables in the format ${param}.

    • Recording: Upload a greeting audio file in WAV format, up to 10 MB.

      Note

      To ensure natural-sounding voice, if you use a recording for the greeting, select a cloned voice matching that recording when choosing voice tone.

    Text: Hello, are you ${param}?

    Prompt

    Enter custom prompt text. Use variables in the format ${param}.

    Note

    A prompt is an instruction given to the LLM to tell it what you want it to do.

    You are Xiao Mei, a professional Alibaba Cloud customer service agent handling pre-sales inquiries. Strictly follow these rules: 1. If the customer asks about ${problem}, direct them to view details in ${document}. 2. If the customer asks about ${other}, direct them to view details in ${answer}.

    Call voice tone

    Choose a preset or custom voice tone. Adjust voice style, speed, and volume. Enter sample text to preview online.

    -

  2. Click Submit to finish adding the application.

Configure LLM application properties

Note
  • If you need a custom voice tone, upload an audio file. The LLM will learn a personalized voice based on its characteristics. For instructions, see Custom voice tone.

  • To add a hotword library to your LLM, create one yourself. For instructions, see Speech-to-text hotword library.

  1. Select your LLM application and click Conversation Properties in the Actions column.

  2. In the dialog box, configure the following parameters:

    Configuration option

    Description

    Best practice setting

    Silence duration

    Set the wait time before triggering the silence model when neither party speaks. Range: 3–15 seconds.

    5

    Silence-triggered model

    When the customer does not speak, the model application sends a fixed script to the LLM: User did not speak. Contact technical support for custom content.

    Values:

    • Yes: Enable silence-triggered model. The model generates content when both parties are silent and pushes the silence event to your LLM.

    • No: Disable silence-triggered model.

    Yes

    Silence hangup configuration

    Set the number of silence events that trigger automatic hangup. Range: 1–5 times.

    3

    Intelligent Answering Detection

    Enabled by default. Automatically detects voice assistants and voicemail. Returns LlmSmartCallReport—call record message and smart status code.

    -

    Hang up immediately

    When smart answering detects a voice assistant or voicemail, choose whether to end the call immediately. Default is No. Enable this option as needed.

    -

    Maximum call duration

    Range: 300–3600 seconds. The call ends automatically when the limit is reached.

    -

    Morse code playback configuration

    When enabled, play an AI-generated Morse code announcement after the conversation ends.

    -

    Non-Disruptive Opening

    When enabled, the LLM blocks all interruptions during the greeting period. Use this for critical announcements or identity verification to prevent early termination due to customer interjections or background noise. Options:

    • Do not allow any interruption

    • Do not allow interruption during greeting period

    -

    Prevent repeated interruptions

    If the number of interruptions exceeds the threshold, the Large Language Model (LLM) ignores interruptions for the configured duration, forces completion of its response, and ensures smooth conversation flow.

    Configurable: Interruptions are not allowed for X seconds after X consecutive interruptions.

    -

    Interruption filler words

    When enabled, the LLM responds with natural filler words like “Go ahead” or “Hmm” before yielding the floor to the customer. This reduces robotic feel and improves conversational flow.

    -

  3. Click OK to finish configuration.

Step 3: Associate Xiaoji Agent configuration (optional)

Prerequisites: You have created a Xiaoji Agent. See Xiaoji Agent.

  1. Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > LLM Application Management.

  2. Find your application and click Xiaoji Agent Configuration in the Actions column.

  3. Set Associate Xiaoji Agent to Yes and select the corresponding Xiaoji Agent.

    Associating a Xiaoji Agent incurs additional fees. See Billing Details for pricing. If you don’t have a suitable Xiaoji Agent, click Create Xiaoji Agent to create one.

Step 4: Configure message receipts (optional)

  1. Go to the Artificial Intelligence Cloud Call Service console. In the navigation bar, choose LLM Communication > Communication Intelligence Engine > Message Receipt Configuration.

  2. On the Message Receipt Configuration tab, find your desired message type, enable the corresponding toggle, and complete setup.

    Message queue consumption mode

    1. Enable the toggle next to General Queue.

      After enabling, Default Queue shows the queue name (for example, Alicom-Queue-18...-LlmSmartCallReport). Retrieve message receipts from this default queue. Keep Application-Specific Queue disabled.

    2. After configuration, pull messages using the queue name. For a consumption demo, see Lightweight Message Queue (formerly MNS) Consumption Demo.

    HTTP batch push mode

    1. Enable the toggle next to Message Receiving URL.

    2. Enter your message receipt receiving URL in the input field. The system sends messages to this URL via HTTP requests. Ensure your endpoint returns HTTP status code 200.

      For example, enter https://push.example.com/contextpath/receive.do. On successful test, the response is {"code": 0, "msg": "Success"}. To configure different URLs for different applications, enable the Application-Specific URL toggle.

Step 5: Initiate a call

After configuration, call the LlmSmartCall API to initiate a call. This API call incurs charges. Insufficient balance may cause the call to fail.

You can use the OpenAPI Explorer for online debugging. Enter the following parameters:

  • CalledNumber: The callee number—the user who receives the call.

  • ApplicationCode: The LLM application code. Find it on the LLM Application Management page in the Artificial Intelligence Cloud Call Service console.

  • CallerNumber: The caller number—the number you applied for. Find it on the Real Numbers page in the Voice Service console.

  • StartWordParam: Optional. If your LLM application greeting includes variables, pass their values in this field.

FAQ

References