Billing
This topic describes the billing models and pricing for the AI2T agent service for education.
Billing models
The AI2T agent service for education offers two flexible billing models.
Key concepts
Agent application types:
The platform provides the following five types of agents based on their capabilities:
Voice Interaction Application: An agent application that processes voice or text input and generates voice output. The end-to-end workflow includes capabilities like intent recognition, instructions, plugins, and long-term/short-term memory, but does not support image input.
Multimodal Interaction Application: An agent application that processes voice, text, or image input and generates voice output. The end-to-end workflow includes capabilities like intent recognition, instructions, plugins, and long-term/short-term memory. The agent supports image input.
Text Interaction Application: An agent application that processes voice or text input and generates text output. The end-to-end workflow includes capabilities like intent recognition, instructions, plugins, and long-term/short-term memory.
Translation Application: An agent application that processes voice or text input through a translation model and generates voice output. The workflow includes only the translation process and does not support standard agent capabilities such as casual chat, instructions, or plugins.
Visual Task Application: An agent application that processes text or image input and generates text output. This agent handles complex visual tasks, such as summarizing content from or searching for information related to IPC devices.
Definition of a conversation:
A conversation is a single turn of interaction between a user and an agent, consisting of one question and one answer. A conversation can use capabilities such as ASR (Automatic Speech Recognition), LLM (Large Language Model) understanding and generation, TTS (Text-to-Speech), intent recognition, instruction recognition, and long-term/short-term memory. Text-to-image and image-to-image generation are excluded.
Definition of an SKU:
An SKU is a stock-keeping unit that represents a specific quantity of service conversations for a particular agent application type. When making your first purchase, you can create a new SKU. This automatically generates an SKU for the selected agent application type and quantity. For subsequent purchases, you can select an existing SKU from the drop-down list to add more conversations.
Definition of a region:
A region is the location where the agent service is deployed. Currently, you can choose between Singapore and the Chinese mainland. The billing logic of both billing models described below is the same in every region, but the unit prices differ between regions.
Model 1: Resource pool (pay-per-conversation)

How it works
When you place an order, you can specify the number of conversations included for each device. When a device is activated, the corresponding number of conversations is automatically added to a shared resource pool. This resource pool can be used by all devices under your tenant. Once the resource pool is depleted, devices can no longer initiate new conversations. You must activate more devices to replenish the pool and resume service.
Formula
Total purchase cost = Unit price per conversation (varies by application type) × Quantity (conversations per device) × Number of devices
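The formula can be sketched in a few lines of Python. This is an illustration only, not part of the service: the function name `resource_pool_cost` is hypothetical, and the sketch assumes that the per-conversation unit price is the listed price per 1,000 conversations divided by 1,000. The sample figures use the Multimodal Interaction Application price for the Chinese mainland (117 CNY per 1,000 conversations) from the Pricing section.

```python
from decimal import Decimal


def resource_pool_cost(price_per_1000: Decimal,
                       conversations_per_device: int,
                       devices: int) -> Decimal:
    """Unit price per conversation x conversations per device x number of devices."""
    # Assumption: the pricing table quotes CNY per 1,000 conversations,
    # so the per-conversation price is that figure divided by 1,000.
    unit_price = price_per_1000 / Decimal(1000)
    return unit_price * conversations_per_device * devices


# Illustrative order: 100 devices with 100 conversations each.
pool_total = resource_pool_cost(Decimal("117"), 100, 100)
print(f"Total purchase cost: {pool_total:.2f} CNY")
```

Under these assumptions, 10,000 conversations at 117 CNY per 1,000 come to 1,170 CNY.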
Example A
You select the Multimodal Interaction Application and purchase a prepaid plan for 100 devices, with each device allocated 100 conversations.
When you activate each of these devices, 100 conversations are added to the resource pool for the Multimodal Interaction Application. This pool is shared by all devices under your tenant that use this application type. After you activate all 100 devices, a total of 10,000 conversations (100 devices × 100 conversations/device) are available in the shared pool. Some devices may use more than 100 conversations, while others may use fewer. Once all 10,000 conversations are used, you must purchase and activate additional devices to continue the service.
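The shared-pool behavior in Example A can be modeled with a small sketch. The `ResourcePool` class and its method names are hypothetical and only mirror the rules described above; they do not represent the service's actual implementation or API.

```python
class ResourcePool:
    """Hypothetical model of one shared pool for a single agent application type."""

    def __init__(self) -> None:
        self.remaining = 0

    def activate_device(self, conversations_per_device: int) -> None:
        # Activating a device adds its purchased allocation to the shared pool.
        self.remaining += conversations_per_device

    def start_conversation(self) -> bool:
        # Any device under the tenant draws from the same pool.
        # Once the pool is empty, no new conversation can start.
        if self.remaining == 0:
            return False
        self.remaining -= 1
        return True


pool = ResourcePool()
for _ in range(100):           # activate 100 devices
    pool.activate_device(100)  # each contributes 100 conversations
print(pool.remaining)          # 10000
```

Because the counter is shared, one device can consume more than its own 100 conversations as long as the pool as a whole is not empty.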
Procedure
Select the Agent Application Type.
Specify the Quantity of conversations for each device.
Select an existing SKU or choose Create SKU.
Enter the Number of Devices to purchase.
Click Purchase.
Model 2: License model with daily limits

How it works
You are charged based on the number of licenses you purchase. For a one-year service period, each license provides a device with a daily limit of 150, 300, or 500 conversations, depending on the package purchased. The daily usage limit resets at midnight.
Formula
Total purchase cost = Unit price per license (varies by application type and specification) × Number of devices
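As a rough illustration of this formula, the following Python sketch multiplies a license price by a device count. The function name `license_cost` is hypothetical. The sample figures use the Basic package price for the Voice Interaction Application in the Chinese mainland (12.00 CNY per license) from the Pricing section and assume one license per device.

```python
from decimal import Decimal


def license_cost(price_per_license: Decimal, devices: int) -> Decimal:
    """Unit price per license x number of devices (assumes one license per device)."""
    return price_per_license * devices


# Illustrative order: 1,000 devices on the Basic package (150 conversations/day).
license_total = license_cost(Decimal("12.00"), 1000)
print(f"Total purchase cost: {license_total:.2f} CNY")
```

With these sample values, the total is 12,000 CNY for the one-year service period.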
Example B
You purchase 1,000 licenses. After provisioning these licenses to 1,000 devices, each device can use the service. For one year from the activation date, each device has a daily limit of 150, 300, or 500 conversations, depending on the purchased package.
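The per-device daily limit can be pictured with the following sketch. The `DailyLimit` class is hypothetical and makes two simplifying assumptions: the caller supplies the current calendar date (the time zone that defines midnight is not specified in this topic), and the one-year license expiry is not modeled.

```python
from datetime import date


class DailyLimit:
    """Hypothetical per-device counter that resets when the calendar day changes."""

    def __init__(self, limit: int) -> None:
        self.limit = limit  # 150, 300, or 500 depending on the package
        self.day = None     # date of the most recent conversation
        self.used = 0       # conversations used on that date

    def try_conversation(self, today: date) -> bool:
        if today != self.day:
            # A new day has started: reset the usage counter.
            self.day = today
            self.used = 0
        if self.used >= self.limit:
            return False    # daily limit reached
        self.used += 1
        return True


device = DailyLimit(limit=150)
first_day = date(2025, 1, 1)
allowed = sum(device.try_conversation(first_day) for _ in range(200))
print(allowed)  # 150
```

Unlike the resource pool model, unused conversations are not shared between devices in this sketch; each device has its own counter.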
Procedure
Select the Agent Application Type.
Select the resource specification for each device.
Select an existing SKU or choose Create SKU.
Enter the Number of Devices to purchase.
Click Purchase.
Pricing
Model 1: Resource pool (pay-per-conversation)
Agent application type | Region | Price (CNY per 1,000 conversations) |
Voice Interaction Application | Chinese mainland | 83 |
Multimodal Interaction Application | Chinese mainland | 117 |
Text Interaction Application | Chinese mainland | 13 |
Visual Task Application | Chinese mainland | 33 |
Translation Application | Chinese mainland | 93 |
Voice Interaction Application | Singapore | 334 |
Multimodal Interaction Application | Singapore | 411 |
Text Interaction Application | Singapore | 110 |
Visual Task Application | Singapore | 81 |
Translation Application | Singapore | 179 |
Model 2: License model with daily limits
Agent application type | Region | Resource specification | Price (CNY per license) |
Voice Interaction Application | Chinese mainland | Basic package: 150 conversations/day | 12.00 |
Voice Interaction Application | Chinese mainland | Standard package: 300 conversations/day | 20.00 |
Voice Interaction Application | Chinese mainland | Premium package: 500 conversations/day | 30.00 |
Multimodal Interaction Application | Chinese mainland | Basic package: 150 conversations/day | 17.00 |
Multimodal Interaction Application | Chinese mainland | Standard package: 300 conversations/day | 32.50 |
Multimodal Interaction Application | Chinese mainland | Premium package: 500 conversations/day | 45.00 |
Text Interaction Application | Chinese mainland | Basic package: 150 conversations/day | 2.75 |
Text Interaction Application | Chinese mainland | Standard package: 300 conversations/day | 5.00 |
Text Interaction Application | Chinese mainland | Premium package: 500 conversations/day | 7.50 |
Visual Task Application | Chinese mainland | Basic package: 150 conversations/day | 5.00 |
Visual Task Application | Chinese mainland | Standard package: 300 conversations/day | 8.75 |
Visual Task Application | Chinese mainland | Premium package: 500 conversations/day | 14.00 |
Translation Application | Chinese mainland | Basic package: 150 conversations/day | 15.00 |
Translation Application | Chinese mainland | Standard package: 300 conversations/day | 27.50 |
Translation Application | Chinese mainland | Premium package: 500 conversations/day | 45.00 |
Voice Interaction Application | Singapore | Basic package: 150 conversations/day | 20.00 |
Voice Interaction Application | Singapore | Standard package: 300 conversations/day | 40.00 |
Voice Interaction Application | Singapore | Premium package: 500 conversations/day | 69.00 |
Multimodal Interaction Application | Singapore | Basic package: 150 conversations/day | 22.00 |
Multimodal Interaction Application | Singapore | Standard package: 300 conversations/day | 45.00 |
Multimodal Interaction Application | Singapore | Premium package: 500 conversations/day | 89.00 |
Text Interaction Application | Singapore | Basic package: 150 conversations/day | 10.50 |
Text Interaction Application | Singapore | Standard package: 300 conversations/day | 20.00 |
Text Interaction Application | Singapore | Premium package: 500 conversations/day | 34.00 |
Visual Task Application | Singapore | Basic package: 150 conversations/day | 5.00 |
Visual Task Application | Singapore | Standard package: 300 conversations/day | 9.60 |
Visual Task Application | Singapore | Premium package: 500 conversations/day | 15.80 |
Translation Application | Singapore | Basic package: 150 conversations/day | 22.00 |
Translation Application | Singapore | Standard package: 300 conversations/day | 43.00 |
Translation Application | Singapore | Premium package: 500 conversations/day | 72.00 |