CreateModelService
Creates a model service.
Operation description
Before you call this operation, review the billing methods and pricing of AnalyticDB for PostgreSQL.
Try it now
Test
RAM authorization
|
Action |
Access level |
Resource type |
Condition key |
Dependent action |
|
gpdb:CreateModelService |
create |
*DBInstance
|
None | None |
Request parameters
|
Parameter |
Type |
Required |
Description |
Example |
| DBInstanceId |
string |
Yes |
The ID of the instance. Note
You can call the DescribeDBInstances operation to query the IDs of all AnalyticDB for PostgreSQL instances in a region. |
gp-xxxxxxxxx |
| ModelName |
string |
Yes |
The name of the model. |
Qwen3-Embedding-8B |
| Description |
string |
No |
The description of the model service. |
test |
| SecurityIPList |
string |
No |
The IP whitelist. Set this parameter to |
127.0.0.1 |
| AiNodes |
array |
Yes |
A list of AINodes on which to deploy the model. |
|
|
string |
No |
The AINode name. |
ai-xxxxxx |
|
| ModelParams |
object |
No |
The model parameters. This parameter is not yet supported. |
暂未开放 |
| ResourceGroupId |
string |
No |
The ID of the resource group to which the instance belongs. For more information about how to obtain the ID of a resource group, see View the basic information of a resource group. |
rg-bp67acfmxazb4p**** |
| ClientToken |
string |
No |
A token to ensure the idempotence of the request. For more information, see How to ensure idempotence. |
0c593ea1-3bea-11e9-b96b-88********** |
| Replicas |
integer |
No |
The number of model service replicas. |
1 |
| InferenceEngine |
string |
No |
The inference engine. Currently, only vllm is supported. |
vllm |
| EnablePublicConnection |
boolean |
No |
Specifies whether to enable a public network connection. |
false |
Response elements
|
Element |
Type |
Description |
Example |
|
object |
The response object. |
||
| ModelServiceId |
string |
The ID of the model service. |
ms-xxxxxxxxx |
| RequestId |
string |
The request ID. |
ABB39CC3-4488-4857-905D-2E4A051D0521 |
Examples
Success response
JSON format
{
"ModelServiceId": "ms-xxxxxxxxx",
"RequestId": "ABB39CC3-4488-4857-905D-2E4A051D0521"
}
Error codes
See Error Codes for a complete list.
Release notes
See Release Notes for a complete list.