数据库内AI模型推理-AI模型算子-云原生数据库 PolarDB-阿里云

当您希望在业务流程中集成AI能力（如文本摘要、智能问答、商品推荐）时，通常面临着复杂的AI基础设施搭建、高昂的算力成本以及数据跨域流动的安全风险。为了解决这些痛点，PolarDB推出了AI模型算子（Model as Operator）能力，它允许您通过简单的SQL语句或直接调用模型的访问地址进行推理。其核心价值在于，数据无需离开您的VPC，即可在数据库内部完成低延迟、高吞吐的AI计算，简化了AI应用的开发和维护，同时保障了数据的安全与合规。

功能简介

核心架构

PolarDB的AI模型算子本质上是一个部署在您集群内的模型推理服务网关。它作为PolarDB内核与底层AI算力之间的桥梁，通过统一的访问地址（私网/公网/PrivateLink）向您提供服务。

其核心数据流如下：

模型调用：可通过SQL函数调用或直接调用模型的访问地址。
1. 方式一：采用SQL函数的方式进行模型注册与调用。
  1. 模型注册：通过AI_CreateModel函数，在PolarDB中注册一个模型。注册信息包括模型的访问地址（model_url）和提供方（如阿里云大模型服务平台百炼或自定义服务）。
  2. SQL调用：业务应用执行AI_CallModel函数，将需要推理的数据（如一段文本）作为参数传入。
2. 方式二：直接调用提供方（如阿里云大模型服务平台百炼或自定义服务）的模型访问地址，模型输入参数应与提供方的规范保持一致。
服务代理：将请求转发给部署AI模型算子的PolarDB集群。
模型推理：AI模型算子根据注册信息，向后端的AI模型服务（如阿里云大模型服务平台百炼）发起调用，执行推理计算。
结果返回：AI模型算子获取推理结果，并将其返回给PolarDB，最终以SQL查询结果的形式呈现给您。

部署模式

AI模型算子支持专属资源+阿里云大模型服务平台百炼混池的混合部署架构。该架构的核心优势在于通过内置网关，在两个资源池之间进行流量分配。

特性	专属资源	阿里云大模型服务平台百炼混池	混合部署架构优势
资源隔离	物理隔离，独享GPU算力，性能稳定无干扰。	逻辑隔离，共享大规模算力资源池。	通过内置网关，将需要物理隔离和性能稳定性的核心请求定向至专属资源，其余请求利用混池的逻辑隔离，实现性能与成本的最佳平衡。
数据隐私	最高。模型和数据均在您VPC内处理。	较高。数据仍在您VPC内，但模型服务由阿里云大模型服务平台百炼托管。	无论流量走向何方，数据始终在您的VPC内。对于有数据不出VPC最严格合规要求的流量，网关可确保其始终在专属资源内处理，提供最高级别的安全保障。
成本	相对较高，按预留资源付费，适合作为稳定容量基石。	成本较低，按实际用量付费，适合承载绝大部分常规流量。	将主体流量引导至按量付费的阿里云大模型服务平台百炼混池，大幅降低基础成本。仅在必要时（如处理Corner Case）动用专属资源，避免了为全部流量预留昂贵专属算力的浪费，实现成本优化。
弹性	支持手动扩缩容，保障核心业务的确定性容量。	支持智能弹性伸缩，自动应对流量洪峰。	充分利用阿里云大模型服务平台百炼混池的智能弹性能力应对业务洪峰，同时专属资源作为稳定容量的基石和处理特定请求的快速通道。二者结合，既能从容应对流量波动，又能保证核心业务的绝对稳定。

核心性能

高吞吐：在PolarSearch集群中对接Qwen-Embedding模型时，通过AI模型算子能力，每分钟请求数（QPM）可达26,000以上。
低延迟：基于三层解耦内存池的PolarKVCache技术，大模型推理的首Token时延（TTFT）可降低7倍，整体吞吐量提升60%。

优势

开箱即用
您可以在数据库内直接调用主流的开源模型或您自己的定制模型，无需复杂的环境配置，即刻为您的数据应用注入AI能力。
数据不出域
数据始终保留在您自己的VPC（私有网络）内，并且模型可部署于指定地域。这确保了计算过程的物理隔离，完全满足数据安全、隐私保护的合规要求。
弹性伸缩 & 无感流控
系统能根据实时负载自动、平滑地伸缩资源，从容应对业务的波峰波谷。这既能保障服务的持续稳定，又能最大化资源利用率，降低成本。
高性能
基于三层解耦内存池的PolarKVCache技术，大幅提升了模型推理效率，实现首Token时延（TTFT）可降低7倍，整体吞吐能力提升60%，确保交互的流畅体验。
监控体系
提供覆盖模型从调用到返回的全链路监控。您可以清晰观测服务状态，并通过自动化的异常告警，第一时间发现并定位问题，保障业务稳定运行。
生态融合
服务与PolarDB、PolarSearch及IMCI等功能原生集成。让AI能力无缝融入您现有的数据分析与应用工作流，无需在多个系统间切换，简化开发。
灵活计费
提供多种计费模式：包年包月、按量付费与按Token计费。您可以根据业务的稳定或波动情况，自由选择最经济的成本方案。

应用场景

在数据库内进行向量检索与智能问答

在构建基于大语言模型的智能问答、知识库或推荐系统时，通常需要将业务数据（如产品文档、历史对话）转换为向量并存储。使用AI模型算子，您可以直接在数据库内调用Embedding模型（如Qwen3-Embedding-8B），将文本数据实时或批量转换为向量。结合PolarDB的向量检索能力，可以在一个SQL查询中完成用户问题向量化->相似向量匹配->返回相关业务数据的完整流程，无需将数据导出到外部向量数据库，简化了技术架构并降低了数据同步的延迟。

对海量数据进行自动化标注与清洗

对于存储在数据库中的海量非结构化数据，如用户评论、商品描述等，手动进行情感分析、标签提取或数据分类成本高昂且效率低下。您可以利用AI模型算子，编写一个SQL函数或存储过程，批量将这些数据发送给大语言模型（LLM）进行处理。例如，调用模型判断用户评论的情感倾向（正面/负面/中性）或提取关键信息，并将结果直接写回数据表的新字段中，实现数据的自动化富化和质量提升。

构建满足数据安全合规的AI应用

对于金融、医疗、政务等对数据安全和隐私有严格要求的行业，数据通常不允许离开客户的私有网络环境。AI模型算子的专属资源部署模式是理想的解决方案。它将AI模型服务直接部署在您的VPC内，确保模型调用和数据处理的全过程都在安全边界内完成，满足数据不出域的合规要求。这使得您可以在保障数据安全的前提下，利用AI能力进行风险预测、合规审计或敏感信息识别等高级分析。

适用范围

集群形态：集中式PolarDB PostgreSQL版集群，暂不支持PolarDB PostgreSQL分布式版集群。
数据库引擎：PostgreSQL 16。
地域：华东1（杭州）、华东2（上海）、华北2（北京）、华南1（深圳）、华北6（乌兰察布）、中国香港、新加坡以及德国（法兰克福）。

费用说明

AI模型的计费方式默认根据您选择的GPU节点规格和购买时长进行计算。此外，系统也支持按Token计费。如您有相关需求，请提交工单与我们联系，以便为您转换计费方式。

单个GPU节点的价格信息

华东1（杭州）

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	6405.92	13.3456
polar.pg.g8.xlarge.gu20	9686.95	20.1811
polar.pg.g8.2xlarge.gu49	9740	20.2917
polar.pg.x8.4xlarge.gu100	12075	25.1563
polar.pg.g8.xlarge.guu	12895.05	26.8646
polar.pg.g8.2xlarge.gu50	13125	28.1641
polar.pg.g4.4xlarge.gu10	13565.48	28.2614
polar.pg.g4.6xlarge.gu4	13860	28.875
polar.pg.g8.4xlarge.gu49	19480	40.5835
polar.pg.g8.4xlarge.gu20	21311.29	44.3985
polar.pg.g6.2xlarge.guh	23286.54	48.5136
polar.pg.g8.2xlarge.gu100	23346.4	48.6388
polar.pg.g8.4xlarge.gu200	24150	50.3125
polar.pg.g6.4xlarge.gu10	25623.68	53.3827
polar.pg.g8.2xlarge.guu	25790.1	53.7292
polar.pg.g8.4xlarge.gu50	26250	56.3281
polar.pg.g4.12xlarge.gu4	27720	57.75
polar.pg.g8.xlarge.gu30	34673.1	72.2356
polar.pg.g8.8xlarge.gu49	38960	81.167
polar.pg.g8.8xlarge.gu20	42622.58	88.7971
polar.pg.g6.4xlarge.guh	45811.39	95.4404
polar.pg.g8.4xlarge.gu100	46692.8	97.2776
polar.pg.g8.8xlarge.gu400	48300	100.625
polar.pg.g4.8xlarge.gu10	51247.35	106.7653
polar.pg.g8.3xlarge.guu	51580.2	107.4588
polar.pg.g8.8xlarge.gu50	52500	112.6563
polar.pg.g8.2xlarge.gu30	69346.2	144.4713
polar.pg.g8.16xlarge.gu49	74200	154.5833
polar.pg.g8.16xlarge.gu20	77495.6	161.4493
polar.pg.g12.4xlarge.guh	93146.16	194.0546
polar.pg.g8.8xlarge.gu100	93385.6	194.5538
polar.pg.x8.16xlarge.gu800	96600	201.25
polar.pg.g8.4xlarge.guu	103160.4	214.9175
polar.pg.g8.16xlarge.gu50	105000	218.75
polar.pg.g8.4xlarge.gu30	138692.4	288.9425
polar.pg.g8.16xlarge.gu100	182000	379.1667
polar.pg.g12.8xlarge.guh	186292.32	388.109
polar.pg.g8.8xlarge.guu	206320.8	429.835
polar.pg.g8.8xlarge.gu30	277384.8	577.885

华东2（上海）

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	6405.92	13.3456
polar.pg.g8.xlarge.gu20	9686.95	20.1811
polar.pg.g8.2xlarge.gu49	9740	20.2917
polar.pg.x8.4xlarge.gu100	12075	25.1563
polar.pg.g8.xlarge.guu	12895.05	26.8646
polar.pg.g8.2xlarge.gu50	13125	28.1641
polar.pg.g4.4xlarge.gu10	13565.48	28.2614
polar.pg.g4.6xlarge.gu4	13860	28.875
polar.pg.g8.4xlarge.gu49	19480	40.5835
polar.pg.g8.4xlarge.gu20	21311.29	44.3985
polar.pg.g6.2xlarge.guh	23286.54	48.5136
polar.pg.g8.2xlarge.gu100	23346.4	48.6388
polar.pg.g8.4xlarge.gu200	24150	50.3125
polar.pg.g6.4xlarge.gu10	25623.68	53.3827
polar.pg.g8.2xlarge.guu	25790.1	53.7292
polar.pg.g8.4xlarge.gu50	26250	56.3281
polar.pg.g4.12xlarge.gu4	27720	57.75
polar.pg.g8.xlarge.gu30	34673.1	72.2356
polar.pg.g8.8xlarge.gu49	38960	81.167
polar.pg.g8.8xlarge.gu20	42622.58	88.7971
polar.pg.g6.4xlarge.guh	45811.39	95.4404
polar.pg.g8.4xlarge.gu100	46692.8	97.2776
polar.pg.g8.8xlarge.gu400	48300	100.625
polar.pg.g4.8xlarge.gu10	51247.35	106.7653
polar.pg.g8.3xlarge.guu	51580.2	107.4588
polar.pg.g8.8xlarge.gu50	52500	112.6563
polar.pg.g8.2xlarge.gu30	69346.2	144.4713
polar.pg.g8.16xlarge.gu49	74200	154.5833
polar.pg.g8.16xlarge.gu20	77495.6	161.4493
polar.pg.g12.4xlarge.guh	93146.16	194.0546
polar.pg.g8.8xlarge.gu100	93385.6	194.5538
polar.pg.x8.16xlarge.gu800	96600	201.25
polar.pg.g8.4xlarge.guu	103160.4	214.9175
polar.pg.g8.16xlarge.gu50	105000	218.75
polar.pg.g8.4xlarge.gu30	138692.4	288.9425
polar.pg.g8.16xlarge.gu100	182000	379.1667
polar.pg.g12.8xlarge.guh	186292.32	388.109
polar.pg.g8.8xlarge.guu	206320.8	429.835
polar.pg.g8.8xlarge.gu30	277384.8	577.885

华北2（北京）

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	6405.92	13.3456
polar.pg.g8.xlarge.gu20	9686.95	20.1811
polar.pg.g8.2xlarge.gu49	9740	20.2917
polar.pg.x8.4xlarge.gu100	12075	25.1563
polar.pg.g8.xlarge.guu	12895.05	26.8646
polar.pg.g8.2xlarge.gu50	13125	28.1641
polar.pg.g4.4xlarge.gu10	13565.48	28.2614
polar.pg.g4.6xlarge.gu4	13860	28.875
polar.pg.g8.4xlarge.gu49	19480	40.5835
polar.pg.g8.4xlarge.gu20	21311.29	44.3985
polar.pg.g6.2xlarge.guh	23286.54	48.5136
polar.pg.g8.2xlarge.gu100	23346.4	48.6388
polar.pg.g8.4xlarge.gu200	24150	50.3125
polar.pg.g6.4xlarge.gu10	25623.68	53.3827
polar.pg.g8.2xlarge.guu	25790.1	53.7292
polar.pg.g8.4xlarge.gu50	26250	56.3281
polar.pg.g4.12xlarge.gu4	27720	57.75
polar.pg.g8.xlarge.gu30	34673.1	72.2356
polar.pg.g8.8xlarge.gu49	38960	81.167
polar.pg.g8.8xlarge.gu20	42622.58	88.7971
polar.pg.g6.4xlarge.guh	45811.39	95.4404
polar.pg.g8.4xlarge.gu100	46692.8	97.2776
polar.pg.g8.8xlarge.gu400	48300	100.625
polar.pg.g4.8xlarge.gu10	51247.35	106.7653
polar.pg.g8.3xlarge.guu	51580.2	107.4588
polar.pg.g8.8xlarge.gu50	52500	112.6563
polar.pg.g8.2xlarge.gu30	69346.2	144.4713
polar.pg.g8.16xlarge.gu49	74200	154.5833
polar.pg.g8.16xlarge.gu20	77495.6	161.4493
polar.pg.g12.4xlarge.guh	93146.16	194.0546
polar.pg.g8.8xlarge.gu100	93385.6	194.5538
polar.pg.x8.16xlarge.gu800	96600	201.25
polar.pg.g8.4xlarge.guu	103160.4	214.9175
polar.pg.g8.16xlarge.gu50	105000	218.75
polar.pg.g8.4xlarge.gu30	138692.4	288.9425
polar.pg.g8.16xlarge.gu100	182000	379.1667
polar.pg.g12.8xlarge.guh	186292.32	388.109
polar.pg.g8.8xlarge.guu	206320.8	429.835
polar.pg.g8.8xlarge.gu30	277384.8	577.885

华南1（深圳）

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	6405.92	13.3456
polar.pg.g8.xlarge.gu20	9686.95	20.1811
polar.pg.g8.2xlarge.gu49	9740	20.2917
polar.pg.x8.4xlarge.gu100	12075	25.1563
polar.pg.g4.4xlarge.gu10	13565.48	28.2614
polar.pg.g4.6xlarge.gu4	13860	28.875
polar.pg.g8.2xlarge.gu50	13125	29.2906
polar.pg.g8.4xlarge.gu49	19480	40.5835
polar.pg.g8.4xlarge.gu20	21311.29	44.3985
polar.pg.g8.2xlarge.gu100	23346.4	48.6388
polar.pg.g8.4xlarge.gu200	24150	50.3125
polar.pg.g6.4xlarge.gu10	25623.68	53.3827
polar.pg.g8.4xlarge.gu50	26250	58.5813
polar.pg.g4.12xlarge.gu4	27720	57.75
polar.pg.g8.xlarge.gu30	34673.1	72.2356
polar.pg.g8.8xlarge.gu49	38960	81.167
polar.pg.g8.8xlarge.gu20	42622.58	88.7971
polar.pg.g8.8xlarge.gu400	48300	100.625
polar.pg.g4.8xlarge.gu10	51247.35	106.7653
polar.pg.g8.8xlarge.gu50	52500	117.1625
polar.pg.g8.2xlarge.gu30	69346.2	144.4713
polar.pg.g8.16xlarge.gu49	74200	154.5833
polar.pg.g8.16xlarge.gu20	77495.6	161.4493
polar.pg.g8.8xlarge.gu100	93385.6	194.5538
polar.pg.x8.16xlarge.gu800	96600	201.25
polar.pg.g8.16xlarge.gu50	105000	227.5
polar.pg.g8.4xlarge.gu30	138692.4	288.9425
polar.pg.g8.16xlarge.gu100	182000	379.1667
polar.pg.g8.8xlarge.gu30	277384.8	577.885

华北6（乌兰察布）

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	5765.33	12.0106
polar.pg.g8.2xlarge.gu49	8766	18.26253
polar.pg.g8.xlarge.gu20	9686.95	20.1811
polar.pg.x8.4xlarge.gu100	10867.5	22.6408
polar.pg.g8.xlarge.guu	12069.4	25.1439
polar.pg.g4.4xlarge.gu10	12208.94	25.4352
polar.pg.g8.2xlarge.gu50	11813	25.5354
polar.pg.g4.6xlarge.gu4	12474	25.9882
polar.pg.g8.4xlarge.gu49	17532	36.52515
polar.pg.g8.2xlarge.gu100	21011.76	43.7738
polar.pg.g8.4xlarge.gu20	21311.29	44.3985
polar.pg.g8.4xlarge.gu200	21735	45.2816
polar.pg.g6.2xlarge.guh	22122.21	46.088
polar.pg.g6.4xlarge.gu10	23061.32	48.0438
polar.pg.g8.2xlarge.guu	24138.8	50.2877
polar.pg.g8.4xlarge.gu50	23625	51.0708
polar.pg.g4.12xlarge.gu4	24948	51.975
polar.pg.g8.xlarge.gu30	31205.79	65.0118
polar.pg.g8.8xlarge.gu49	35064	73.0503
polar.pg.g8.4xlarge.gu100	42023.52	87.5476
polar.pg.g8.8xlarge.gu20	42622.58	88.7971
polar.pg.g8.8xlarge.gu400	43470	90.5632
polar.pg.g6.4xlarge.guh	43520.82	90.6683
polar.pg.g4.8xlarge.gu10	46122.62	96.089
polar.pg.g8.3xlarge.guu	48276.2	100.5754
polar.pg.g8.8xlarge.gu50	47250	102.1417
polar.pg.g8.2xlarge.gu30	62411.58	130.0236
polar.pg.g8.16xlarge.gu49	66780	139.12497
polar.pg.g8.16xlarge.gu20	77495.6	161.4493
polar.pg.g8.8xlarge.gu100	84047.04	175.098
polar.pg.x8.16xlarge.gu800	86940	181.125
polar.pg.g12.4xlarge.guh	88488.85	184.3517
polar.pg.g8.16xlarge.gu50	94500	198.3333
polar.pg.g8.4xlarge.guu	96552.4	201.1508
polar.pg.g8.4xlarge.gu30	124823.16	260.0486
polar.pg.g12.8xlarge.guh	176977.7	368.7036
polar.pg.g8.8xlarge.guu	193104.8	401.8
polar.pg.g8.8xlarge.gu30	249646.32	520.0972

中国香港

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	10249.6	20.7998
polar.pg.g8.xlarge.gu20	12399.3	29.4158
polar.pg.g4.4xlarge.gu10	21704	45.2186
polar.pg.g4.6xlarge.gu4	22176	46.2
polar.pg.g8.2xlarge.gu100	27486.79	67.8902
polar.pg.g8.4xlarge.gu20	27378.34	71.3882
polar.pg.g6.4xlarge.gu10	40998.4	85.4126
polar.pg.g4.12xlarge.gu4	44352	92.4
polar.pg.g8.xlarge.gu30	55476.96	115.577
polar.pg.g8.4xlarge.gu100	54973.58	135.7804
polar.pg.g8.8xlarge.gu20	54756.69	142.7625
polar.pg.g4.8xlarge.gu10	81995.2	170.8238
polar.pg.g8.2xlarge.gu30	110953.92	231.154
polar.pg.g8.16xlarge.gu20	99557.61	264.7837
polar.pg.g8.8xlarge.gu100	109947.14	271.5605
polar.pg.g8.4xlarge.gu30	221907.84	462.308
polar.pg.g8.8xlarge.gu30	443815.68	924.616

新加坡

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	10249.6	20.7998
polar.pg.g8.xlarge.gu20	12496.16	29.6177
polar.pg.g4.4xlarge.gu10	21704	45.2186
polar.pg.g4.6xlarge.gu4	22176	46.2
polar.pg.g8.4xlarge.gu20	27412.29	71.4589
polar.pg.g6.4xlarge.gu10	40998.4	85.4126
polar.pg.g4.12xlarge.gu4	44352	92.4
polar.pg.g8.xlarge.gu30	55476.96	115.577
polar.pg.g8.8xlarge.gu20	54824.57	142.9039
polar.pg.g4.8xlarge.gu10	81995.2	170.8238
polar.pg.g8.2xlarge.gu30	110953.92	231.154
polar.pg.g8.16xlarge.gu20	99681.05	265.0409
polar.pg.g8.4xlarge.gu30	221907.84	462.308
polar.pg.g8.8xlarge.gu30	443815.68	924.616

德国（法兰克福）

节点规格	价格（元/月）	价格（元/小时）
polar.pg.g4.xlarge.gu10	10890.19	22.1004
polar.pg.g8.xlarge.gu20	14821.03	34.4611
polar.pg.g4.4xlarge.gu10	23060.51	48.0438
polar.pg.g4.6xlarge.gu4	23562	49.0882
polar.pg.g6.2xlarge.guh	32568.2	75.0175
polar.pg.g8.4xlarge.gu20	32504.4	82.0674
polar.pg.g6.4xlarge.gu10	43560.8	90.7508
polar.pg.g4.12xlarge.gu4	47124	98.175
polar.pg.g8.xlarge.gu30	58944.27	122.801
polar.pg.g6.4xlarge.guh	64069.6	147.8296
polar.pg.g8.8xlarge.gu20	65008.8	164.121
polar.pg.g4.8xlarge.gu10	87119.9	181.5016
polar.pg.g8.2xlarge.gu30	117888.54	245.6006
polar.pg.g12.4xlarge.guh	130271.4	285.7478
polar.pg.g8.16xlarge.gu20	118197.81	303.6174
polar.pg.g8.4xlarge.gu30	235777.08	491.2026
polar.pg.g12.8xlarge.guh	260541.4	571.4817
polar.pg.g8.8xlarge.gu30	471554.16	982.4052

Token价格信息

由包年包月或按量付费方式转为Token计费时，我们将根据您所选择的GPU节点规格以及历史/预计的Token消耗量等信息，为您制定合适的计费规则。

规格	中国内地单价 (元/万 Token)
`small`	0.075
`medium`	0.15
`large`	0.2
`xlarge`	0.45
`2xlarge`	0.8
`4xlarge`	1.6

创建AI模型实例

您可以通过以下两种方式进行创建：
- 前往PolarDB控制台，单击左侧导航栏的AI集群列表，并单击创建AI新集群。
- 前往PolarDB控制台，在左侧导航栏单击集群列表，找到符合适用范围的目标集群并进入集群详情页。单击左侧导航栏中的AI能力 > AI模型，并单击新建AI模型。

在模型购买页面中，请根据您的需求选择适合的配置：

配置项		说明
付费类型		包年包月：预付费模式。在创建应用时，您需选择固定规格的资源，并预先支付应用的费用。购买周期越长，所享受的折扣也越大。该模式一般适用于业务需求长期稳定的场景。按量付费：后付费模式。在创建应用时，您需选择固定规格的资源，但无需提前支付应用的费用。该模式根据您实际使用的时长进行计费，一般适用于业务需求灵活的场景。
引擎		固定为PolarDB。
地域		选择应用所在的地理位置。说明应用购买完成后，不支持更改地域。应用需与PolarDB PostgreSQL版集群位于同一地域。因此，请选择与PolarDB PostgreSQL版集群相同的地域。建议将应用与需要连接的ECS创建在同一地域，否则它们将无法通过内网（私网）实现互通，只能通过外网（公网）进行通信，这将无法充分发挥最佳性能。
架构		选择AI节点。
生态		选择PostgreSQL。
源 PolarDB 集群		选择需要创建模型的PolarDB集群。
扩展	您可以根据实际业务需求，选择是否预部署指定模型。
	预部署模型	自动部署开源LLM模型。部署成功后，将提供可调用的模型服务。模型列表：支持多种开源模型，您可以根据实际业务需求进行选择。例如，`Qwen3-VL-30B-A3B-Instruct`、`Qwen3-14B`或`DeepSeek-R1-Distill-Qwen-32B`等。部署方式：固定一键部署。推理引擎：可选择SGLang或VLLM。是否开启KVCache加速：选择是否开启KVCache加速。取值范围：10～100000，默认100，单位GB。详细价格信息，请以购买页为准。
	自定义模型	若您已有LLM模型，则可选择直接部署自有模型。部署方式：固定一键部署。推理引擎：可选择SGLang或VLLM。自定义模型：请填写模型所在的OSS Bucket、路径以及模型名称。自有模型参数：设置自有模型的部署相关参数。格式：以`--`开头。示例： `--gpu-memory-utilization`：0.85。 `--served-model-name`：Qwen3-0.6B-xxx。自有模型最大访问限制：您可以根据实际业务需求进行配置。取值范围：1～99999，默认100。说明付费类型需选择按量付费。地域需选择与OSS Bucket地域保持一致。在部署自有模型之前，需要先为PolarDB授权访问OSS Bucket的只读（包含ListObject操作）权限。建议设置一个专门用于托管自有模型的OSS Bucket，并授予PolarDB对该Bucket的整体只读权限。单击展开查看详细操作说明单击Bucket 列表，然后单击目标Bucket名称。在左侧导航栏，选择权限控制>Bucket 授权策略。在Bucket 授权策略页面的按图形策略添加页签，单击新增授权。在新增授权面板，按以下说明配置各项参数，然后单击确定。应用Bucket Policy。授权资源：可选择授权对象是整个Bucket还是指定资源。资源路径：授权资源选择整个Bucket时，资源路径为`bucket-name/`。授权资源选择指定资源时，需要授权到模型文件夹的上一层目录，且以`/`结尾。其他账号：填写PolarDB账号ID（201167356289492098）。授权操作：选择简单设置与只读（包含ListObject操作）。
实例规格类型		选择为GPU规格。
GPU资源		选择对应的GPU规格。说明在扩展选择自定义模型时，规格可随意选择，实际上并不生效。
开启周期分时		选择是否在固定的时间段自动启动资源，在其他时间段自动停止资源。
网络类型		固定为专有网络。
VPC网络		自动填写为源PolarDB集群的专有网络，无需手动填写。
可用区和交换机		配置VPC网络的交换机，建议选择与PolarDB PostgreSQL版集群的主可用区相同的交换机，以发挥最佳网络性能。如果已有的交换机无法满足您的要求，您可以自行创建交换机。
安全组		配置应用的安全组。
购买数量		选择需要购买的应用数量。说明每个PolarDB PostgreSQL版集群仅支持购买一个相同类型的AI应用。仅付费类型为包年包月时，支持配置。
购买时长		选择应用的购买时长。说明仅付费类型为包年包月时，支持配置。
自动续费		配置是否开启自动续费。为避免因忘记续费而导致业务中断，建议您开启自动续费。说明仅付费类型为包年包月时，支持配置。

购买成功后，请返回相应的AI模型列表页面，以查看新创建的模型。如果选择预部署模型，请耐心等待系统完成相应模型的部署。

管理AI模型实例

查看模型信息：进入AI模型列表页面，单击模型实例名称，您可以查看模型实例的基本信息与连接串信息。
查看模型实例监控信息：进入AI模型列表页面，单击操作列的查看云监控。您可以查看模型实例的请求数、负载率、Token消耗等信息。

模型开发

SQL函数调用：您可以根据模型管理中的SQL函数进行AI_CreateModel - 创建模型、AI_SetModelToken - 设置模型API-Key、AI_CallModel - 调用模型、AI_AlterModel - 更新模型以及AI_DropModel - 删除模型等操作。
直接调用模型的访问地址：您可以直接调用提供方（如阿里云大模型服务平台百炼或自定义服务）的模型访问地址，模型输入参数应与提供方的规范保持一致。