AI Gateway is a unified API proxy service provided by ESA that forwards AI requests at the edge, adding observability, security, and performance optimization to your AI applications.
Features
AI Gateway proxies requests between your applications and AI service providers with these core capabilities:
- Unified access: Connect to multiple AI service providers through a single endpoint. Compatible with the OpenAI API standard, so you can switch models without changing code.
- Security: Built-in rate limiting, IP allowlists, and API key authentication protect your AI services from abuse and attacks.
- High availability: Automatic failover and request retry ensure business continuity. When a primary provider is unavailable, requests switch to a backup provider.
- Observability: Monitor request counts, token usage, response times, and costs in real time, with detailed request logs and audit trails.
How it works
AI Gateway runs on ESA's global edge nodes. Requests are processed as follows:
- Request reception: The client sends an API request to the unified endpoint, and the request is routed to the nearest edge node.
- Security verification: The edge node authenticates the request using API keys, IP allowlists, or other configured methods.
- Routing and forwarding: The request is forwarded to the target AI service provider (such as Alibaba Cloud Model Studio or OpenAI) based on configured routing rules.
- Result caching: The edge node caches the upstream response and returns it to the client.
- Monitoring records: The edge node records Sampled Logs and metrics such as Requests, Token Usage, and Error Count.
Routing mechanism: Specify the target AI model with the model parameter in {provider}/{model-name} format. Examples:
- openai/gpt-4o: Calls OpenAI's GPT-4o model.
- deepseek/deepseek-chat: Calls DeepSeek's chat model.
- qwen/qwen-plus: Calls Alibaba Cloud Model Studio's Qwen-Plus model.
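To make the {provider}/{model-name} format concrete, here is a minimal client-side sketch that splits a model string into its two parts. The helper name parseModelParam and the ModelRef type are hypothetical and not part of AI Gateway. Splitting on the first slash only is an assumption, chosen so that a model name containing a slash stays intact.

```typescript
// Hypothetical helper: splits a "{provider}/{model-name}" string into its parts.
interface ModelRef {
  provider: string;
  modelName: string;
}

function parseModelParam(model: string): ModelRef {
  // Assumption: only the first slash separates provider from model name.
  const slash = model.indexOf("/");
  // Require a non-empty provider before the slash and a non-empty name after it.
  if (slash <= 0 || slash === model.length - 1) {
    throw new Error(`Expected "{provider}/{model-name}", got "${model}"`);
  }
  return {
    provider: model.slice(0, slash),
    modelName: model.slice(slash + 1),
  };
}

console.log(parseModelParam("qwen/qwen-plus"));
```

A check like this can catch a missing provider prefix before the request leaves the application.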
Use cases
Multi-model comparison testing
Scenario: You need to compare the performance, speed, and cost of models from different providers before selecting one.
Solution: Switch between models by changing the model parameter. No application code changes or multiple API keys required.
Sample code:
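import OpenAI from "openai";

// The OpenAI SDK throws at construction unless an apiKey option or the
// OPENAI_API_KEY environment variable is set.
const client = new OpenAI({
  defaultHeaders: {
    // Replace ESA_API_TOKEN with the gateway's authentication token.
    "esa_aigw_auth": `Bearer ESA_API_TOKEN`,
  },
  // Replace YOUR_DOMAIN with the custom domain bound to the gateway.
  baseURL: "https://YOUR_DOMAIN/v1",
});

const response = await client.chat.completions.create({
  model: "qwen/qwen-plus", // Specify the model to use.
  messages: [{ role: "user", content: "What is Edge Security Acceleration?" }],
});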
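Because only the model string changes between providers, a comparison run can be a plain loop. The sketch below is one possible harness, not an AI Gateway feature: compareModels, Complete, and ComparisonRow are hypothetical names, and the actual request is injected as a callback so the harness does not depend on any particular client.

```typescript
// Hypothetical signature: sends one prompt to one model and resolves with the reply text.
type Complete = (model: string, prompt: string) => Promise<string>;

interface ComparisonRow {
  model: string;
  latencyMs: number;
  reply: string;
}

async function compareModels(
  models: string[],
  prompt: string,
  complete: Complete,
): Promise<ComparisonRow[]> {
  const rows: ComparisonRow[] = [];
  // Run the models one after another so the latencies do not influence each other.
  for (const model of models) {
    const start = Date.now();
    const reply = await complete(model, prompt); // only the model string changes
    rows.push({ model, latencyMs: Date.now() - start, reply });
  }
  return rows;
}
```

With an OpenAI SDK client pointed at the gateway, the callback could wrap client.chat.completions.create and return choices[0].message.content. Wall-clock latency measured this way includes network time from the caller to the edge node.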
Primary/backup failover
Scenario: Relying on a single AI service provider risks service interruption.
Solution: Configure failover rules to switch to a backup provider when the primary is unavailable. Request retry is also supported.
Sample configuration:
import OpenAI from "openai";

const client = new OpenAI({
  defaultHeaders: {
    "esa_aigw_auth": `Bearer ESA_API_TOKEN`,
    "x-ai-route-metadata": "YOUR_IFELSE_CONDITION_RULE", // Required when an If...Else condition is configured.
  },
  baseURL: "https://api.example.com/v1",
});

const response = await client.chat.completions.create({
  model: "dynamic/ai-test", // Routing configuration
  messages: [
    { role: "user", content: "Hello!" }
  ],
});

console.log(response.choices[0].message.content);
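AI Gateway performs failover and retry on the edge, so the client code above stays unchanged. Purely to illustrate the idea, the following sketch shows the same pattern written out by hand. It is not the gateway's implementation: callWithFailover, the Call type, and the default of one retry per target are all assumptions made for this example.

```typescript
// Hypothetical type: `call` performs one attempt against one target
// (for example a provider/model string) and rejects on failure.
type Call<T> = (target: string) => Promise<T>;

async function callWithFailover<T>(
  targets: string[], // ordered: primary first, then backups
  call: Call<T>,
  retriesPerTarget = 1, // assumed default: one extra attempt before moving on
): Promise<T> {
  let lastError: unknown = new Error("no targets configured");
  for (const target of targets) {
    for (let attempt = 0; attempt <= retriesPerTarget; attempt++) {
      try {
        return await call(target);
      } catch (err) {
        // Remember the failure, then retry or fall through to the next target.
        lastError = err;
      }
    }
  }
  // Every target failed on every attempt.
  throw lastError;
}
```

A production version would usually add a delay between retries and distinguish retryable errors (timeouts, 5xx responses) from client errors that a backup provider cannot fix.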
Security and compliance
Scenario: Enterprise applications must meet data security and compliance requirements.
Solution: AI Gateway provides encrypted API key storage, request log retention, sensitive data masking, and access audit trails for compliance.
Notes
- API Key Management: AI Gateway requires API keys from upstream AI service providers. Store these keys securely and rotate them regularly. The keys are encrypted and stored in ESA.
- Model compatibility: The unified interface follows the OpenAI API standard. Some provider-specific features may not be supported.
- Monitoring data latency: Metrics may have a 1- to 5-minute delay. For near-real-time needs, supplement with log analysis.
Steps
Prerequisites
- You have obtained the API key from the upstream AI service provider, such as a Model Studio API key.
Step 1: Create an AI Gateway
- Go to the AI Gateway page.
- On the AI Gateway page, click Create AI Gateway. On the creation page, configure the following parameters and click OK:
  - Gateway Name: Enter a name for the gateway, such as my-ai-gateway.
  - Notes: (Optional) Enter a description for the gateway.
Step 2: Configure API keys
- In the newly created AI Gateway application, click the API Key Management tab and then click Add API Key.
- On the Add API Key page, configure the following parameters and click OK:
  - Name: Enter a custom name for the API key, such as qwen-api.
  - AI Service Provider: Select the AI service provider.
  - API Key: Enter the API key.
Step 3: Bind a custom domain
- In the newly created AI Gateway application, click the Feature Configuration tab and then click Add Domain Name.
- In the Add Domain Name section, enter a custom domain prefix, such as api-gateway. The console completes the domain name based on your site. Select the full domain name, such as api-gateway.example.com.
- Click OK.
Enable the Identity Verification feature to validate all requests.
Step 4: Call the AI Gateway
After completing the preceding steps, test the gateway.
- In the newly created AI Gateway application, click the Call Example tab and select a model for which an API key is configured.
- Copy the sample code. Replace your_domain with the bound custom domain name. If authentication is enabled, replace esa_api_token with the system-generated Authentication Token.
- Run the sample code. Example cURL command:
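As a complement to a shell command, the same request can be sketched in TypeScript without the OpenAI SDK. The helper buildChatRequest and the GatewayRequest type are hypothetical. The /v1/chat/completions path is an assumption based on what the OpenAI SDK appends to a baseURL ending in /v1, and the esa_aigw_auth header follows the SDK samples on this page.

```typescript
interface GatewayRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

function buildChatRequest(
  domain: string, // the custom domain bound to the gateway
  model: string, // "{provider}/{model-name}"
  userMessage: string,
  authToken?: string, // only needed when authentication is enabled
): GatewayRequest {
  const headers: Record<string, string> = { "Content-Type": "application/json" };
  if (authToken) {
    headers["esa_aigw_auth"] = `Bearer ${authToken}`;
  }
  return {
    // Assumption: the path the OpenAI SDK would call for baseURL "https://<domain>/v1".
    url: `https://${domain}/v1/chat/completions`,
    headers,
    body: JSON.stringify({
      model,
      messages: [{ role: "user", content: userMessage }],
    }),
  };
}

const exampleRequest = buildChatRequest(
  "api-gateway.example.com",
  "qwen/qwen-plus",
  "Hello!",
  "ESA_API_TOKEN",
);
console.log(exampleRequest.url);
```

The result can be sent with fetch(exampleRequest.url, { method: "POST", headers: exampleRequest.headers, body: exampleRequest.body }) once the placeholders are replaced with real values.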

Routing configuration
Configure routing rules to match models based on request characteristics. Add failover rules to switch to a backup provider or alternate model when a service becomes unavailable.
- In the AI Gateway application, select the Routing Configuration tab, click Add Route, and then enter a custom Name on the page that appears.
- After you create the routing configuration, click Edit in the Actions column for the corresponding route.
- On the edit page, configure model routing visually:
  - Configure routing conditions: In the visual model, click the add icon between the Start and Model Service flows, then click Add Condition to generate an If...Else decision flow. Next, click Configure in the If...Else flow and then click Add Condition in the panel on the right.
    - metadata.: Enter the key to be evaluated, such as test-key.
    - Logical operator: Select a logical operator, such as == (equals) or != (not equals).
    - Value: Enter the value that corresponds to metadata., such as ok.
    Note: By default, AI Gateway uses the primary provider when the condition is true and the backup provider when it is false. Edit the model configuration to change this behavior.
  - Configure the model service: For the true and false branches, click the add icon and select Add Model Service (or Add Condition). Then, click Configure on the Model Service process block to add model information:
    - AI Service Provider: Select an AI service provider from the drop-down list, such as Qwen.
    - Model Name: Select a model from the drop-down list, such as qwen3-max.
    - API Key: Select an added API key, such as qwen-api-key.
    Click Add Fallback Model to add a backup model for failover.
- After completing the configuration, click Save and Deploy. The system uses the x-ai-route-metadata header to match routing conditions and select a model service.
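The If...Else flow can be pictured as a small decision function. The sketch below is only a mental model under stated assumptions, not the gateway's code: the types Condition and Route and the functions evaluateCondition and selectModel are hypothetical. How the x-ai-route-metadata header value is encoded is not specified here, so the sketch takes metadata as an already-parsed key/value map.

```typescript
type Operator = "==" | "!=";

interface Condition {
  key: string; // the metadata key to evaluate, e.g. "test-key"
  operator: Operator;
  value: string; // the value to compare against, e.g. "ok"
}

interface Route {
  condition: Condition;
  whenTrue: string; // model service for the true branch (assumed provider/model string)
  whenFalse: string; // model service for the false branch
}

function evaluateCondition(cond: Condition, metadata: Record<string, string>): boolean {
  // Assumption: a missing key never equals the expected value,
  // so "==" yields false and "!=" yields true.
  const actual: string | undefined = metadata[cond.key];
  return cond.operator === "==" ? actual === cond.value : actual !== cond.value;
}

function selectModel(route: Route, metadata: Record<string, string>): string {
  return evaluateCondition(route.condition, metadata) ? route.whenTrue : route.whenFalse;
}

const exampleRoute: Route = {
  condition: { key: "test-key", operator: "==", value: "ok" },
  whenTrue: "qwen/qwen3-max",
  whenFalse: "deepseek/deepseek-chat",
};
console.log(selectModel(exampleRoute, { "test-key": "ok" }));
```

Nested conditions (Add Condition on a branch) would correspond to a branch holding another Route instead of a model string.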
Analytics and logs
ESA collects key data from all AI Gateway requests to help you analyze your AI application performance.
- In the AI Gateway application, select the Analytics tab.
- Select a time range to view these metrics:
  - Requests: Total requests through this AI Gateway in the selected time range.
  - Token Usage: Estimated tokens used by this AI Gateway in the selected time range.
  - Error Count: Failed requests recorded for this AI Gateway in the selected time range.
  - Sampled Logs: Detailed request and response information, including Edge Status Code, Time, Model, Request, and Response.
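If you export log entries for your own analysis, the three headline metrics can be approximated with a simple fold. The sketch below assumes a hypothetical record shape (SampledLog) loosely based on the fields listed above; the field names, the totalTokens field, and the rule that a status code of 400 or above counts as an error are all assumptions, not the console's definitions.

```typescript
// Hypothetical log shape, loosely based on the Sampled Logs fields.
interface SampledLog {
  edgeStatusCode: number;
  time: string;
  model: string;
  totalTokens: number; // assumed to be taken from the response's usage data
}

interface MetricsSummary {
  requests: number;
  tokenUsage: number;
  errorCount: number;
}

function summarizeLogs(logs: SampledLog[]): MetricsSummary {
  const summary: MetricsSummary = { requests: 0, tokenUsage: 0, errorCount: 0 };
  for (const log of logs) {
    summary.requests += 1;
    summary.tokenUsage += log.totalTokens;
    // Assumption: any status code of 400 or above counts as an error.
    if (log.edgeStatusCode >= 400) {
      summary.errorCount += 1;
    }
  }
  return summary;
}
```

Because the logs are sampled, totals computed this way cover only the sampled subset and will generally be lower than the figures shown on the Analytics tab.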
Supported AI service providers
AI Gateway supports the following AI service providers. More providers will be added.
| Provider | Capabilities | Model examples |
| --- | --- | --- |
| Alibaba Cloud Model Studio | Qwen series models | |
| OpenAI | Chat Completions, Embeddings | |
| DeepSeek | DeepSeek series models | |
Provider-specific features such as function calling and streaming responses may require additional configuration or have compatibility limitations. Check each provider's API documentation for details.
Billing
AI Gateway defaults to a Free Version at no additional cost, ideal for individual developers and personal projects. A paid plan for enterprise users will be available later.
| Feature | Free plan |
| --- | --- |
| Number of gateway applications | 1 |
| Number of custom domains | 1 |
| Number of routing configurations | 1 |
| Analytics time range | Past 24 hours |