By default, Hermes Agent uses the Alibaba Cloud Model Studio (Bailian) Coding Plan model. To use a model plan from another provider, reconfigure Hermes Agent for that provider. This document provides configuration examples for Bailian Token Plan Team Edition, MiniMax Token Plan, VolcEngine Ark Agent Plan, and Tencent Token Plan.
Steps for each provider
Bailian Token Plan Team Edition
- Password-free logon to the terminal: Log on to the Simple Application Server console, find the target server card, click Remote Connection, and in the Connect with One-click on Workbench area, click Log on Now.
- Subscribe to the Token Plan Team Edition plan and obtain an API key.
- Configure access credentials: Run the following commands in the terminal. Replace YOUR_API_KEY with the API key for your Bailian Token Plan Team Edition.
    hermes config set model.provider custom
    hermes config set model.base_url https://token-plan.cn-beijing.maas.aliyuncs.com/compatible-mode/v1
    hermes config set model.api_key YOUR_API_KEY
    hermes config set model.default qwen3.6-plus
  These commands write the configuration to the ~/.hermes/config.yaml file. You can also edit the file directly and add the following content:
    model:
      default: qwen3.6-plus
      provider: custom
      base_url: https://token-plan.cn-beijing.maas.aliyuncs.com/compatible-mode/v1
      api_key: YOUR_API_KEY
- Verify the configuration: After the configuration is complete, run the following command to send a test message:
    hermes chat -q "Hello"
  A valid response confirms a successful configuration.
- Switch models: Use the -m parameter to specify a different model:
    hermes chat -m deepseek-v4-pro
Models supported by Bailian Token Plan Team Edition:
| Model name | Model provider |
|---|---|
| | Qwen |
| | Qwen |
| | DeepSeek |
| | DeepSeek |
| | DeepSeek |
| | Moonshot |
| | Moonshot |
| | Zhipu |
| | Zhipu |
| | MiniMax |
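The credential step above can be wrapped in a small script so that the same four commands are reusable for any provider that uses the custom provider type. This is a minimal sketch, not part of Hermes Agent: the names `DRY_RUN`, `run`, `configure_hermes`, and `HERMES_API_KEY` are hypothetical, and only the four `hermes config set` commands are taken from the steps above. By default it only prints the commands so that you can review them first.

```shell
# Hypothetical helper names: DRY_RUN, run, configure_hermes, HERMES_API_KEY.
# Only the four `hermes config set` commands come from the steps above.

# DRY_RUN=1 (the default here) prints each command instead of executing it.
DRY_RUN=${DRY_RUN:-1}

run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Arguments: base URL, API key, default model.
configure_hermes() {
    run hermes config set model.provider custom
    run hermes config set model.base_url "$1"
    run hermes config set model.api_key "$2"
    run hermes config set model.default "$3"
}

# Preview the Bailian Token Plan Team Edition settings.
# The key is read from HERMES_API_KEY if set, else the placeholder is shown.
configure_hermes \
    "https://token-plan.cn-beijing.maas.aliyuncs.com/compatible-mode/v1" \
    "${HERMES_API_KEY:-YOUR_API_KEY}" \
    "qwen3.6-plus"
```

Once the printed commands look right, running the script with `DRY_RUN=0` executes them instead of printing them.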
MiniMax Token Plan
- Password-free logon to the terminal: Log on to the Simple Application Server console, find the target server card, click Remote Connection, and in the Connect with One-click on Workbench area, click Log on Now.
- Subscribe to the MiniMax Token Plan and obtain an API key.
- Configure the model: Run the following command in the terminal to start the model configuration wizard.
    hermes model
  In the interactive configuration, make the following selections:
    Select model provider → MiniMax China (Chinese mainland endpoint)
    Enter API key → Paste the MiniMax Token Plan API key you obtained in the previous step.
    default model → MiniMax-M2.7
  Note: For users outside of the Chinese mainland, select the MiniMax Global endpoint.
- Verify the configuration: After the configuration is complete, run the following command to check the connection status.
    hermes doctor
- Start Hermes Agent: After successful verification, run the following command in the terminal to start Hermes Agent.
    hermes
VolcEngine Ark Agent Plan
- Password-free logon to the terminal: Log on to the Simple Application Server console, find the target server card, click Remote Connection, and in the Connect with One-click on Workbench area, click Log on Now.
- Subscribe to the VolcEngine Ark Agent Plan and obtain an API key from the VolcEngine console.
- Configure access credentials: Run the following commands in the terminal. Replace YOUR_ARK_API_KEY with the API key you obtained in the previous step.
  Warning: Do not use https://ark.cn-beijing.volces.com/api/v3 as the base URL. Requests sent to this address do not consume your Agent Plan quota and will incur additional charges.
    hermes config set model.provider custom
    hermes config set model.base_url https://ark.cn-beijing.volces.com/api/plan/v3
    hermes config set model.api_key YOUR_ARK_API_KEY
    hermes config set model.default doubao-seed-2.0-pro
  These commands write the configuration to the ~/.hermes/config.yaml file. You can also edit the file directly and add the following content:
    model:
      default: doubao-seed-2.0-pro
      provider: custom
      base_url: https://ark.cn-beijing.volces.com/api/plan/v3
      api_key: YOUR_ARK_API_KEY
- Verify the configuration: After the configuration is complete, run the following command to send a test message:
    hermes chat -q "Hello"
  A valid response confirms a successful configuration.
- Switch models: To switch models, specify the model with the -m parameter:
    hermes chat -m glm-5.1
Models supported by the Agent Plan:
| Model name | Model provider |
|---|---|
| | VolcEngine |
| | ByteDance |
| | ByteDance |
| | ByteDance |
| | ByteDance |
| | Zhipu |
| | DeepSeek |
| | MiniMax |
| | Moonshot |
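Because the billed endpoint and the Agent Plan endpoint differ only by the `/plan` path segment, it can help to check a base URL before writing it to the configuration. The following is a minimal sketch under that assumption. The function name `is_agent_plan_url` is hypothetical and not part of Hermes Agent or VolcEngine tooling; the two URLs are the ones named in the warning above.

```shell
# Hypothetical helper: succeeds only for URLs that end in /api/plan/v3
# (with or without a trailing slash), the Agent Plan path named above.
is_agent_plan_url() {
    case "$1" in
        */api/plan/v3|*/api/plan/v3/) return 0 ;;
        *) return 1 ;;
    esac
}

url="https://ark.cn-beijing.volces.com/api/plan/v3"
if is_agent_plan_url "$url"; then
    echo "OK: $url is an Agent Plan endpoint"
else
    echo "WARNING: $url is not an Agent Plan endpoint" >&2
fi
```

The check is purely textual: it does not contact the server or confirm that the API key is valid.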
Tencent Token Plan
- Password-free logon to the terminal: Log on to the Simple Application Server console, find the target server card, click Remote Connection, and in the Connect with One-click on Workbench area, click Log on Now.
- Subscribe to the Tencent Cloud Token Plan and generate an API key (in the format sk-tp-xxx) in the TokenHub console.
- Configure access credentials: Run the following commands in the terminal. Replace YOUR_API_KEY with the API key you obtained in the previous step.
  Note: Model IDs must be configured in all lowercase.
    hermes config set model.provider custom
    hermes config set model.base_url https://api.lkeap.cloud.tencent.com/plan/v3
    hermes config set model.api_key YOUR_API_KEY
    hermes config set model.default hunyuan-2.0-thinking
  These commands write the configuration to the ~/.hermes/config.yaml file. You can also edit the file directly and add the following content:
    model:
      default: hunyuan-2.0-thinking
      provider: custom
      base_url: https://api.lkeap.cloud.tencent.com/plan/v3
      api_key: YOUR_API_KEY
- Verify the configuration: After the configuration is complete, run the following command to send a test message:
    hermes chat -q "Hello"
  A valid response confirms a successful configuration.
- Switch models: To switch models, specify the model with the -m parameter:
    hermes chat -m kimi-k2.5
Models supported by the Token Plan:
| Model name | Model provider |
|---|---|
| | Tencent |
| | Tencent |
| | Tencent |
| | Tencent |
| | Tencent |
| | MiniMax |
| | Moonshot |
| | Zhipu |
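Since Tencent Token Plan model IDs must be all lowercase, a model name copied from a console or a document can be normalized before it is used. The following is a small sketch of that step. The function name `normalize_model_id` is hypothetical, and the mixed-case input is only an illustration.

```shell
# Hypothetical helper: convert a model ID to all lowercase.
normalize_model_id() {
    printf '%s' "$1" | tr '[:upper:]' '[:lower:]'
}

# Illustrative mixed-case input; the result is the ID used in the steps above.
model=$(normalize_model_id "Hunyuan-2.0-Thinking")
echo "hermes config set model.default $model"
```

The script prints the resulting command rather than running it, so the normalized ID can be reviewed first.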
FAQ
Changing the default model
This section uses the qwen3.6-plus model from Bailian Token Plan as an example to describe two approaches: temporary and persistent. For models from other providers, such as MiniMax, VolcEngine, and Tencent Token Plan, follow the same steps but replace the model name in the command with a model supported by the target provider.
Use qwen3.6-plus temporarily: To use qwen3.6-plus only for the current task, run the following command:
hermes chat -m qwen3.6-plus
This action applies only to the current task and does not modify the default model configuration.
Use qwen3.6-plus persistently: To use qwen3.6-plus for all future tasks, run the following command to set it as the default model:
hermes config set model.default qwen3.6-plus
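To confirm that a persistent change took effect, the default model can be read back from ~/.hermes/config.yaml. The sketch below assumes the simple file layout shown in the provider sections: a top-level `model:` key with an indented `default:` line. The function name `get_default_model` is hypothetical, and the parsing is a plain text match, not a full YAML parser.

```shell
# Hypothetical helper: print the value of model.default from a config file.
# Assumes a top-level `model:` key followed by indented `key: value` lines.
get_default_model() {
    awk '
        /^model:/   { in_model = 1; next }   # entering the model block
        /^[^ \t#]/  { in_model = 0 }         # any other top-level key ends it
        in_model && $1 == "default:" { print $2; exit }
    ' "$1"
}

config="${HOME}/.hermes/config.yaml"
if [ -f "$config" ]; then
    echo "Persistent default model: $(get_default_model "$config")"
else
    echo "No config file found at $config" >&2
fi
```

A model selected with `hermes chat -m` does not appear here, because that option applies only to the current task.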