Some models in Model Studio Coding Plan, such as qwen3.6-plus, qwen3.5-plus, and kimi-k2.5, natively support visual understanding and can process images directly. For text-only models like glm-5 and MiniMax-M2.5, add a local Skill to enable visual capabilities.
Running an image understanding Skill consumes your Coding Plan quota. No other charges apply.
Prerequisites
- You have subscribed to Coding Plan. See Getting started.
- You have configured the connection in your Coding Plan tool and can start conversations. See Clients and developer tools.
Visual support status
| Model | Visual support | Description |
| --- | --- | --- |
| qwen3.6-plus, qwen3.5-plus, kimi-k2.5 | Yes | No extra configuration required. Pass images directly. |
| glm-5, MiniMax-M2.5 | No | Requires a Skill or Agent for vision. |
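In a setup script, the support status can be expressed as a small lookup. The following is a minimal sketch, not an official tool: the function name `needs_vision_skill` is hypothetical, and it only knows the models named on this page, so any other model is reported as unknown.

```shell
#!/bin/sh
# Hypothetical helper based only on the models named on this page.
# Exit status: 0 = needs a Skill or Agent for vision,
#              1 = has built-in vision support,
#              2 = model is not listed on this page.
needs_vision_skill() {
  case "$1" in
    qwen3.6-plus|qwen3.5-plus|kimi-k2.5) return 1 ;;
    glm-5|MiniMax-M2.5) return 0 ;;
    *) return 2 ;;
  esac
}
```

For example, `if needs_vision_skill glm-5; then echo "configure a Skill or Agent first"; fi` prints the reminder, because glm-5 is a text-only model.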
Method 1: Use a visual model directly (recommended)
qwen3.6-plus, qwen3.5-plus, and kimi-k2.5 have built-in vision support and are recommended if you frequently work with images.
| Tool | How to switch models |
| --- | --- |
| Claude Code | |
| OpenCode | |
| Qwen Code | |
To switch models in other tools, see Clients and developer tools. After switching, reference an image by its path, or drag or paste the image into the conversation.
Method 2: Add visual capabilities using a Skill or Agent
To process images using models without vision support, such as glm-5 and MiniMax-M2.5, configure a Skill or Agent.
Claude Code
- Add a Skill

  In your project directory, create a `skills/image-analyzer` directory in the `.claude` folder:

      mkdir -p .claude/skills/image-analyzer

  In this directory, create a `SKILL.md` file with the following content:

      ---
      name: image-analyzer
      description: Helps models without visual capabilities understand images. Use this skill to analyze image content, extract information, text, and UI elements from an image, or understand any visual content such as screenshots, charts, or architecture diagrams. Pass the image path to get a description.
      model: qwen3.6-plus
      ---
      qwen3.6-plus has visual understanding capabilities. Use the qwen3.6-plus model directly for image understanding.

  Directory structure:

      .claude/
      └── skills/
          └── image-analyzer/
              └── SKILL.md

- Get started

  - In your project directory, run `claude` to start Claude Code, and then run `/model glm-5` to switch to glm-5.
  - Download aliyun.png to your project directory and ask the following question:

        Load the image-analyzer skill and describe the information in the aliyun.png banner.

    The reply should describe the contents of the banner.
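The Skill setup above can also be scripted. The following is a minimal sketch that creates the directory and writes the same `SKILL.md` content shown in the steps. `PROJECT_DIR` is a hypothetical variable introduced for this example (it defaults to the current directory), and the final field check is an added convenience, not part of the documented procedure.

```shell
#!/bin/sh
set -e

# PROJECT_DIR is a hypothetical variable for this sketch; defaults to the current directory.
PROJECT_DIR="${PROJECT_DIR:-.}"
SKILL_DIR="$PROJECT_DIR/.claude/skills/image-analyzer"

mkdir -p "$SKILL_DIR"

# The quoted 'EOF' prevents the shell from expanding anything inside the file body.
cat > "$SKILL_DIR/SKILL.md" <<'EOF'
---
name: image-analyzer
description: Helps models without visual capabilities understand images. Use this skill to analyze image content, extract information, text, and UI elements from an image, or understand any visual content such as screenshots, charts, or architecture diagrams. Pass the image path to get a description.
model: qwen3.6-plus
---
qwen3.6-plus has visual understanding capabilities. Use the qwen3.6-plus model directly for image understanding.
EOF

# Confirm that the frontmatter fields used on this page are present.
for field in name description model; do
  grep -q "^$field:" "$SKILL_DIR/SKILL.md" || {
    echo "missing frontmatter field: $field" >&2
    exit 1
  }
done

echo "Created $SKILL_DIR/SKILL.md"
```

Run the script from the project root, then start Claude Code and switch to the text-only model as described in the steps.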
OpenCode
- Add an Agent

  In your project directory, create an `agents` directory in the `.opencode` folder:

      mkdir -p .opencode/agents

  In this directory, create an `image-analyzer.md` file with the following content:

      ---
      description: Analyzes images using a vision-capable model. Use this agent when the user needs to understand image content, extract information from screenshots, diagrams, UI mockups, or any visual content. Invoke with @image-analyzer followed by the image path and your question.
      mode: subagent
      model: bailian-coding-plan/qwen3.6-plus
      tools:
        write: false
        edit: false
      ---
      You have vision capabilities. Analyze the provided image and return a clear, structured description focused on what the user is asking about.

  Note: The model field must use the provider and model name defined in the OpenCode config file. For example, based on the configuration in the OpenCode documentation, the value is `bailian-coding-plan/qwen3.6-plus`.

  Directory structure:

      .opencode/
      └── agents/
          └── image-analyzer.md

- Get started

  - In your project directory, run `opencode` to start OpenCode, and then switch to glm-5.
  - Download aliyun.png to your project directory. Use `@` to invoke image-analyzer and ask the following question:

        @image-analyzer, describe the information in the aliyun.png banner.

    The reply should describe the contents of the banner.
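The Agent file can be generated the same way. Because the model field has to match the provider and model name in your own OpenCode config, this sketch takes it from a variable. `PROJECT_DIR` and `MODEL_REF` are hypothetical names introduced for this example. The default value of `MODEL_REF` is the one given on this page and may not match your configuration.

```shell
#!/bin/sh
set -e

# PROJECT_DIR and MODEL_REF are hypothetical variables for this sketch.
# MODEL_REF must match a provider and model name defined in your OpenCode config file.
PROJECT_DIR="${PROJECT_DIR:-.}"
MODEL_REF="${MODEL_REF:-bailian-coding-plan/qwen3.6-plus}"

# Reject values that are not in provider/model form.
case "$MODEL_REF" in
  ?*/?*) ;;
  *)
    echo "MODEL_REF must look like provider/model, got: $MODEL_REF" >&2
    exit 1
    ;;
esac

AGENT_DIR="$PROJECT_DIR/.opencode/agents"
mkdir -p "$AGENT_DIR"

# Unquoted EOF so that ${MODEL_REF} is expanded. The rest of the body
# contains no dollar signs, backticks, or backslashes, so it is written as-is.
cat > "$AGENT_DIR/image-analyzer.md" <<EOF
---
description: Analyzes images using a vision-capable model. Use this agent when the user needs to understand image content, extract information from screenshots, diagrams, UI mockups, or any visual content. Invoke with @image-analyzer followed by the image path and your question.
mode: subagent
model: ${MODEL_REF}
tools:
  write: false
  edit: false
---
You have vision capabilities. Analyze the provided image and return a clear, structured description focused on what the user is asking about.
EOF

echo "Created $AGENT_DIR/image-analyzer.md with model $MODEL_REF"
```

To use a different provider name, set the variable when running the script, for example `MODEL_REF=my-provider/qwen3.6-plus sh setup-agent.sh`, where both the provider name and the script file name are placeholders.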