Webpage tags

This document describes the enum values, distribution, and usage of tags in search results.

Tag overview

This feature provides tags for sites and URL paths. Tags are applied to URLs using regular expression (regex) patterns to help users understand and filter search results.

Important

Because some search results may lack tags, or a query might return no tagged results, follow these filtering principles:

  1. Prioritize exclusion logic. Use a blocklist to filter results. This approach minimizes the risk of unintentionally excluding results that lack tags.

  2. Configure a fallback strategy. If you use an inclusion-based filter, implement a fallback solution.

  3. Define fallback triggers. When the number of filtered results is zero or falls below a predefined threshold, your filtering logic should automatically revert to full recall mode, which disables all tag-based filtering.
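
The fallback principle above can be sketched as follows. This is a minimal illustration, not an official client: the function names, the `min_results` threshold, and the sample items are all assumptions made for the example. It assumes each result is a dictionary that may or may not carry a `tags` object.

```python
from typing import Any, Callable, Dict, List

PageItem = Dict[str, Any]


def filter_with_fallback(
    items: List[PageItem],
    keep: Callable[[PageItem], bool],
    min_results: int = 3,
) -> List[PageItem]:
    """Apply an inclusion filter; revert to full recall when too few items survive."""
    filtered = [item for item in items if keep(item)]
    # Fallback trigger: zero results, or fewer than the chosen threshold.
    if not filtered or len(filtered) < min_results:
        return list(items)  # full recall: tag-based filtering is disabled
    return filtered


def is_news_portal(item: PageItem) -> bool:
    """Inclusion rule (hypothetical): keep only items whose genre tag is NewsPortal."""
    tags = item.get("tags") or {}
    return tags.get("genre") == "NewsPortal"


# Hypothetical result set; the third item carries no tags at all.
results = [
    {"link": "https://example.com/a", "tags": {"genre": "NewsPortal"}},
    {"link": "https://example.com/b", "tags": {"genre": "Blog"}},
    {"link": "https://example.com/c"},
]

# Only one item matches, which is below the threshold of 2, so all 3 come back.
print(len(filter_with_fallback(results, is_news_portal, min_results=2)))  # 3
```

A suitable threshold depends on how many results your application needs downstream; the default of 3 here is arbitrary.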

Tag list

| Tag | Parameter | Type | Description |
| --- | --- | --- | --- |
| genre | content type | Enum | Identifies the webpage's content type, such as News Portal, Blog, Social Media, or Video Site. |
| isUgc | is UGC | Bool | Indicates whether the content is user-generated content (UGC). |
| ugcType | UGC type | Enum | When isUgc is true, this tag specifies the UGC category, such as Media Article, Blog Post, Structured Q&A, or Note Share. |
| industry | industry classification | Enum | Identifies the content's industry, such as Automotive, Finance, Medical, Government, or Travel. |
| isListPage | is list page | Bool | Indicates whether the page is a list page. |

Enum values and examples

The coverage of various tags changes continuously as indexed sites and paths are updated. Tag coverage for links is currently about 89%.

genre (content type)

| Value | Description | URL example |
| --- | --- | --- |
| NewsPortal | News Portal | |
| Blog | Blog | |
| ForumUgc | Forum UGC | |
| Commerce | Commerce | |
| Encyclopedia | Encyclopedia | |
| Social | Social Media | |
| VideoSite | Video Site | |

isUgc (is UGC)

| Value | Description | URL example |
| --- | --- | --- |
| true | User-generated content | |
| false | Not user-generated content | |

ugcType (UGC type)

The ugcType tag is only meaningful when isUgc is true.

| Value | Description | URL example |
| --- | --- | --- |
| MediaArticle | Media Article | |
| BlogPost | Blog Post | |
| StructuredQA | Structured Q&A | |
| NoteShare | Note Share | |
| ResourceAgg | Resource Aggregation | |
| TechTutorial | Tech Tutorial | |
| ShortFeed | Short Feed | |
| ForumPost | Forum Post | |
| ReviewEval | Review and Evaluation | |
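
Because ugcType only carries meaning when isUgc is true, client code may want to ignore it otherwise. The helper below is a minimal sketch of that rule. The function names are hypothetical, and accepting the string "true" alongside a real boolean is an assumption based on the sample response in this document, where Bool tags appear as strings.

```python
from typing import Any, Dict, Optional


def is_true(value: Any) -> bool:
    """Treat both the boolean True and the string "true" as true."""
    return value is True or (isinstance(value, str) and value.strip().lower() == "true")


def effective_ugc_type(tags: Dict[str, Any]) -> Optional[str]:
    """Return ugcType only when isUgc is true; otherwise return None."""
    if not is_true(tags.get("isUgc")):
        return None  # ugcType is not meaningful for non-UGC or untagged pages
    return tags.get("ugcType")
```

For example, `effective_ugc_type({"isUgc": "false", "ugcType": "ForumPost"})` returns `None`, so a stray ugcType value on a non-UGC page never reaches your filter rules.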

industry (industry classification)

| Value | Description | URL example |
| --- | --- | --- |
| General | General/Other | |
| News | General News | |
| Finance | Finance | |
| Education | Education & Training | |
| Entertainment | Entertainment & Leisure | |
| Tech | Technology | |
| Travel | Travel | |
| Auto | Automotive | |
| Gov | Government | |
| Medical | Medical & Health | |
| Recruitment | Recruitment | |

isListPage (is list page)

| Value | Description | URL example |
| --- | --- | --- |
| false | Detail page | https://tianqi.moji.com/forecast10/china/zhejiang/hangzhou |
| true | List page | |

Usage and recommendations

The UnifiedSearch API returns tags within the tags object for each pageItem in the response. The API does not support filtering by tags in the request, so you must filter the results you receive.

Sample request

curl -X POST https://cloud-iqs.aliyuncs.com/search/unified \
--header "Authorization: Bearer $API_KEY" \
--header "Content-Type: application/json" \
--data '{
  "query": "Hangzhou weather",
  "engineType": "LiteAdvanced",
  "contents": {
    "mainText": true,
    "markdownText": false,
    "summary": false,
    "rerankScore": true
  }
}'

Response

{
    "requestId": "c7ca543a-6fc3-4563-a39a-720ffb134978",
    "pageItems": [
        {
            "title": "Weather forecast for Hangzhou, Zhejiang on March 15, 2026",
            "link": "https://tianqi.moji.com/forecast10/china/zhejiang/hangzhou",
            ......
            "tags": {
                "genre": "Official",
                "isUgc": "false",
                "industry": "General",
                "isListPage": "false"
            }
        },
        ....
    ]
}
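
Since filtering happens on the client, a first step is usually to pull the tags out of each pageItem. The sketch below shows one way to do that. The helper names are hypothetical, the sample body is a trimmed version of the response above plus one invented untagged item, and converting the strings "true"/"false" to booleans is an assumption based on how Bool tags appear in that response.

```python
import json
from typing import Any, Dict, List, Tuple

BOOL_TAGS = ("isUgc", "isListPage")


def normalize_tags(raw: Dict[str, Any]) -> Dict[str, Any]:
    """Copy the tags and convert string booleans ("true"/"false") to bool."""
    tags = dict(raw)
    for key in BOOL_TAGS:
        value = tags.get(key)
        if isinstance(value, str):
            tags[key] = value.strip().lower() == "true"
    return tags


def extract_tags(response_body: str) -> List[Tuple[str, Dict[str, Any]]]:
    """Return (link, tags) pairs; items without tags get an empty dict."""
    data = json.loads(response_body)
    pairs = []
    for item in data.get("pageItems", []):
        pairs.append((item.get("link", ""), normalize_tags(item.get("tags") or {})))
    return pairs


# Trimmed sample body; the second item is invented to show the untagged case.
sample = """{
  "requestId": "c7ca543a-6fc3-4563-a39a-720ffb134978",
  "pageItems": [
    {
      "link": "https://tianqi.moji.com/forecast10/china/zhejiang/hangzhou",
      "tags": {"genre": "Official", "isUgc": "false",
               "industry": "General", "isListPage": "false"}
    },
    {"link": "https://example.com/untagged"}
  ]
}"""

for link, tags in extract_tags(sample):
    print(link, tags)
```

Returning an empty dictionary for untagged items keeps later filter code free of `None` checks.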

Recommendations

  1. Use tag exclusion whenever possible. This helps prevent empty result sets that can occur when filtering for a small number of included tags.

  2. Combine multiple tags for exclusion to achieve more precise filtering.

  3. Identify tag combinations that consistently produce undesirable results for your use case, and add them to your blocklist.
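
A blocklist built from tag combinations, as recommended above, might look like the sketch below. The rule format, the function names, and the two example rules are all hypothetical. An item is dropped only when it matches every tag in at least one rule, so items without tags are always kept.

```python
from typing import Any, Dict, List

Rule = Dict[str, Any]

# Hypothetical blocklist: drop an item when it matches ALL pairs of ANY rule.
BLOCKLIST: List[Rule] = [
    {"genre": "ForumUgc", "isListPage": True},
    {"isUgc": True, "ugcType": "ShortFeed"},
]


def _canon(value: Any) -> Any:
    """Map the strings "true"/"false" to booleans; leave other values as-is."""
    if isinstance(value, str) and value.strip().lower() in ("true", "false"):
        return value.strip().lower() == "true"
    return value


def matches(tags: Dict[str, Any], rule: Rule) -> bool:
    """True when the rule is non-empty and every pair in it equals the item's tag."""
    return bool(rule) and all(
        _canon(tags.get(key)) == _canon(expected) for key, expected in rule.items()
    )


def apply_blocklist(items: List[Dict[str, Any]], rules: List[Rule]) -> List[Dict[str, Any]]:
    """Keep every item that matches none of the blocklist rules."""
    kept = []
    for item in items:
        tags = item.get("tags") or {}
        if not any(matches(tags, rule) for rule in rules):
            kept.append(item)
    return kept
```

Keeping the rules as plain data makes it straightforward to add a new combination once you have identified it as a recurring source of poor results.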

Use cases

1. Automotive: Exclude low-authority and UGC sites
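
One possible reading of this use case is sketched below: for results of an automotive query, drop items tagged as UGC and items whose genre suggests a low-authority source. The tags do not define "low-authority", so the `LOW_AUTHORITY_GENRES` set is purely an assumption for illustration, as are the function names.

```python
from typing import Any, Dict, List

# Assumption: these genre values stand in for "low-authority" sources here.
LOW_AUTHORITY_GENRES = {"ForumUgc", "Social"}


def _is_true(value: Any) -> bool:
    """Accept the boolean True or the string "true"."""
    return value is True or (isinstance(value, str) and value.strip().lower() == "true")


def keep_for_automotive(item: Dict[str, Any]) -> bool:
    """Exclusion logic: reject UGC and low-authority genres, keep everything else."""
    tags = item.get("tags") or {}
    if _is_true(tags.get("isUgc")):
        return False
    if tags.get("genre") in LOW_AUTHORITY_GENRES:
        return False
    return True  # untagged items pass through


def filter_automotive(items: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Return only the items that survive the automotive exclusion rules."""
    return [item for item in items if keep_for_automotive(item)]
```

Because this is exclusion-based, results without tags are never removed.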

2. Exclude list pages
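
A minimal sketch of this use case follows. The function names and the list-page URL are hypothetical; the detail-page URL is the isListPage example from this document. As with the other Bool tags, the string "true" is accepted because the sample response returns booleans as strings.

```python
from typing import Any, Dict, List


def is_list_page(item: Dict[str, Any]) -> bool:
    """True only when the item is explicitly tagged as a list page."""
    value = (item.get("tags") or {}).get("isListPage")
    return value is True or (isinstance(value, str) and value.strip().lower() == "true")


def drop_list_pages(items: List[Dict[str, Any]]) -> List[Dict[str, Any]]:
    """Remove list pages; detail pages and untagged items are kept."""
    return [item for item in items if not is_list_page(item)]


pages = [
    {"link": "https://tianqi.moji.com/forecast10/china/zhejiang/hangzhou",
     "tags": {"isListPage": "false"}},
    {"link": "https://example.com/news/list", "tags": {"isListPage": "true"}},  # hypothetical
    {"link": "https://example.com/untagged"},  # hypothetical, no tags
]

for page in drop_list_pages(pages):
    print(page["link"])
```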