Lite edition

更新时间:
复制 MD 格式

This document covers the features and benchmark results of the lightweight engine.

Product overview

The LiteAdvanced search engine is a high-performance, low-cost solution designed for Agent WebSearch scenarios. This engine builds high-quality, full-text web page indexes on demand and offers several core advantages:

  • Low latency: Responds in under 500 ms on average, ensuring a smooth user experience for multi-turn Agent searches.

  • Superior semantic search: Offers precise semantic search that outperforms traditional keyword-based search engines, achieving state-of-the-art (SOTA) performance on the ChineseSimpleQA dataset.

  • Advanced search capabilities: Includes fine-grained filters for criteria like publish date and site.

  • Low cost: Lowers your total cost of ownership.

Latency

截屏2025-08-10 下午2

The LiteAdvanced engine outperforms other common search engines, achieving an average response time (RT) of under 500 ms and a P95 latency of less than 600 ms. This performance gives it a competitive edge against general-purpose engines.

Note
  1. Data for Exa and GoogleSerp is sourced from fastest-search-api.

  2. The GoogleSerp results are based on tests from international nodes. Latency is expected to be higher when tested from within the Chinese mainland.

  3. The LiteAdvanced and Generic engines include main_text, which further reduces page-read latency.

Quality

On a private dataset, the engine improved the overall_score by 13% compared to its predecessor, evaluated using a large model-based method for search quality.

The engine was also benchmarked on the public ChineseSimpleQA and SimpleQA datasets.

ChineseSimpleQA

Evaluation method

  • Judge Model: qwen-max

  • Answer Model: deepseek-v3

  • Simplified logic for context enhancement with search results:

    def wrap_xml_context(search_results):
      """
          Wrap search results into XML format for context.
        """
      page_items = search_results.get("pageItems", []) if search_results else []
    
      if not page_items:
        return "<no_results>No relevant information found.</no_results>"
    
      xml_contexts = [
        f"<source>\n    <url>{item['link']}</url>\n    <passage>{item['snippet']}</passage>\n</source>"
        for item in page_items[:10]
      ]
        
      return "\n".join(xml_contexts)
    
    answer_prompt_template = """
        Please answer the following question based on the provided context.
        Question: {question}
        <context>
            {context}
        </context>
    """
    
    answer_prompt = answer_prompt_template.format(question = question, context = context)
    messages = [
      {"role": "system", "content": "You are an AI assistant. You use XML-formatted context to research people's questions."},
      {"role": "user", "content": answer_prompt}
    ]

Evaluation results

Search engine

Overall (f-score)

Chinese culture

Humanities and social sciences

Engineering and science

Lifestyle and culture

Society

Natural sciences

LLM Only (No WebSearch)

73.44

71.31

79.09

74.34

66.78

72.97

75.31

LiteAdvanced (Snippet)

92.51

93.25

92.96

90.01

93.23

91.62

93.71

Google (Snippet, Serp)

90.37

92.20

91.09

88.45

89.95

92.14

89.10

Bing (Snippet, Serp)

89.87

92.33

91.60

86.42

89.32

91.64

88.54

Engine 1 (Summary)

82.23

90.79

83.53

80.39

81.49

79.57

79.95

Engine 2 (Summary)

80.27

91.10

80.91

79.42

79.09

76.25

77.93

Engine 3 (Summary)

81.31

90.17

85.17

77.25

80.99

77.96

78.08

Note
  1. The snippet is a raw web page summary and does not involve AI-powered summarization. In contrast, the Summary results from other engines are typically longer (400–600 characters). The LiteAdvanced snippet is optimized to a length of 400–500 characters, making it ideal for Agent use cases.

  2. Engines 1, 2, and 3 refer to other popular, general-purpose search engines.

SimpleQA

Agent WebSearch requires high-quality, global, and multilingual content, distinguishing it from traditional search engines designed for humans. The SimpleQA benchmark is used to measure this global search capability. While many domestic engines lack sufficient multilingual support, LiteAdvanced is built on a high-quality global index.

Evaluation method

  • Judge Model: qwen-max

  • Answer Model: deepseek-v3

  • The context enhancement logic is the same as that used for the ChineseSimpleQA test, but adds the --search-engine parameter.

  • The test was conducted on the first 1,000 questions from the dataset.

 python -m simple-evals.simple_evals --model deepseek-v3 --eval simpleqa --examples 1000 --search-engine liteAdvanced

Evaluation results

Search engine

Score

Incorrect

Not attempted

Score: std

LLM Only (No WebSearch)

0.248

0.69

0.062

0.43

LiteAdvanced (Snippet)

0.592

0.166

0.242

0.49

Google (Snippet, Serp)

0.824

0.078

0.098

0.38

Bing (Snippet, Serp)

0.705

0.116

0.179

0.456

Engine 1 (Summary)

0.019

0.05

0.931

0.13

Engine 2 (Summary)

0.043

0.05

0.894

0.20

Features

The engine supports several advanced search parameters and has been optimized for Agent-friendly operation.

Time range search

The engine supports time range search, letting you specify a start and end date for the publish date. This is in addition to the generic TimeRange parameter.

"advancedParams": {
  "startPublishedDate": "2024-12-01",
  "endPublishedDate": "2025-01-31"
}

Site search

To perform a site search, specify domains to include (includeSites) or exclude (excludeSites). Both top-level (e.g., sina.com.cn) and subdomains (e.g., finance.sina.com.cn) are supported, with a limit of 20 sites per request.

"advancedParams": {
  "excludeSites": "aliyun.com,sina.cn"
}

Number of results

You can specify the number of results to return, from 1 to 50, depending on your use case and result processing capacity. The default is 10.

"advancedParams": {
  "numResults": "10"
}

Semantic search

Unlike previous versions that required keyword rewriting, the new engine supports semantic search with natural language queries (ideally under 30 characters). For example:

  • news about Zhejiang from the Two Sessions coverage

  • positive news about NIO cars

Longer snippets

While traditional search snippets are short (~100 characters), Agent workflows benefit from richer context. To meet this need, the LiteAdvanced engine provides a longer snippet (up to 500 characters) to deliver more information in a single call. This reduces the need to choose between a short snippet, the full main_text (~3,000 characters), or a secondary web crawl.

Limitations

  1. Performance for long-tail queries and real-time data needs (minute-level updates) may not match that of large-scale, traditional search engines. However, it outperforms other self-built search solutions.

Usage

For detailed instructions, see the IQS UnifiedSearch LiteAdvanced engine documentation. Set the engineType parameter to LiteAdvanced.

Release notes

Release date

Description

December 3, 2025

  • Added site authority score: websiteAuthorityScore.

  • Added content for stock prices and platform trending news.

  • Added multi-engine fallback to supplement results when recall is insufficient.

September 19, 2025

  • Increased the site limit for includeSites and excludeSites to 20.

September 8, 2025

  • Added indexed data for weather, stock information, exchange rates, and news.

August 7, 2025

  • Initial release of the lightweight engine.