The Service Marketplace aggregates all services on the Open Platform for AI Search, allowing you to view service details without logging in. You can use the Experience Center to try out core capabilities, such as document parsing, multimodal embedding, sorting, object detection, text vectorization, and video analysis, to quickly determine if they meet your business needs.
Service overview
Agent search services
Service category | Description |
Agentic Memory | The Agentic Memory service enables agents to store and retrieve long-term, short-term, and contextual memory. It uses a hybrid search technique that combines BM25 with vector retrieval for efficient and accurate memory recall. |
large language model (LLM) |
|
internet search | If your private knowledge base provides no answer during a search, you can enable internet search to retrieve additional information. This supplements your private knowledge base and, when combined with an LLM, enables richer responses. |
query analysis | This service uses LLMs and NLP capabilities to perform intent recognition, similar query expansion, and natural language to SQL (NL2SQL) conversion on user queries. This improves retrieval and question answering performance in retrieval-augmented generation (RAG) scenarios. |
sorting service |
|
Text and document processing services
Service category | Description |
Jina AI Reader | A web content extraction service for LLMs that converts any URL into a clean, LLM-friendly plain text format. It removes ads, navigation, and other distracting information to extract only the core content of a webpage. |
document parsing |
|
text vectorization |
|
sparse text vectorization | OpenSearch sparse text vectorization service: Converts text into a sparse vector representation. Sparse vectors require less storage and are often used to represent keywords and term frequency signals. They can be combined with a dense vector for hybrid search to improve retrieval quality. This service supports over 100 languages with a maximum input text length of 8,192 tokens. |
document chunking | This service splits structured data in HTML, Markdown, and TXT formats based on document paragraphs, text semantics, or specified rules. It also supports extracting code, images, and tables from documents in rich text format. |
dimensionality reduction | embedding-dim-reduction: A vector model fine-tuning service. You can use custom training to reduce high-dimensional vectors to lower dimensions. This reduces storage costs and improves cost-effectiveness with minimal impact on retrieval performance. |
Multimodal processing services
Service category | Description |
multimodal embedding |
|
Multimodal ranking | Provides an image relevance ranking service. In RAG and multimodal search scenarios, it can rerank retrieval results by relevance, improving retrieval accuracy and LLM generation quality. |
video summarization | Provides a video summarization service that understands a specified video segment and uses LLM capabilities to generate a video summary, title, and tags. |
video splitting | This service understands and analyzes a video, extracts keyframes, and splits the video into corresponding segments. |
keyframe extraction | keyframe extraction service 001: Provides video content extraction by capturing keyframes from a video. When combined with multimodal embedding or image parsing services, it enables cross-modal retrieval. |
speech recognition | speech recognition service 001: This service provides speech-to-text capabilities, quickly converting speech from video or audio files into structured text. The service supports multiple languages. |
object detection |
|
image content parsing | Image content understanding service: Uses a multimodal large language model to parse, understand, and recognize text in images. The parsed text can be used in image search and question-answering scenarios. |
Image text recognition service: Performs Optical Character Recognition (OCR) on images. The recognized text can be used in image search and question-answering scenarios. |
Features
The Experience Center offers the following services:
Service group | Service category | Description |
Agent search services | Agentic Memory Provides long-term, short-term, and contextual memory storage and retrieval for Agentic AI and intelligent search services. It supports storage, query, update, and forgetting operations for various search data, including personalized memory, agent memory, search memory, and Skill management. The service uses hybrid search and multi-path recall technology, combining BM25 and vector retrieval to ensure efficient and accurate memory retrieval. | The Agentic Memory service provides storage and management for two types of data: Memory and Skill. Memory stores a user's personal hobbies, interests, and preferences, while Skill stores reusable execution logic and capabilities. |
large language model (LLM) |
| |
internet search | If your private knowledge base provides no answer during a search, you can enable internet search to retrieve additional information. This supplements your private knowledge base and, when combined with an LLM, enables richer responses. | |
query analysis A query content analysis service that uses LLMs and NLP capabilities to perform intent recognition, similar query expansion, and natural language to SQL (NL2SQL) conversion on user queries. This improves retrieval and question answering performance in RAG scenarios. | A general query analysis service that uses an LLM for intent understanding and similar query expansion on user input queries. | |
sorting service |
| |
Text and document processing services | Jina AI Reader | A web content extraction service for LLMs that converts any URL into a clean, LLM-friendly plain text format. It removes ads, navigation, and other distracting information to extract only the core content of a webpage. |
document parsing |
| |
text vectorization |
| |
sparse text vectorization This service converts text data into a sparse vector representation. Sparse vectors require less storage and are often used to represent keywords and term frequency signals. They can be combined with dense vectors for hybrid search to improve retrieval quality. | OpenSearch sparse text vectorization service: A text vectorization service for over 100 languages with a maximum input text length of 8,192 tokens. | |
document chunking | This service splits structured data in HTML, Markdown, and TXT formats based on document paragraphs, text semantics, or specified rules. It also supports extracting code, images, and tables from documents in rich text format. | |
dimensionality reduction | embedding-dim-reduction: A vector model fine-tuning service. You can use custom training to reduce high-dimensional vectors to lower dimensions, helping improve cost-effectiveness with minimal impact on retrieval performance. | |
Multimodal processing services | multimodal embedding |
|
Multimodal ranking | Provides an image relevance ranking service. In RAG and multimodal search scenarios, this ranking service can rerank content to improve retrieval accuracy and LLM generation quality. | |
video summarization | Provides a video summarization service that understands a specified video segment and uses LLM capabilities to generate a video summary, title, and tags. | |
video splitting | This service understands and analyzes a video, extracts keyframes, and splits the video into corresponding segments. | |
keyframe extraction | keyframe extraction service 001: Provides video content extraction by capturing keyframes from a video. When combined with multimodal embedding or image parsing services, it enables cross-modal retrieval. | |
speech recognition | speech recognition service 001: This service provides speech-to-text capabilities, quickly converting speech from video or audio files into structured text. The service supports multiple languages. | |
object detection |
| |
image content parsing | Image content understanding service: Uses a multimodal large language model to parse, understand, and recognize text in images. The parsed text can be used in image search and question-answering scenarios. | |
Image text recognition service: Performs Optical Character Recognition (OCR) on images. The recognized text can be used in image search and question-answering scenarios. |
Try services
This section shows how to use the Experience Center to try services like document parsing and multimodal embedding, view results, and get the sample code.
Document parsing
Log on to the Open Platform for AI Search console.
In the navigation pane, select Experience Center.
For Service Category, select Document/Image Parsing (document-analyze), and then select a specific service from Experience Services.
Use the system-provided Sample data or upload your own data via Manage data. Supported file formats are TXT, PDF, HTML, DOC, DOCX, PPT, and PPTX. The maximum file size is 20 MB.
File: Upload local files. These files are automatically deleted after 7 days. The platform does not store your data long-term.
URL: Provide the file URL and its corresponding file type. You can upload multiple URLs, with each URL on a separate line.
NoteDocument parsing will fail if you select an incorrect data format. Make sure to choose the correct file type for your data.
ImportantEnsure that you use the web link import feature in compliance with applicable laws and regulations. You must adhere to the management specifications of the target platform and protect the legal rights of rights holders. You are solely responsible for your actions. As a tool provider, the Open Platform for AI Search is not liable for your parsing or downloading behavior.
If you use your own data, select the pre-uploaded file or URL from the drop-down list.
Click Get Results to start parsing the document.
Results: Displays the parsing progress and result.
Result source code: View the response code. You can use Copy Code or Download File to save the code locally.
Sample code: View and download the Sample code for calling this service.
Multimodal embedding
Log on to the Open Platform for AI Search console.
In the navigation pane, select Experience Center.
For Service Category, select Multimodal Vector (multi-modal-embedding). Select a specific service from Experience Services, and then choose Text, Image, or Text + Image.
NoteUploaded local images for vectorization are automatically deleted after 7 days. The platform does not store your data long-term.
Click Get Results to get the multimodal embedding.
Results: Displays the embedding result.
Result source code: View the response code. You can use Copy Code or Download File to save the code locally.
Sample code: View and download the Sample code for calling this service.