Improve the efficiency and quality of audio and video content generation with video AI-ApsaraVideo VOD(VOD)-阿里云帮助中心

ApsaraVideo VOD provides intelligent media processing and content generation features, such as automated review, media fingerprint, DataQ - Smart Tag Service, and smart thumbnail. These features recognize, analyze, and understand audio and video content to improve the efficiency and quality of content production.

Introduction to video AI services

Alibaba Cloud video AI services recognize, analyze, and understand audio and video content. You can use these services to:

Detect non-compliant video content.
Identify and search for duplicate or similar audio and video segments.
Recognize people, text, entities, scenes, and actions in videos.
Analyze and understand videos to generate video tags, recommend thumbnails, create GIFs, and produce video synopses.
Convert speech to text.

Video AI Features

Feature	Description	More information
Automated review	The automated review service detects pornography, suggestive material, terrorism, special attire, special logos, weapons, and political references in video files, thumbnails, and titles of video-on-demand resources, and then provides recommended actions.	Product information: Automated review Configuration document: Automated review
Media fingerprint	A media fingerprint uniquely marks a video, audio file, or image. It remains stable across transformations such as format conversion, editing, splicing, compression, and rotation. The media fingerprint service extracts and compares fingerprint features from images and audio in videos to find duplicate videos, trace video segment sources, and identify original content.	Product information: Media fingerprint Configuration document: Media fingerprint
DataQ - Smart Tag Service	The DataQ - Smart Tag Service analyzes visual, text, speech, and behavioral information in videos. It uses multimodal information fusion and alignment to accurately recognize content and automatically generate multi-dimensional content labels, transforming unstructured information into a structured format.	Product information: DataQ - Smart Tag Service Configuration document: DataQ - Smart Tag Service
Smart thumbnail	The smart thumbnail service analyzes video content to extract the five best screenshots as candidate thumbnails. It also extracts key frames to automatically create an animated GIF thumbnail.	Product information: Smart thumbnail Configuration document: Set a video thumbnail

Notes

Video AI features are available only in some regions. For more information, see Service regions.
All video AI features use the Video AI Processing Complete switch to send callback notifications for completed AI jobs. For more information, see Callback settings.
Fees are charged for video AI features. For more information, see Video AI billing.
Video AI processing can be performed only on audio and video files uploaded to ApsaraVideo VOD. For more information, see Upload media.

Video AI processing flow

You can call an API operation to start a video AI job and retrieve the results in three ways. The following figure shows the processing flow.

Video AI