Alibaba Cloud Visual Intelligence API product and documentation release updates for 2021

This topic describes the product and documentation updates for Alibaba Cloud Visual Intelligence API in 2021.

December 2021

Category	Capability Name	Description	Release date	Supported platforms	References
Offline SDK	Limb keypoint SDK	Provides detection information for 15 key points in authorized human images, such as the nose, eyes, neck, left shoulder, and right shoulder.	2021-12-30	Android and iOS	Limb keypoint SDK
	Limb action counting SDK	Captures video of human actions using a camera. Detects limb keypoints in real time and automatically counts actions. Supports 15 fitness actions, such as jumping rope, squats, jumping jacks, sit-ups, push-ups, planks, and glute bridges. Custom fitness actions are also supported.	2021-12-30	Android and iOS	Limb action counting SDK
	Limb action counting feedback SDK	Uses AI to detect 15 types of incorrect limb actions in real time and provides immediate feedback.	2021-12-30	Android and iOS	Limb action counting feedback SDK
	Facial landmark SDK	Detects the number of faces and face regions in an image. Returns the face count, coordinates of 106 basic facial landmarks, 134 additional fine-grained landmarks, and 40 pupil landmarks.	2021-12-30	Android and iOS	Facial landmark SDK
	Image enhancement SDK	Enlarges original images by 2× without quality loss.	2021-12-30	Android and iOS	Image enhancement SDK
	Filter SDK	Provides eight filters: Normal, Vibrant, Fresh, Food, Japanese, Beauty, Mint, and Black & White. Applies filters while preserving image quality.	2021-12-30	Android and iOS	Filter SDK

November 2021

Category	API Name	Description	Release date	Region	References
Object detection	Clothing detection	Uses visual AI algorithms, Internet of Things (IoT), and big data analytics to detect whether personnel wear hats, masks, or uniforms in specified areas. Sends real-time alerts for noncompliant attire.	2021-11-30	China (Shanghai)	Clothing detection
Object detection	Cat and mouse detection	Uses visual AI algorithms, IoT, and big data analytics to detect cats, mice, and other animals in scenes. Sends real-time alerts.	2021-11-30	China (Shanghai)	Cat and mouse detection

October 2021

Category	API Name	Description	Release date	Region	References
Face and body	Batch add face data	Adds face data to a specified database in batches.	2021-10-29	China (Shanghai)	Batch add face data

September 2021

Category	API Name	Description	Release date	Region	References
Face and body	Smart slimming	Takes a portrait image as input. Detects and analyzes facial features. Generates a slimmer version of the face. Supports up to three faces per image.	2021-09-30	China (Shanghai)	Smart slimming
Face and body	Smart skin retouching	Takes a portrait image as input. Smooths facial skin and removes blemishes such as acne, acne scars, and freckles. Whitens skin across the entire body. Preserves natural skin texture. Supports multiple people per image.	2021-09-30	China (Shanghai)	Smart skin retouching

August 2021

Category	API Name	Description	Release date	Region	References
Image recognition	Ad creative analysis	Tags people—including celebrities, ordinary people, and computer-generated characters—and scenes in ad images. Supports thousands of content tags with broad coverage.	2021-08-31	China (Shanghai) and China (Hohhot)	Ad creative analysis
Video understanding	Video content understanding	Added support for China (Hohhot).	2021-08-31	China (Shanghai) and China (Hohhot)	Video content understanding
Object detection	IPC video object detection	Detects objects—such as people, vehicles, and pets—in input videos.	2021-08-31	China (Shanghai)	IPC video object detection

July 2021

Category	API Name	Description	Release date	Region	References
OCR	VAT invoice roll recognition	Recognizes structured fields on VAT invoice rolls, including total amount including tax, invoice code, invoice number, total tax amount, total amount, password area, issue date, tax rate, buyer identification number, and seller identification number.	2021-07-31	China (Shanghai)	VAT invoice roll recognition
	Fixed-amount invoice recognition	Recognizes structured fields on fixed-amount invoices, including invoice number, invoice code, and invoice amount.	2021-07-31	China (Shanghai)	Fixed-amount invoice recognition
	PDF recognition	Performs structured text recognition on PDF files.	2021-07-31	China (Shanghai)	PDF recognition
Image analysis and processing	Aortic aneurysm and pulmonary hypertension detection	Segments the aorta and pulmonary artery from chest CT DICOM scan data. Extracts centerlines for both vessels. Generates optimal-view Stretch CPR, Cross Section, and Straightened CPR images that wrap around each vessel. Returns the maximum diameter of each vessel. Also returns cross-sectional area at 1 mm intervals along each centerline and the position of each point in the patient coordinate system of the original image.	2021-07-31	China (Shanghai)	Aortic aneurysm and pulmonary hypertension detection

June 2021

Category	Capability Name	Description	Release date	Supported platforms	References
Offline SDK	Real-time video segmentation SDK	Uses deep learning frameworks and detection-recognition techniques to deliver high-precision visual segmentation. Segments foreground subjects and scenes at the pixel level in real time. Works well with highly transparent subjects and complex backgrounds.	2021-06-30	Android and iOS	Real-time video segmentation SDK
	Offline image segmentation SDK	Uses detection and recognition techniques to perform precise, flawless background removal on user-captured or uploaded images. Delivers high-precision visual segmentation. Supports segmentation and background replacement for complex images.	2021-06-30	Android and iOS	Offline image segmentation SDK
	Certificate recognition SDK	Recognizes certificates using Alibaba Cloud Visual Intelligence API’s certificate recognition technology.	2021-06-30	Android and iOS	Certificate recognition SDK
	Vehicle recognition SDK	Scans and recognizes all mainland China single-line license plates and VIN codes from video streams.	2021-06-30	Android and iOS	Vehicle recognition SDK
	General OCR SDK	Performs offline OCR on Android or iOS devices. Has a small package size and delivers sub-second recognition speed.	2021-06-30	Android and iOS	General OCR SDK

May 2021

Category	API Name	Description	Release date	Region	References
3D vision	Monocular video depth estimation	Takes a color video as input. Estimates a depth map for each frame. Generates a point cloud.	2021-05-31	China (Shanghai)	Offline

April 2021

Category	API Name	Description	Release date	Region	References
Face and body	Online proctoring	Detects candidate behavior during online exams. Supports screen chat tool detection and candidate status detection.	2021-04-30	China (Shanghai)	Online proctoring
3D vision	Image-based human reconstruction	Estimates the 3D depth value for each pixel in a single human image. Outputs a local unit mesh model from the corresponding viewpoint.	2021-04-30	China (Shanghai)	Offline
3D vision	Multi-view 3D reconstruction	Takes multiple color images of the same scene and their corresponding camera positions as input. Reconstructs a 3D model of the main subject or scene. Outputs a 3D point cloud.	2021-04-30	China (Shanghai)	Offline

March 2021

Category	API Name	Description	Release date	Region	References
3D vision	Monocular image depth estimation	Estimates the 3D depth value for each pixel in a single image. Outputs a depth map.	2021-03-25	China (Shanghai)	Offline
3D vision	Stereo depth estimation	Takes two stereo color images—left and right—as input. Estimates and outputs the disparity map for the left image.	2021-03-25	China (Shanghai)	Offline

February 2021

Category	API Name	Description	Release date	Region	References
Face and body	Static gesture recognition	Detects gestures in images.	2021-02-26	China (Shanghai)	Offline

January 2021

Category	API Name	Description	Release date	Region	References
Face and body	Face fusion	With proper authorization, fuses one person’s face with another person’s facial features.	2021-01-31	China (Shanghai)	Face fusion
	Add face fusion template	Uses approved face images as templates for face fusion.	2021-01-31	China (Shanghai)	Add face fusion template
	Query face fusion template	Lists existing face templates.	2021-01-31	China (Shanghai)	Query face fusion template
	Delete face fusion template	Deletes existing face templates.	2021-01-31	China (Shanghai)	Delete face fusion template
	Human sketch stylization	Automatically crops the head region from a portrait image and generates a sketch effect.	2021-01-31	China (Shanghai)	Human sketch stylization
	Server-side identity verification	Requests the user’s image information for server-side identity verification, using the verified person’s name and ID number.	2021-01-31	China (Shanghai)	Server-side identity verification
	Mobile identity verification request	Requests identity verification information on mobile devices using the person’s name and ID number.	2021-01-31	China (Shanghai)	Mobile identity verification request
	Mobile identity verification query	Returns identity verification information after confirming the ID is valid and the name matches.	2021-01-31	China (Shanghai)	Mobile identity verification query
Object detection	Vehicle illegal parking detection	Detects parked vehicles in target areas of images.	2021-01-31	China (Shanghai)	Vehicle illegal parking detection
Object detection	Vehicle congestion detection	Determines whether traffic congestion occurs based on vehicles in images.	2021-01-31	China (Shanghai)	Vehicle congestion detection
Image recognition	Food recognition	Identifies food categories and calorie counts in images.	2021-01-31	China (Shanghai)	Food recognition
Video segmentation	Green screen video segmentation	Removes green screens from videos. Automatically segments foreground subjects from green screen backgrounds.	2021-01-31	China (Shanghai)	Offline
Video understanding	Video content understanding	Analyzes elements in videos, such as celebrities, ordinary people, and game footage.	2021-01-31	China (Shanghai)	Video content understanding
