General-purpose text recognition

更新时间:
复制 MD 格式

This topic describes the features, advantages, and application scenarios of the general-purpose text recognition products from Alibaba Cloud OCR. It also provides quick links to the product APIs.

Product introduction

Alibaba Cloud OCR general-purpose recognition products can recognize and restore text from various common document images or scanned documents while preserving the original document format. To better restore text and document structure, the document recognition feature adds layout analysis and document image editing capabilities to its general full-text recognition features. These features include text localization, line analysis, and text recognition. This allows for the structured extraction of document elements from document images and improves the user experience.

Product features

General-purpose text recognition

Alibaba Cloud OCR general-purpose text recognition is suitable for unstructured text recognition in various industry scenarios. It returns text content and coordinate information.

General-purpose text recognition

High-accuracy full-text recognition (Recommended)

Alibaba Cloud OCR high-accuracy full-text recognition supports accurate recognition of multiple layouts, complex document backgrounds, and various lighting conditions. The document recognition rate exceeds 99.7%. For documents with seals or handprints, it can recognize text after the seals are removed. It also supports advanced features such as low-confidence-level filtering and pattern detection.

High-accuracy full-text recognition

General-purpose handwriting recognition

The Alibaba Cloud OCR general-purpose handwriting recognition model supports handwriting recognition in complex scenarios, including Chinese, English, and numbers. It can also recognize printed text, which makes it suitable for various handwritten notes and whiteboard content.

Table recognition

Alibaba Cloud OCR table recognition can effectively recognize lined, striped, and unlined tables.

Note

Intelligent table parsing: Performs general-purpose table parsing to extract table styles, content, and key-value pairs from text and tables. It supports PDF documents up to 100 MB and 100 pages, and up to 30 images for image document formats. Try it now for free

Table recognition

E-commerce image text recognition

Alibaba Cloud OCR E-commerce image text recognition is a core product designed for fast and accurate character recognition in online images. This includes E-commerce product promotions, online community posts, and user-generated content (UGC). This feature is highly valuable for scenarios such as illegal ad detection, information review management, and network security administration.E-commerce text recognition

Structured document recognition

Alibaba Cloud OCR structured document recognition can recognize document information in a structured manner. It provides layout information output from two perspectives: a tiled element view and a hierarchical tree view. It can extract and sequentially output text elements, such as single characters, text blocks, and lines, and their corresponding layout formats, such as titles, paragraphs, and tables. Currently, this feature only supports single-page documents.

Note

Intelligent document parsing: Extracts logical hierarchical structures, text content, table content, key-value fields, and style information from documents. It analyzes the content, layout, and logical information of a document to output the extracted results as structured data. It supports PDF documents up to 100 MB and 100 pages, and up to 30 images for image document formats. Try it now for free

Advantages

  • High accuracy

The models are trained on massive image samples to achieve industry-leading accuracy. For example, the accuracy for ID card recognition exceeds 99%.

  • Low latency

The service relies on Alibaba's self-built EAS online service clusters and continuously optimized inference technology. This provides a low-latency service with elastic scaling.

  • Advanced technology

The service is built on Alibaba Cloud Platform for AI. It uses Alibaba's deeply optimized deep learning framework, PAI-TensorFlow, to train advanced text detection and recognition models.

  • Stable service

The service offers elastic scaling based on call volume, which ensures good extensibility. Continuous algorithm iterations and optimizations do not affect service stability for users.

Scenarios

  • Image content moderation

You can use various general-purpose APIs to recognize content for moderation in different scenarios. This helps promptly detect non-compliant behavior, significantly reduces manual labor costs, and is widely used in E-commerce content administration.

  • Contract and document recognition

You can use general-purpose text recognition to recognize text from images of contracts, documents, and novels. This feature is useful for scenarios such as contract proofreading, document retrieval, and PDF content extraction. It is widely used in industries such as judicial case file management, corporate legal contract review, and automated processes in finance and insurance.

API quick links

Alibaba Cloud Marketplace API links (Legacy)

Official website API links (New)

High-accuracy full-text recognition

RecognizeAdvanced

General-purpose handwriting recognition

RecognizeHandwriting

E-commerce image text recognition

RecognizeBasic

Table recognition

RecognizeTableOcr

General-purpose text recognition

RecognizeGeneral

Structured document recognition

RecognizeDocumentStructure