Tencent Cloud Optical Character Recognition (OCR), based on deep learning and multimodal large model technology from Tencent YouTu Lab, intelligently identifies text content in images as editable text or extracts structured information. OCR supports recognition of standard documents such as identity cards, bank cards, and invoices, as well as various complex industry documents like transportation and logistics waybills and consignment notes, and healthcare documents such as medical reports and expense lists.
This section introduces OCR APIs that comply with the OpenAPI 3.0 specification.
You can call APIs to perform text recognition operations, such as General OCR, Card OCR, invoice recognition and intelligent documentation.
For information on ALL APIs supported by OCR, see API Overview.
For common terms of the OCR API interface, see the table below:
| Term | Description |
|---|---|
| Object Notation is a lightweight data interchange format. Any type supported by the JavaScript language can be represented through JSON, such as string, number, object, and array. | |
| SDK is a collection of development tools that software engineers use to create applications for specific software packages, software frameworks, hardware platforms, and operating systems. |
Usage limits
For API parameter limits, refer to the parameter description in each API document.
You can use the API Explorer tool to call APIs online.
This document uses general printed text recognition (high-precision version) as an example. The steps to make an API call via the API Explorer Tool are as follows:
フィードバック