Optical Character Recognition

Introduction

Last updated: 2026-05-26 21:25:50

Overview

Tencent Cloud Optical Character Recognition (OCR) is built on deep learning and multimodal large model technology from Tencent YouTu Lab. It converts text in images into editable text or extracts it as structured information. OCR recognizes standard documents such as identity cards, bank cards, and invoices, as well as complex industry documents such as transportation and logistics waybills and consignment notes, and healthcare documents such as medical reports and expense lists.
This section introduces OCR APIs that comply with the OpenAPI 3.0 specification.
You can call these APIs to perform text recognition operations, such as general OCR, card OCR, invoice recognition, and intelligent documentation.
For information on all APIs supported by OCR, see API Overview.

Glossary

Common terms used in the OCR API are described in the table below:

Term    Description
JSON    JavaScript Object Notation (JSON) is a lightweight data interchange format. It can represent the basic JavaScript data types, such as string, number, object, and array.
SDK     A Software Development Kit (SDK) is a collection of development tools that software engineers use to create applications for specific software packages, software frameworks, hardware platforms, or operating systems.
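
As a small illustration of the JSON types named above, the following sketch encodes a Python dictionary to a JSON string and decodes it back using the standard `json` module. The keys and values are hypothetical and are not OCR API parameters.

```python
import json

# Hypothetical data, chosen only to show one value of each JSON type.
record = {
    "title": "sample document",       # string
    "page_count": 3,                  # number
    "size": {"width": 210},           # object
    "keywords": ["invoice", "scan"],  # array
}

# Serialize to a JSON string, then parse it back into Python objects.
encoded = json.dumps(record)
decoded = json.loads(encoded)

print(encoded)
print(decoded == record)
```

The round trip preserves the structure because every value here maps directly onto a JSON type.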

Usage Limits

For API parameter limits, refer to the parameter description in each API document.

Getting Started with APIs

You can use the API Explorer tool to call APIs online.
This document uses general printed text recognition (high-precision version) as an example. To make an API call with the API Explorer tool, follow these steps:

  1. Register a Tencent Cloud account and complete real-name authentication. Then log in to the OCR console (https://console.tencentcloud.com/ocr/overview) and click Enable Now to obtain permission to call the text recognition APIs.
  2. Go to the API Explorer page. For more information on using the API Explorer tool, see the document.
  3. Open the GeneralAccurateOCR API (https://console.tencentcloud.com/api/explorer?Product=ocr&Version=2018-11-19&Action=GeneralAccurateOCR).
  4. Enter the required parameters, make the online call, and view the response. For a description of the input parameters, refer to the API document (https://www.tencentcloud.com/document/product/866/34937?from_cn_redirect=1).
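
The request that step 4 assembles in API Explorer can also be sketched in code. The following is a minimal sketch using only the Python standard library. The action and version come from the API Explorer URL above. The endpoint host, the `ImageBase64` input parameter, and the `TextDetections` and `DetectedText` response fields are assumptions that should be checked against the API document. Request signing and the HTTP call itself are not shown; in practice an SDK or API Explorer handles them.

```python
import base64
import json
from typing import List

# Action and Version are taken from the API Explorer URL in step 3.
ACTION = "GeneralAccurateOCR"
VERSION = "2018-11-19"
ENDPOINT = "ocr.tencentcloudapi.com"  # assumed host; confirm in the API document


def build_payload(image_bytes: bytes) -> str:
    """Return a JSON request body with the image Base64-encoded.

    "ImageBase64" is an assumed parameter name.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return json.dumps({"ImageBase64": encoded})


def extract_text(response_body: str) -> List[str]:
    """Collect recognized text lines from a JSON response body.

    Assumes the result is wrapped in "Response" and that each entry of
    "TextDetections" carries a "DetectedText" string.
    """
    response = json.loads(response_body)["Response"]
    if "Error" in response:
        err = response["Error"]
        raise RuntimeError("{}: {}".format(err.get("Code"), err.get("Message")))
    return [item["DetectedText"] for item in response.get("TextDetections", [])]


if __name__ == "__main__":
    # Hand-written sample body for illustration only, not real API output.
    sample = json.dumps({
        "Response": {
            "TextDetections": [{"DetectedText": "Hello"}],
            "RequestId": "example-request-id",
        }
    })
    print(ACTION, VERSION, ENDPOINT)
    print(build_payload(b"abc"))
    print(extract_text(sample))
```

Keeping payload construction and response parsing in separate functions makes it easy to compare each against the parameters and response shown in API Explorer.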
