使用Python基于百度等OCR API的文字识别_随笔

使用Python基于百度等OCR API的文字识别百度OCR

Baidu OCR API：一定额度免费，目前是每日500次

Python SDK文档：https://cloud.baidu.com/doc/OCR/OCR-Python-SDK.html

安装使用

1）首先注册一个百度云BCE账号

登录官网https://cloud.baidu.com/ --> 右上角‘管理控制台’ --> 产品服务 --> 文字识别https://console.bce.baidu.com/ai/#/ai/ocr/overview/index --> 创建应用 --> 立即创建

2）然后从控制面板新建一个文字识别应用

查看应用列表

获得调用API需要的AppID，API Key 和 Secret Key

3）安装OCR Python SDK

pip install baidu-aip

封装和调用

接口说明：https://cloud.baidu.com/doc/OCR/OCR-Python-SDK.html#.E6.8E.A5.E5.8F.A3.E8.AF.B4.E6.98.8E

# 新建AipOCR
from aip import AipOcr

config = {
    'appId': 'your-id',
    'apiKey': 'your-key',
    'secretKey': 'your-secret-key'
}

client = AipOcr(**config)

def get_file_content(file):
    with open(file, 'rb') as fp:
        return fp.read()

def img_to_str(image_path):
    image = get_file_content(image_path)
    result = client.basicGeneral(image)
    if 'words_result' in result:
        return '\n'.join([w['words'] for w in result['words_result']])

if __name__ == '__main__' :
    imagepath = './image/Test.png'
    img_to_str(imagepath)

测试 - 识别1：

测试 - 识别2：

测试 - 识别3：

测试 - 识别4：

总结

提供多种参数 - 处理灵活 - 比如自定义旋转，返回可信度，特定类型证件识别等

其他OCR

微软Azure 图像识别：https://azure.microsoft.com/zh-cn/services/cognitive-services/computer-vision/
有道智云文字识别：http://aidemo.youdao.com/ocrdemo
阿里云图文识别：https://www.aliyun.com/product/cdi/
腾讯OCR文字识别：https://cloud.tencent.com/product/ocr
TesseractOCR文字识别：https://digi.bib.uni-mannheim.de/tesseract/（Tesseract有提供优化版的简体中文语料库）

欢迎分享，转载请注明来源：内存溢出

原文地址: https://outofmemory.cn/zaji/588928.html

使用Python基于百度等OCR API的文字识别

发表评论

评论列表（0条）