PIL如何高效识别并提取图片中的文字内容-Python

教程大全 2026-02-14 21:21:42 浏览次

Python PIL 库识别图片文字：高效便捷的图像处理工具

简介

Python PIL（Python Imaging Library）库是一个强大的图像处理库，它提供了丰富的图像处理功能，包括图像的读取、编辑、保存等，PIL库的OCR（Optical Character Recognition，光学字符识别）功能可以帮助我们识别图片中的文字，本文将详细介绍如何使用Python PIL库进行图片文字识别。

安装PIL库

在使用PIL库之前，首先需要安装PIL库，由于PIL库已经不再维护，我们可以使用其继任者Pillow库,以下是安装Pillow库的命令：

pip instAll Pillow

图片文字识别步骤

读取图片

我们需要读取要识别文字的图片，使用Pillow库的 Image.open() 函数可以打开图片。

from PIL import Image# 打开图片image = Image.open("example.jpg")

转换图片格式

为了提高文字识别的准确性，我们需要将图片转换为灰度图，使用方法可以实现这一功能。

# 转换为灰度图gray_image = image.convert("L")

应用阈值

阈值处理可以将灰度图转换为二值图，使得文字和背景更加分明，使用 threshold() 方法可以实现阈值处理。

# 应用阈值threshold_image = gray_image.point(lambda x: 255 if x > 128 else 0)

使用Tesseract OCR识别文字

Tesseract OCR是一个开源的OCR引擎，可以用于识别图片中的文字，在Python中，我们可以使用 pytesseract 库来调用Tesseract OCR。

from pytesseract import image_to_string# 识别文字text = image_to_string(threshold_image)print(text)

保存识别结果

如果需要将识别结果保存到文件中,可以使用Python的文件操作。

with open("result.txt", "w", encoding="utf-8") as f:f.write(text)

示例代码

以下是一个完整的图片文字识别示例代码：

from PIL import Imagefrom pytesseract import image_to_string# 打开图片image = Image.open("example.jpg")# 转换为灰度图gray_image = image.convert("L")# 应用阈值threshold_image = gray_image.point(lambda x: 255 if x > 128 else 0)# 识别文字text = image_to_string(threshold_image)# 打印识别结果print(text)# 保存识别结果with open("result.txt", "w", encoding="utf-8") as f:f.write(text)