OCR Text Recognition

PDF Scans: Use PyMuPDF to extract images first, then OCR
Image Text Recognition: Perform OCR directly on images
Multi-page PDFs: Process page by page

This skill uses PaddleOCR for text recognition, supporting both Chinese and English.
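The PDF scan workflow listed above can be sketched as follows. This is a minimal illustration, not the skill's own script: the helper names (`page_image_path`, `render_pdf_pages`, `ocr_pdf`), the output directory, and the 200 DPI default are assumptions. It renders each page to a PNG with PyMuPDF instead of pulling out embedded images, and it assumes each PaddleOCR result can be indexed with `"rec_texts"`.

```python
from pathlib import Path


def page_image_path(pdf_path, page_number, out_dir):
    """Build the PNG path for one rendered page, e.g. out/scan_p0001.png."""
    return Path(out_dir) / f"{Path(pdf_path).stem}_p{page_number:04d}.png"


def render_pdf_pages(pdf_path, out_dir, dpi=200):
    """Render every page of a scanned PDF to a PNG and return the paths."""
    import fitz  # PyMuPDF; imported lazily so the path helper works without it

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    paths = []
    with fitz.open(pdf_path) as doc:
        for number, page in enumerate(doc, start=1):
            target = page_image_path(pdf_path, number, out_dir)
            page.get_pixmap(dpi=dpi).save(str(target))
            paths.append(target)
    return paths


def ocr_pdf(pdf_path, out_dir="pages", lang="ch"):
    """OCR a scanned PDF page by page; returns one list of texts per page."""
    from paddleocr import PaddleOCR

    ocr = PaddleOCR(lang=lang)
    pages = []
    for image_path in render_pdf_pages(pdf_path, out_dir):
        texts = []
        for res in ocr.predict(str(image_path)):
            # Assumption: each result exposes "rec_texts" via dict-style access
            texts.extend(res["rec_texts"])
        pages.append(texts)
    return pages
```

Rendering whole pages keeps the page order explicit and works even when a scan stores several image fragments per page; a higher `dpi` may help with small print at the cost of slower recognition.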

Quick Start

Run OCR directly on an image or PDF file:

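from paddleocr import PaddleOCR

# lang='ch' selects the model that handles both Chinese and English
ocr = PaddleOCR(lang='ch')
# Pass the path of the image or PDF file to recognize
result = ocr.predict("file_path.jpg")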

Install dependencies before first use:

pip3 install paddlepaddle paddleocr

Recognition results return JSON containing:

- rec_texts: List of recognized text
- rec_scores: Confidence score for each text
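Because rec_texts and rec_scores line up item by item, low-confidence text can be dropped before further processing. The sketch below assumes a single result that supports dict-style access to those two fields; the `confident_texts` helper, the 0.8 threshold, and the sample data are illustrative and not part of PaddleOCR.

```python
def confident_texts(result, min_score=0.8):
    """Return (text, score) pairs whose confidence is at least min_score."""
    pairs = zip(result["rec_texts"], result["rec_scores"])
    return [(text, score) for text, score in pairs if score >= min_score]


# Hand-written stand-in for one recognition result (not real OCR output)
sample = {
    "rec_texts": ["Invoice", "T0tal", "42.00"],
    "rec_scores": [0.98, 0.61, 0.93],
}

for text, score in confident_texts(sample):
    print(f"{score:.2f}  {text}")
```

A suitable threshold depends on the scan quality, so it is worth checking a few rejected items by hand before settling on a value.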

Common scripts are located in the scripts/ directory.