SkillHub

wiseocr

v1.0.18

PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API. Supports table recognition, multi-column layouts, and high-accuracy text extraction. Usage: Upload a file and say Use WiseOCR to OCR this.

Sourced from ClawHub, Authored by wisediag

Installation

Please help me install the skill `wiseocr` from SkillHub official store. npx skills add wisediag/wiseocr

⚠️ Privacy Warning

IMPORTANT - READ BEFORE INSTALLING:

This skill uploads your file to WiseDiag's cloud servers for OCR processing.

Do NOT use with sensitive or confidential documents unless: - You trust WiseDiag's data handling policies - You accept that file contents will be transmitted and processed remotely

For sensitive documents, use offline/local OCR tools instead.


WiseOCR Skill (powered by WiseDiag)

A high-accuracy OCR tool that converts a single PDF or image file into Markdown format. After processing, the Markdown result is automatically saved to disk — no additional saving is needed.

Supported formats: PDF, jpg, jpeg, png, webp, gif, bmp, tiff.

Installation

pip install -r requirements.txt

🔑 API Key Setup (Required)

Get your API key: 👉 https://console.wisediag.com/apiKeyManage

The API key MUST be set as an environment variable. The script reads it automatically.

export WISEDIAG_API_KEY=your_api_key

How to Process a File (Step-by-Step)

NEVER call any API or HTTP endpoint directly. ONLY use the script below.

Step 1: Set the API key (if not already set):

export WISEDIAG_API_KEY=your_api_key

Step 2: Run the script with the input file:

cd scripts

# Single PDF
python3 wiseocr.py -i "/path/to/input_filename.pdf"

# Single image
python3 wiseocr.py -i "/path/to/scan.png"

IMPORTANT: If the input file has been copied or renamed (e.g. to a temp path), always pass -n with the original filename (without extension) so the output file is named correctly:

python3 wiseocr.py -i "/tmp/ocr_input.pdf" -n "my_report"
# Output saved to: ~/.openclaw/workspace/WiseOCR/my_report.md

The Markdown result is saved to ~/.openclaw/workspace/WiseOCR/{name}.md automatically. No additional saving is needed.

Arguments

Flag Description
-i, --input Input file: PDF or image — single file path (required)
-n, --name Output filename stem (recommended when input file is renamed/copied)
-o, --output Output directory (default: ~/.openclaw/workspace/WiseOCR)
--dpi PDF rendering DPI, 72-600 (default: 200)

Data Privacy

What happens to your files: 1. Files are uploaded to WiseDiag's OCR API 2. Files are processed on WiseDiag servers 3. Processing results are returned to you 4. Files are not permanently stored on WiseDiag servers

For sensitive documents, use offline/local OCR tools instead.

License

MIT