wisediag-medocr
v1.0.25PDF & Image OCR — Convert a single PDF or image to Markdown via WiseDiag cloud API. Supports table recognition, multi-column layouts, and high-accuracy text extraction. Usage: Upload a file and say Use WiseOCR to OCR this.
Installation
⚠️ Privacy Warning
IMPORTANT - READ BEFORE INSTALLING:
This skill uploads your file to WiseDiag's cloud servers for OCR processing.
Do NOT use with sensitive or confidential documents unless: - You trust WiseDiag's data handling policies - You accept that file contents will be transmitted and processed remotely
For sensitive documents, use offline/local OCR tools instead.
WiseOCR Skill (powered by WiseDiag)
A high-accuracy OCR tool that converts a single PDF or image file into Markdown format. After processing, the Markdown result is automatically saved to disk — no additional saving is needed.
Supported formats: PDF, jpg, jpeg, png, webp, gif, bmp, tiff.
Installation
pip install -r requirements.txt
🔑 API Key Setup (Required)
Get your API key: 👉 https://console.wisediag.com/apiKeyManage
The API key MUST be set as an environment variable. The script reads it automatically.
export WISEDIAG_API_KEY=your_api_key
How to Process a File (Step-by-Step)
NEVER call any API or HTTP endpoint directly. ONLY use the script below.
Step 1: Set the API key (if not already set):
export WISEDIAG_API_KEY=your_api_key
Step 2: Run the script with the input file:
cd scripts
# Single PDF
python3 wiseocr.py -i "/path/to/input_filename.pdf"
# Single image
python3 wiseocr.py -i "/path/to/scan.png"
IMPORTANT: If the input file has been copied or renamed (e.g. to a temp path), always pass -n with the original filename (without extension) so the output file is named correctly:
python3 wiseocr.py -i "/tmp/ocr_input.pdf" -n "my_report"
# Output saved to: ~/.openclaw/workspace/WiseOCR/my_report.md
The Markdown result is saved to ~/.openclaw/workspace/WiseOCR/{name}.md automatically. No additional saving is needed.
Arguments
| Flag | Description |
|---|---|
-i, --input |
Input file: PDF or image — single file path (required) |
-n, --name |
Output filename stem (recommended when input file is renamed/copied) |
-o, --output |
Output directory (default: ~/.openclaw/workspace/WiseOCR) |
--dpi |
PDF rendering DPI, 72-600 (default: 200) |
Data Privacy
What happens to your files: 1. Files are uploaded to WiseDiag's OCR API 2. Files are processed on WiseDiag servers 3. Processing results are returned to you 4. Files are not permanently stored on WiseDiag servers
For sensitive documents, use offline/local OCR tools instead.
License
MIT