ucloud-deepseek-ocr
v1.0.0OCR text recognition using DeepSeek-OCR model. Use when user asks for OCR, text recognition, image text extraction, screenshot recognition, or converting images to text/markdown.
Installation
Please help me install the skill `ucloud-deepseek-ocr` from SkillHub official store.
npx skills add qianjunye/ucloud-deepseek-ocr
DeepSeek OCR
Recognize text in images using the DeepSeek-OCR model.
Quick start
{baseDir}/scripts/ocr.sh /path/to/image.jpg
Usage
{baseDir}/scripts/ocr.sh <image_path> [output_format]
Parameters:
- <image_path>: Local image file (jpg, png, webp, gif, bmp)
- [output_format]: Optional, defaults to markdown. Can be text, json, etc.
Examples
# Convert to markdown (default)
{baseDir}/scripts/ocr.sh /path/to/image.jpg
# Convert to plain text
{baseDir}/scripts/ocr.sh /path/to/image.png text
# Extract table as JSON
{baseDir}/scripts/ocr.sh /path/to/table.jpg "extract table as json"
Remote URL images
The model only supports base64-encoded images. For remote URLs, download first:
curl -s -o /tmp/image.jpg "https://example.com/image.jpg"
{baseDir}/scripts/ocr.sh /tmp/image.jpg
API key
Set DEEPSEEK_OCR_API_KEY, or configure in ~/.openclaw/openclaw.json:
{
skills: {
"deepseek-ocr": {
apiKey: "YOUR_KEY_HERE",
},
},
}
Default API URL: https://api.modelverse.cn/v1/chat/completions
Override with DEEPSEEK_OCR_API_URL if needed.