OpenClaw Skillv1.0.9

WiseOCR

wisediagby wisediag
Deploy on EasyClawdfrom $14.9/mo

PDF OCR — Convert PDF to Markdown via WiseDiag cloud API (requires WISEDIAG_API_KEY). Files are uploaded to WiseDiag servers for processing. Supports table r...

How to use this skill

OpenClaw skills run inside an OpenClaw container. EasyClawd deploys and manages yours — no server setup needed.

  1. Sign up on EasyClawd (2 minutes)
  2. Connect your Telegram bot
  3. Install WiseOCR from the skills panel
Get started — from $14.9/mo
5stars
414downloads
1installs
0comments
16versions

Latest Changelog

- Updated SKILL.md to remove Chinese examples and keywords, using only English filenames and terms for usage instructions and examples.
- Generalized usage documentation (e.g., example filenames like "medical_report" instead of "体检报告").
- No changes were made to the code or functionality.

Tags

latest: 1.0.9

Skill Documentation

---
name: wiseocr
description: "Convert PDF files to Markdown using WiseOCR API (powered by WiseDiag). Supports table recognition, multi-column layouts, and medical document OCR. Usage: Upload a PDF file and say Use WiseOCR to process this."
registry:
  homepage: https://github.com/fmdmm/WiseOCR
  author: fmdmm
  credentials:
    required: true
    env_vars:
      - WISEDIAG_API_KEY
---

# ⚠️ Privacy Warning

**IMPORTANT - READ BEFORE INSTALLING:**

This skill **uploads your files to WiseDiag's cloud servers** for OCR processing.

**Do NOT use with sensitive or confidential documents** unless:
- You trust WiseDiag's data handling policies
- You accept that file contents will be transmitted and processed remotely

**For sensitive documents, use offline/local OCR tools instead.**

---

# WiseOCR Skill (powered by WiseDiag)

Convert PDF files into Markdown format. The script handles API authentication, file upload, OCR processing, and saves the result automatically.

## ⚠️ IMPORTANT: How to Use This Skill

**You MUST use the provided script to process files. Do NOT call any API or HTTP endpoint directly.**

The script `scripts/wiseocr.py` handles everything:
- API authentication (reads `WISEDIAG_API_KEY` from environment)
- PDF upload and OCR processing
- Saves the Markdown result to `WiseOCR/{filename}.md`

## 🔑 API Key Setup (Required)

**Get your API key:**
👉 [https://console.wisediag.com/apiKeyManage](https://s.wisediag.com/xsu9x0jq)

```bash
export WISEDIAG_API_KEY=your_api_key
```

## Installation

```bash
pip install -r requirements.txt
```

## Usage

**To process a PDF file, run:**

```bash
cd scripts
python wiseocr.py -i /path/to/input.pdf -n original_filename
```

**IMPORTANT:** Always pass `-n` with the original filename (without extension) so the output file is named correctly. If the uploaded file has been renamed (e.g. to `ocr_input.pdf`), `-n` ensures the output uses the user's original filename.

The script will automatically save the result to `WiseOCR/{name}.md`.

**Example:**

```bash
python wiseocr.py -i /tmp/ocr_input.pdf -n medical_report
# Output saved to: WiseOCR/medical_report.md
```

**With custom output directory:**

```bash
python wiseocr.py -i /path/to/input.pdf -n medical_report -o /custom/output/dir
```

## Arguments

| Flag | Description |
|------|-------------|
| `-i, --input` | Input PDF file path (required) |
| `-n, --name` | Original filename without extension for output (recommended) |
| `-o, --output` | Output directory (default: ./WiseOCR) |
| `--dpi` | PDF rendering DPI, 72-600 (default: 200) |

## Output

After the script runs, the Markdown file is saved automatically:

- Default: `WiseOCR/{name}.md`
- The file is named using the `-n` parameter (e.g. `-n report` → `report.md`), or falls back to the input filename
- No additional saving is needed — the file is already on disk

## Data Privacy

**What happens to your files:**
1. Files are uploaded to WiseDiag's OCR API (`https://openapi.wisediag.com`)
2. Files are processed on WiseDiag servers
3. Processing results are returned to you
4. Files are not permanently stored on WiseDiag servers

**For sensitive documents, use offline/local OCR tools instead.**

## License

MIT
Security scan, version history, and community comments: view on ClawHub