Is my data secure and encrypted?

All data is encrypted at rest using AES-256 and in transit using TLS 1.3. Your encryption keys are stored exclusively on your dedicated VPS.

Are my conversations truly private?

Yes. Each user gets a fully isolated container environment. We follow a zero-knowledge architecture.

Do you train AI models on my data?

Never. Your conversations and data are never used to train AI models.

OpenClaw Skillv1.0.0

Document Pro

by Jackeven02

Deploy on EasyClawdfrom $14.9/mo

文档处理技能 - 让 AI 能够读取、解析、提取 PDF、DOCX、PPT 等文档的关键信息。当用户要求分析文档、提取内容、总结报告时触发此技能。

How to use this skill

OpenClaw skills run inside an OpenClaw container. EasyClawd deploys and manages yours — no server setup needed.

Sign up on EasyClawd (2 minutes)
Connect your Telegram bot
Install Document Pro from the skills panel

Get started — from $14.9/mo

7stars

2,876downloads

28installs

0comments

1versions

Download ZIP View on ClawHub

Latest Changelog

Document Pro 1.0.0 - 文档处理技能上线

- 支持 PDF、Word (DOCX)、PowerPoint (PPTX)、Excel (XLSX)、TXT、Markdown 等常见文档的读取与信息提取
- 实现文档分析、内容提取、格式转换及摘要生成
- 针对表格、文本、关键词等信息自动化提炼
- 针对用户请求智能选择处理工具，自动输出关键信息、摘要和要点
- 标注已知限制，如扫描版 PDF 需 OCR 或复杂格式可能丢失内容

Skill Documentation

---
name: document-pro
version: 1.0.0
description: 文档处理技能 - 让 AI 能够读取、解析、提取 PDF、DOCX、PPT 等文档的关键信息。当用户要求分析文档、提取内容、总结报告时触发此技能。
---

# Document Pro - 文档处理技能

## 概述

赋予 AI 强大的文档处理能力：
- PDF 读取与提取
- Word 文档解析
- PowerPoint 提取
- Excel 数据提取
- 文档格式转换

## 触发场景

1. 用户发送文档并要求"分析"、"总结"
2. 用户要求"提取文档内容"
3. 用户要求"转换成 PDF"
4. 用户询问文档中的具体信息
5. 用户要求"从报告/论文中提取要点"

## 支持的格式

| 格式 | 读取 | 写入 | 工具 |
|------|------|------|------|
| PDF | ✅ | ✅ | pdfplumber, PyPDF2 |
| DOCX | ✅ | ✅ | python-docx |
| PPTX | ✅ | ❌ | python-pptx |
| XLSX | ✅ | ✅ | openpyxl |
| TXT | ✅ | ✅ | 内置 |
| Markdown | ✅ | ✅ | 内置 |

## 工具使用

### PDF 处理

```python
# 提取文本
import pdfplumber

with pdfplumber.open("document.pdf") as pdf:
    for page in pdf.pages:
        text = page.extract_text()
        print(text)

# 提取表格
with pdfplumber.open("document.pdf") as pdf:
    table = pdf.pages[0].extract_tables()
```

### Word 文档

```python
from docx import Document

doc = Document("document.docx")
for para in doc.paragraphs:
    print(para.text)

# 提取表格
for table in doc.tables:
    for row in table.rows:
        print([cell.text for cell in row.cells])
```

### PowerPoint

```python
from pptx import Presentation

prs = Presentation("presentation.pptx")
for slide in prs.slides:
    for shape in slide.shapes:
        if shape.has_text_frame:
            print(shape.text)
```

## 工作流

```
1. 识别文档类型 → 选择正确的工具
2. 读取内容 → 提取文本、表格、图片
3. 分析信息 → 理解结构、提取要点
4. 总结呈现 → 用中文总结给用户
```

## 进阶功能

### 文档摘要
- 提取文档主要观点
- 生成简短摘要
- 列出关键要点

### 表格处理
- 识别表格结构
- 提取表格数据
- 转换为 CSV/Excel

### 关键词提取
- 找出重要名词/术语
- 识别主题
- 提取关键信息

## 输出格式

向用户呈现文档时：
- 文档类型和页数
- 主要内容摘要
- 关键要点（3-5条）
- 建议的后续操作

## 限制

- 扫描版 PDF 需要 OCR
- 复杂格式可能丢失
- 图片/图表无法完全理解

Security scan, version history, and community comments: view on ClawHub

Document Pro

How to use this skill

Latest Changelog

Tags

Skill Documentation