General OCR Struct
name: general-ocr-struct
by 9penny · published 2026-04-01
$ claw add gh:9penny/9penny-general-ocr-struct---
name: general-ocr-struct
description: General-purpose offline OCR and post-processing for Chinese/English screenshots, scanned images, receipts, tables, chat screenshots, statement screenshots, and other text-heavy images. Use when you need to: (1) extract text from an image locally, (2) return raw OCR text before interpretation, (3) clean broken OCR lines into structured content, (4) reorganize recognized text into rows/fields for downstream use, or (5) separate recognition from later table entry, summarization, or document drafting.
---
# General OCR Struct
Use this skill to separate OCR recognition from downstream content整理.
Workflow
1. Run the local OCR script on the image first.
2. Return the raw OCR text before making business interpretations when accuracy matters.
3. If the image is a transaction-detail screenshot, run structuring mode to group rows into fields.
4. Mark uncertain fields explicitly as `待确认`; do not guess missing content.
5. Only after the user confirms recognition quality, use the result for tables, summaries, or documents.
Commands
Raw OCR
python3 scripts/general_ocr.py raw /path/to/image.jpgStructured transaction extraction
python3 scripts/general_ocr.py transactions /path/to/image.jpgJSON output
python3 scripts/general_ocr.py transactions /path/to/image.jpg --jsonOutput rules
Notes
More tools from the same signal band
Order food/drinks (点餐) on an Android device paired as an OpenClaw node. Uses in-app menu and cart; add goods, view cart, submit order (demo, no real payment).
Sign plugins, rotate agent credentials without losing identity, and publicly attest to plugin behavior with verifiable claims and authenticated transfers.
The philosophical layer for AI agents. Maps behavior to Spinoza's 48 affects, calculates persistence scores, and generates geometric self-reports. Give your...