DupFinder
name: DupFinder
by bytesagain · published 2026-03-22
$ claw add gh:bytesagain/bytesagain-dupfinder---
name: DupFinder
description: "Scan directories for duplicate files by hash to reclaim disk space. Use when finding duplicates, reclaiming space, comparing checksums."
version: "2.0.0"
author: "BytesAgain"
homepage: https://bytesagain.com
source: https://github.com/bytesagain/ai-skills
tags: ["duplicate","finder","files","cleanup","storage","hash","disk","dedup"]
categories: ["System Tools", "Utility"]
---
# DupFinder
Utility toolkit for running checks, converting data, analyzing content, generating reports, previewing results, and batch-processing entries. A general-purpose CLI for logging structured operations and exporting collected data.
Commands
| Command | Description |
|---------|-------------|
| `dupfinder run <input>` | Execute the main function and log the input |
| `dupfinder check <input>` | Run a check and record the result |
| `dupfinder convert <input>` | Log a data conversion operation |
| `dupfinder analyze <input>` | Record an analysis result |
| `dupfinder generate <input>` | Log a generated output or asset |
| `dupfinder preview <input>` | Record a preview observation |
| `dupfinder batch <input>` | Log a batch processing operation |
| `dupfinder compare <input>` | Record a comparison between items |
| `dupfinder export <input>` | Log an export operation |
| `dupfinder config <input>` | Record a configuration change |
| `dupfinder status <input>` | Log a status observation |
| `dupfinder report <input>` | Record a report entry |
| `dupfinder stats` | Show summary statistics across all logs |
| `dupfinder export <fmt>` | Export all data (json, csv, or txt) |
| `dupfinder search <term>` | Search across all log files for a term |
| `dupfinder recent` | Show the 20 most recent activity entries |
| `dupfinder status` | Health check — version, disk usage, last activity (no args) |
| `dupfinder help` | Show all available commands |
| `dupfinder version` | Show current version |
Each command without arguments displays the most recent 20 entries from its log file.
Data Storage
All data is stored in `~/.local/share/dupfinder/`:
Data format: each entry is stored as `YYYY-MM-DD HH:MM|<value>`, pipe-delimited for easy parsing.
Requirements
When to Use
1. **Tracking file operations** — log every check, conversion, and analysis with timestamps for a complete audit trail
2. **Batch processing workflows** — record batch operations and compare results across multiple runs
3. **Data analysis journaling** — use `analyze` and `report` to document findings as you work through a dataset
4. **Configuration change tracking** — log config changes with `config` so you can trace what changed and when
5. **Exporting operational data** — export the full history as JSON, CSV, or plain text for reporting or integration with other tools
Examples
# Run the main function with an input
dupfinder run "scan /home/user/documents"
# Record a check result
dupfinder check "file-integrity OK for /var/data"
# Log an analysis
dupfinder analyze "Found 342 entries matching pattern *.tmp"
# Compare two datasets
dupfinder compare "dataset-A vs dataset-B: 97% overlap"
# Batch process a directory listing
dupfinder batch "processed 1500 files in /mnt/archive"
# Record a configuration change
dupfinder config "max_depth=5, hash_algo=sha256"
# Export all data as JSON
dupfinder export json
# Search logs for a specific term
dupfinder search "integrity"
# Show summary statistics
dupfinder stats
# View recent activity
dupfinder recent
# Check overall health
dupfinder statusOutput
All command output goes to stdout. Redirect to a file if needed:
dupfinder stats > summary.txt
dupfinder export csv---
*Powered by BytesAgain | bytesagain.com | hello@bytesagain.com*
More tools from the same signal band
Order food/drinks (点餐) on an Android device paired as an OpenClaw node. Uses in-app menu and cart; add goods, view cart, submit order (demo, no real payment).
Sign plugins, rotate agent credentials without losing identity, and publicly attest to plugin behavior with verifiable claims and authenticated transfers.
The philosophical layer for AI agents. Maps behavior to Spinoza's 48 affects, calculates persistence scores, and generates geometric self-reports. Give your...