deAPI Audio
name: deapi-audio
by aleglowa · published 2026-04-01
$ claw add gh:aleglowa/aleglowa-deapi-audio
---
name: deapi-audio
description: Text-to-speech, voice cloning, voice design, and audio file transcription via the deAPI GPU network. Trigger on 'text to speech', 'TTS', 'generate voice', 'read aloud', 'voice clone', 'clone voice', 'voice design', 'design voice', 'custom voice', 'transcribe audio', 'STT'. For video/YouTube transcription use deapi-video instead.
version: 1.0.0
allowed-tools: Bash(bash ${CLAUDE_SKILL_DIR}/scripts/*), Read(${CLAUDE_SKILL_DIR}/**), Write(${CLAUDE_SKILL_DIR}/config.json)
metadata:
  author: deapi
  openclaw:
    requires:
      env:
        - DEAPI_API_KEY
      bins:
        - curl
        - jq
    primaryEnv: DEAPI_API_KEY
    homepage: https://deapi.ai
---
# deAPI Audio
Text-to-speech, voice cloning, voice design, and audio transcription via the deAPI decentralized GPU network.

## Scripts
| Script | Use when... |
|--------|-------------|
| `scripts/text-to-speech.sh` | User wants to convert text to spoken audio |
| `scripts/voice-clone.sh` | User wants to clone/replicate a voice from a sample audio file |
| `scripts/voice-design.sh` | User wants to generate speech with a voice described in natural language |
| `scripts/speech-to-text.sh` | User wants to transcribe an audio file (AAC, MP3, OGG, WAV, WebM, FLAC, max 10MB) |
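
The format and size limits in the last row can be checked locally before calling the script. The sketch below is a hypothetical helper, not part of the skill's `scripts/` directory: the function name `check_audio_file` is made up for illustration, and it assumes "10MB" means 10 MiB (10 × 1024 × 1024 bytes), which the source does not specify.

```bash
#!/usr/bin/env bash
# Hypothetical pre-flight check; not shipped with the skill.
# Returns 0 if the file looks acceptable for speech-to-text.sh, 1 otherwise.
check_audio_file() {
  local file="$1"
  local max_bytes=$((10 * 1024 * 1024))  # assumption: "10MB" means 10 MiB

  [ -f "$file" ] || { echo "not a file: $file" >&2; return 1; }

  # Lowercase the extension and compare it with the formats in the table.
  local ext="${file##*.}"
  ext="$(printf '%s' "$ext" | tr '[:upper:]' '[:lower:]')"
  case "$ext" in
    aac|mp3|ogg|wav|webm|flac) ;;
    *) echo "unsupported format: .$ext" >&2; return 1 ;;
  esac

  # wc -c works on both GNU and BSD userlands; strip padding spaces.
  local size
  size="$(wc -c < "$file" | tr -d '[:space:]')"
  if [ "$size" -gt "$max_bytes" ]; then
    echo "file too large: $size bytes" >&2
    return 1
  fi
}
```

A possible use is `check_audio_file recording.mp3 && bash scripts/speech-to-text.sh --audio recording.mp3`. The check only applies to local files; a URL passed to `--audio` would need a different approach.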
## Your config
!`cat ${CLAUDE_SKILL_DIR}/config.json 2>/dev/null || echo "NOT_CONFIGURED"`
If the config above is `NOT_CONFIGURED`, ask the user for their deAPI API key.
Then write the answer to `${CLAUDE_SKILL_DIR}/config.json` as `{ "api_key": "their_key" }`.
Alternatively, the user can set the `DEAPI_API_KEY` environment variable directly, which takes priority over config.json.
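
The precedence described above (environment variable first, then `config.json`) might be implemented along these lines. This is a minimal sketch, not the skill's actual code: `resolve_api_key` is a hypothetical name, and falling back to the current directory when `CLAUDE_SKILL_DIR` is unset is an assumption made so the snippet runs standalone.

```bash
#!/usr/bin/env bash
# Hypothetical helper illustrating the key lookup order; not shipped with the skill.
resolve_api_key() {
  # Assumption: use the current directory if CLAUDE_SKILL_DIR is unset.
  local config="${CLAUDE_SKILL_DIR:-.}/config.json"

  # 1. The environment variable wins when it is set and non-empty.
  if [ -n "${DEAPI_API_KEY:-}" ]; then
    printf '%s\n' "$DEAPI_API_KEY"
    return 0
  fi

  # 2. Otherwise read the "api_key" field from config.json, if present.
  if [ -f "$config" ]; then
    local key
    key="$(jq -r '.api_key // empty' "$config")"
    if [ -n "$key" ]; then
      printf '%s\n' "$key"
      return 0
    fi
  fi

  # 3. Nothing found: report it the same way the config check does.
  echo "NOT_CONFIGURED" >&2
  return 1
}
```

A caller could then write `key="$(resolve_api_key)" || exit 1` and pass `$key` to whatever makes the request.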
## Gotchas

## Quick examples
```bash
# Basic TTS
bash scripts/text-to-speech.sh --text "Hello world"

# British voice
bash scripts/text-to-speech.sh --text "Good morning" --voice bf_emma

# Chatterbox model (multilingual)
bash scripts/text-to-speech.sh --model Chatterbox --text "Bonjour le monde" --lang fr

# Qwen3 model
bash scripts/text-to-speech.sh --model Qwen3 --text "Hello world" --voice Serena --lang English

# Clone a voice from a sample
bash scripts/voice-clone.sh --text "Hello, this is my cloned voice" --ref-audio /path/to/sample.mp3

# Clone with reference transcript for better accuracy
bash scripts/voice-clone.sh --text "Welcome to the show" --ref-audio /path/to/sample.wav --ref-text "This is the original transcript"

# Design a custom voice from description
bash scripts/voice-design.sh --text "Good morning everyone" --instruct "A warm, deep male voice with a slight British accent"

# Voice design in another language
bash scripts/voice-design.sh --text "Bonjour tout le monde" --instruct "A cheerful young female voice" --lang French

# Transcribe audio file (local or URL)
bash scripts/speech-to-text.sh --audio /path/to/recording.mp3
bash scripts/speech-to-text.sh --audio "https://example.com/podcast.mp3"
```

For the full voice list and language codes, see [references/voices.md](references/voices.md).