Pronunciation Coach
name: pronunciation-coach
by crazybuffon · published 2026-03-22
$ claw add gh:crazybuffon/crazybuffon-pronunciation-coach
---
name: pronunciation-coach
description: Pronunciation coaching with real voice analysis using Azure Speech Services. Analyzes audio files for phoneme-level accuracy, fluency, prosody, and intonation scores.
env:
  AZURE_SPEECH_KEY: Azure Speech Service API Key
  AZURE_SPEECH_REGION: Azure Speech Service Region (e.g., southeastasia)
---
# Pronunciation Coach
Analyze spoken English pronunciation using Azure Speech Services and provide actionable coaching feedback.
**Privacy Note**: This skill reads local voice messages from `~/.openclaw/media/inbound/` and transmits them to Microsoft Azure Speech Services for processing.
## Prerequisites
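The frontmatter names two environment variables, `AZURE_SPEECH_KEY` and `AZURE_SPEECH_REGION`. A minimal preflight check, assuming both must be non-empty before any audio is sent, might look like the sketch below. `missingEnv` is a hypothetical helper and not one of the skill's scripts.

```javascript
// Variable names come from the skill's frontmatter; the helper itself is illustrative.
const REQUIRED = ['AZURE_SPEECH_KEY', 'AZURE_SPEECH_REGION'];

// Return the names of required variables that are unset or blank.
// `env` defaults to process.env but can be any plain object.
function missingEnv(env = process.env) {
  return REQUIRED.filter((name) => !env[name] || env[name].trim() === '');
}
```

A caller could run `missingEnv()` before the assessment step and stop with a clear message if the returned array is not empty.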
## Workflow
### 1. Receive Audio
Voice messages from Telegram are stored in `~/.openclaw/media/inbound/`. Find the latest `.ogg` file matching the message timestamp.
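If this lookup is scripted rather than done by eye, one possible approach is to pick the `.ogg` file whose modification time is closest to the message timestamp. The sketch below assumes the file's mtime is a reasonable proxy for when the voice message arrived. `closestOgg` and `inboundDir` are hypothetical names.

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

// Hypothetical helper: return the .ogg file in `dir` whose modification time
// is closest to `targetMs` (epoch milliseconds), or null if there is none.
function closestOgg(dir, targetMs) {
  let best = null;
  for (const name of fs.readdirSync(dir)) {
    if (path.extname(name).toLowerCase() !== '.ogg') continue;
    const file = path.join(dir, name);
    const gap = Math.abs(fs.statSync(file).mtimeMs - targetMs);
    if (best === null || gap < best.gap) best = { file, gap };
  }
  return best ? best.file : null;
}

// Node does not expand "~", so the inbound directory is built from the home directory.
const inboundDir = path.join(os.homedir(), '.openclaw', 'media', 'inbound');
```

Calling `closestOgg(inboundDir, Date.now())` returns the most recently modified file, which corresponds to the first row of the shell listing.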

    ls -lt ~/.openclaw/media/inbound/*.ogg | head -5

### 2. Run Assessment

    scripts/pronunciation-assess.sh <audio_file> "<reference_text>"

### 3. Generate Report
Pipe the JSON output into the report generator:

    scripts/pronunciation-assess.sh audio.ogg "reference text" | node scripts/pronunciation-report.js

The report includes:
### 4. Provide Coaching
After generating the report:
1. **Send the text report** to the user (scores + word breakdown)
2. **Identify top 3 problem sounds** from the phoneme scores
3. **Explain each problem**: what the correct sound is and how to produce it
   - See `references/phoneme-guide.md` for phoneme descriptions and fixes
4. **Send a voice message** (via TTS) demonstrating the correct pronunciation of problem words
5. **Assign practice** — give the user specific sentences to re-record focusing on weak sounds
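Step 2 above, picking the top 3 problem sounds, could be sketched as follows. This assumes the assessment JSON keeps Azure's detailed result layout (`NBest[0].Words[].Phonemes[].PronunciationAssessment.AccuracyScore`). The skill's script may reshape that output, so treat the field paths as an assumption. `topProblemPhonemes` is a hypothetical helper.

```javascript
// Assumed input: Azure-style detailed result with per-phoneme AccuracyScore values.
// Returns the `limit` phonemes with the lowest average accuracy, each with the
// words it occurred in, so the coaching step can cite concrete examples.
function topProblemPhonemes(result, limit = 3) {
  const words = result?.NBest?.[0]?.Words ?? [];
  const bySound = new Map();
  for (const word of words) {
    for (const p of word.Phonemes ?? []) {
      const score = p.PronunciationAssessment?.AccuracyScore;
      if (typeof score !== 'number') continue; // skip phonemes without a score
      const entry = bySound.get(p.Phoneme) ?? { phoneme: p.Phoneme, total: 0, count: 0, words: [] };
      entry.total += score;
      entry.count += 1;
      if (!entry.words.includes(word.Word)) entry.words.push(word.Word);
      bySound.set(p.Phoneme, entry);
    }
  }
  return [...bySound.values()]
    .map(({ phoneme, total, count, words }) => ({ phoneme, average: total / count, words }))
    .sort((a, b) => a.average - b.average) // lowest average first
    .slice(0, limit);
}
```

Averaging per phoneme rather than taking the single worst occurrence is a design choice here: it favors sounds that are consistently weak over one-off slips.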
## Coaching Tips