Aliyun Speech Transcriber
name: aliyun-speech-transcriber
by chenggongdu · published 2026-04-01
$ claw add gh:chenggongdu/chenggongdu-aliyun-speech-transcriber---
name: aliyun-speech-transcriber
description: Transcribe publicly accessible audio or video URLs with Aliyun speech services. Use when the user wants speech-to-text via Aliyun DashScope, needs transcript JSON or extracted plain text, or wants to process a cloud-accessible media URL (including signed Qiniu URLs) into transcription results.
homepage: https://dashscope.aliyun.com/
metadata: {"clawdbot":{"emoji":"🎤","requires":{"env":["ASR_DASHSCOPE_API_KEY"]},"tags":["speech","transcription","asr","audio","aliyun","dashscope"]}}
---
# Aliyun Speech Transcriber
Use this skill to turn externally accessible media URLs into transcript results.
Current scope
Current implementation focuses on **DashScope file transcription** using the `paraformer-v2` model, aligned with the existing Java service pattern.
Required environment variables
Fallback supported:
Optional:
Inputs
Pass one or more externally accessible URLs:
node scripts/transcribe.js --file-url "https://example.com/audio.mp3"Multiple files:
node scripts/transcribe.js --file-url "https://a.com/1.mp3" --file-url "https://a.com/2.mp3"Output
The script returns JSON with:
`text` is a best-effort plain-text extraction from the final JSON result.
Chaining from Qiniu
Typical workflow:
1. Use `qiniu-upload` to upload a local file.
2. Prefer a signed private URL if the domain is not anonymously readable.
3. Pass the returned URL into this skill.
Safety rules
More tools from the same signal band
Order food/drinks (点餐) on an Android device paired as an OpenClaw node. Uses in-app menu and cart; add goods, view cart, submit order (demo, no real payment).
Sign plugins, rotate agent credentials without losing identity, and publicly attest to plugin behavior with verifiable claims and authenticated transfers.
The philosophical layer for AI agents. Maps behavior to Spinoza's 48 affects, calculates persistence scores, and generates geometric self-reports. Give your...