Alibaba Cloud Model Studio Entry (Routing)
name: alicloud-ai-entry-modelstudio
by cinience · published 2026-03-22
$ claw add gh:cinience/cinience-alicloud-ai-entry-modelstudio---
name: alicloud-ai-entry-modelstudio
description: Route Alibaba Cloud Model Studio requests to the right local skill (Qwen Image, Qwen Image Edit, Wan Video, Wan R2V, Qwen TTS, Qwen ASR and advanced TTS variants). Use when the user asks for Model Studio without specifying a capability.
version: 1.0.0
---
Category: task
# Alibaba Cloud Model Studio Entry (Routing)
Route requests to existing local skills to avoid duplicating model/parameter details.
Prerequisites
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscopeRouting Table (currently supported in this repo)
| Need | Target skill |
| --- | --- |
| Text-to-image / image generation | `skills/ai/image/alicloud-ai-image-qwen-image/` |
| Image editing | `skills/ai/image/alicloud-ai-image-qwen-image-edit/` |
| Text-to-video / image-to-video (i2v) | `skills/ai/video/alicloud-ai-video-wan-video/` |
| Reference-to-video (r2v) | `skills/ai/video/alicloud-ai-video-wan-r2v/` |
| Text-to-speech (TTS) | `skills/ai/audio/alicloud-ai-audio-tts/` |
| Speech recognition/transcription (ASR) | `skills/ai/audio/alicloud-ai-audio-asr/` |
| Realtime speech recognition | `skills/ai/audio/alicloud-ai-audio-asr-realtime/` |
| Realtime TTS | `skills/ai/audio/alicloud-ai-audio-tts-realtime/` |
| Live speech translation | `skills/ai/audio/alicloud-ai-audio-livetranslate/` |
| CosyVoice voice clone | `skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-clone/` |
| CosyVoice voice design | `skills/ai/audio/alicloud-ai-audio-cosyvoice-voice-design/` |
| Voice clone | `skills/ai/audio/alicloud-ai-audio-tts-voice-clone/` |
| Voice design | `skills/ai/audio/alicloud-ai-audio-tts-voice-design/` |
| Omni multimodal interaction | `skills/ai/multimodal/alicloud-ai-multimodal-qwen-omni/` |
| Visual reasoning | `skills/ai/multimodal/alicloud-ai-multimodal-qvq/` |
| Text embeddings | `skills/ai/search/alicloud-ai-search-text-embedding/` |
| Rerank | `skills/ai/search/alicloud-ai-search-rerank/` |
| Vector retrieval | `skills/ai/search/alicloud-ai-search-dashvector/` or `skills/ai/search/alicloud-ai-search-opensearch/` or `skills/ai/search/alicloud-ai-search-milvus/` |
| Document understanding | `skills/ai/text/alicloud-ai-text-document-mind/` |
| Video editing | `skills/ai/video/alicloud-ai-video-wan-edit/` |
| Model list crawl/update | `skills/ai/misc/alicloud-ai-misc-crawl-and-skill/` |
When Not Matched
Common Missing Capabilities In This Repo (remaining gaps)
Async Task Polling Template (video/long-running tasks)
When `X-DashScope-Async: enable` returns `task_id`, poll as follows:
GET https://dashscope.aliyuncs.com/api/v1/tasks/<task_id>
Authorization: Bearer $DASHSCOPE_API_KEYExample result fields (success):
{
"output": {
"task_status": "SUCCEEDED",
"video_url": "https://..."
}
}Notes:
Clarifying questions (ask when uncertain)
1. Are you working with text, image, audio, or video?
2. Is this generation, editing/understanding, or retrieval?
3. Do you need speech (TTS/ASR/live translate) or retrieval (embedding/rerank/vector DB)?
4. Do you want runnable SDK scripts or just API/parameter guidance?
References
Validation
mkdir -p output/alicloud-ai-entry-modelstudio
echo "validation_placeholder" > output/alicloud-ai-entry-modelstudio/validate.txtPass criteria: command exits 0 and `output/alicloud-ai-entry-modelstudio/validate.txt` is generated.
Output And Evidence
Workflow
1) Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
2) Run one minimal read-only query first to verify connectivity and permissions.
3) Execute the target operation with explicit parameters and bounded scope.
4) Verify results and save output/evidence files.
More tools from the same signal band
Order food/drinks (点餐) on an Android device paired as an OpenClaw node. Uses in-app menu and cart; add goods, view cart, submit order (demo, no real payment).
Sign plugins, rotate agent credentials without losing identity, and publicly attest to plugin behavior with verifiable claims and authenticated transfers.
The philosophical layer for AI agents. Maps behavior to Spinoza's 48 affects, calculates persistence scores, and generates geometric self-reports. Give your...