
// Skill profile

MiniMax Easy Voice

Name: MiniMax Easy Voice
Author: blue-coconut

name: mm-easy-voice

by blue-coconut · published 2026-03-22

API integration · Task automation


Last updated

2026-03

// Install command

$ claw add gh:blue-coconut/blue-coconut-mm-easy-voice


// Full documentation

---

name: mm-easy-voice

description: Simple text-to-speech skill using MiniMax Voice API. Converts text to audio with customizable voice selection. Use for generating speech audio from text.

---

# MiniMax Easy Voice

Simple text-to-speech skill powered by MiniMax Voice API. Converts any text into natural-sounding audio with customizable voice selection.

## Usage

### [Step 1] Preparation

**IMPORTANT:** Before using this skill, always verify the environment:

    python check_environment.py

**If the API key is not set:**

    export MINIMAX_VOICE_API_KEY="your-api-key-here"

**Requirements:**

- Python 3.8+
- `MINIMAX_VOICE_API_KEY` environment variable (required)
- FFmpeg (optional, for audio merging/conversion)
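The requirements above can also be checked from Python. The sketch below is not the bundled `check_environment.py`, whose contents are not shown here. The function names `check_requirements` and `ffmpeg_available` are hypothetical, and the only facts taken from this document are the Python version, the environment variable name, and FFmpeg being optional.

```python
import os
import shutil
import sys


def check_requirements(environ=None):
    """Return a list of problems; an empty list means the required items are met."""
    environ = os.environ if environ is None else environ
    problems = []
    if sys.version_info < (3, 8):
        problems.append("Python 3.8+ is required")
    if not environ.get("MINIMAX_VOICE_API_KEY"):
        problems.append("MINIMAX_VOICE_API_KEY is not set")
    return problems


def ffmpeg_available():
    """FFmpeg is optional; it is only listed for audio merging/conversion."""
    return shutil.which("ffmpeg") is not None


if __name__ == "__main__":
    for problem in check_requirements():
        print("ERROR:", problem)
    if not ffmpeg_available():
        print("NOTE: FFmpeg not found on PATH (needed for merge/convert)")
```

Passing a plain dictionary instead of `os.environ` makes the check easy to exercise without touching the real environment.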

### [Step 2] Generate speech audio from text

Convert any text to speech in one command:

    # Default voice (male-qn-qingse)
    python mmvoice.py tts "Hello, this is a test." -o test.mp3

    # Chinese text
    python mmvoice.py tts "你好，这是一个测试。" -o test_cn.mp3

    # Select a different voice with -v VOICE_ID
    python mmvoice.py tts "Hello world" -v female-shaonv -o hello.mp3

    # Specify the output path
    python mmvoice.py tts "Your text" -o /path/to/output.mp3

**Required parameters:**

- `text`: The text to convert to speech
- `-o OUTPUT`: Output audio file path

**Optional parameters:**

- `-v VOICE_ID`: Voice to use (default: `male-qn-qingse`)
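When the skill is driven from another Python script, the same command line can be assembled as an argument list, which avoids shell quoting problems with arbitrary text. This is a minimal sketch: `build_tts_command` and `run_tts` are hypothetical helper names, it assumes `mmvoice.py` is in the current working directory, and it assumes the script exits with a non-zero status on failure, which this document does not confirm.

```python
import subprocess
import sys


def build_tts_command(text, output, voice_id=None):
    """Build the argument list for `python mmvoice.py tts TEXT [-v VOICE_ID] -o OUTPUT`."""
    if not text:
        raise ValueError("text must not be empty")
    if not output:
        raise ValueError("an output path (-o) is required")
    command = [sys.executable, "mmvoice.py", "tts", text]
    if voice_id:
        # Omitting -v leaves the CLI on its documented default voice.
        command += ["-v", voice_id]
    command += ["-o", output]
    return command


def run_tts(text, output, voice_id=None):
    """Run the command; raises CalledProcessError if the exit status is non-zero."""
    subprocess.run(build_tts_command(text, output, voice_id), check=True)
```

For example, `run_tts("Hello world", "hello.mp3", voice_id="female-shaonv")` corresponds to the third command shown above.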

## Usage Details

**Text length limits:**

- Up to 10,000 characters per request
- For longer text, split it into multiple requests and merge the resulting audio files afterwards
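One way to split long text is to pack whole sentences into chunks that stay under the limit. The sketch below is an illustration, not part of the skill: `split_text` is a hypothetical helper, and it assumes the limit is counted the same way as Python's `len()` (Unicode code points), which this document does not specify. ASCII sentence punctuation only counts as a break when followed by whitespace, so decimal values such as those inside a `<#1.5#>` pause marker are not split.

```python
import re

MAX_CHARS = 10_000  # per-request limit stated above

# A sentence ends at ASCII punctuation followed by whitespace or end of text,
# at full-width punctuation, or at the end of the text.
_SENTENCE = re.compile(r".+?(?:[.!?]+(?=\s|$)\s*|[。！？]+\s*|$)", flags=re.S)


def split_text(text, limit=MAX_CHARS):
    """Split text into chunks of at most `limit` characters, preferring sentence breaks."""
    chunks, current = [], ""
    for sentence in _SENTENCE.findall(text):
        # A single sentence longer than the limit is cut at the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        if len(current) + len(sentence) > limit:
            chunks.append(current)
            current = ""
        current += sentence
    if current:
        chunks.append(current)
    return chunks
```

Joining the returned chunks reproduces the input exactly. The hard cut for over-long sentences can still land inside a pause marker, so very long unpunctuated passages are worth checking by hand.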

**Pause insertion:** Use `<#x#>`, where x is the pause duration in seconds

- Example: `"Hello<#1.5#>world"` inserts a 1.5-second pause between the two words
- Range: 0.01 to 99.99 seconds
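Pause markers can be generated rather than typed by hand, which also makes it easy to enforce the documented range. In this sketch `pause` and `join_with_pauses` are hypothetical helpers. Rounding to two decimal places is an assumption based on the stated bounds of 0.01 and 99.99; the document does not say how finer values are handled.

```python
def pause(seconds):
    """Return a `<#x#>` pause marker for a duration given in seconds."""
    if not 0.01 <= seconds <= 99.99:
        raise ValueError("pause must be between 0.01 and 99.99 seconds")
    # Round to two decimals and drop trailing zeros, e.g. 1.5 -> "1.5".
    return f"<#{round(seconds, 2):g}#>"


def join_with_pauses(parts, seconds):
    """Join text fragments with the same pause between each pair."""
    return pause(seconds).join(parts)
```

For instance, `"Hello" + pause(1.5) + "world"` yields the string used in the example above.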

**Emotion matching:** speech-2.8 models automatically match emotions to your text content

## Voice Selection

Choose the right voice for your content by consulting the voice catalog:

    # List all available voices
    python mmvoice.py list-voices

**Voice catalog:** `reference/voice_catalog.md`

The catalog contains:

- All available system voices (male, female)
- Voice characteristics and recommended use cases
- Guidance on selecting the right voice for your content

## Advanced Options

### Voice Management

**List available voices:**

    python mmvoice.py list-voices

**Clone a voice from an audio sample:**

    python mmvoice.py clone audio_file.mp3 --voice-id my-custom-voice

**Design a voice from a description:**

    python mmvoice.py design "A warm, gentle female voice" --voice-id designed-voice

### Audio Processing

**Merge multiple audio files:**

    python mmvoice.py merge file1.mp3 file2.mp3 file3.mp3 -o combined.mp3
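The merge command pairs naturally with the advice to split long text into several requests. The following sketch only plans the commands and does not run them. `plan_long_tts` and the `_partNNN` file naming are hypothetical, and it assumes `mmvoice.py` is in the current working directory and that text chunks have already been prepared.

```python
import sys
from pathlib import Path


def plan_long_tts(chunks, output, voice_id=None):
    """Return one `tts` command per chunk followed by a final `merge` command."""
    output = Path(output)
    commands, parts = [], []
    for index, chunk in enumerate(chunks, start=1):
        # e.g. book.mp3 -> book_part001.mp3, book_part002.mp3, ...
        part = output.with_name(f"{output.stem}_part{index:03d}{output.suffix}")
        command = [sys.executable, "mmvoice.py", "tts", chunk]
        if voice_id:
            command += ["-v", voice_id]
        command += ["-o", str(part)]
        commands.append(command)
        parts.append(str(part))
    commands.append([sys.executable, "mmvoice.py", "merge", *parts, "-o", str(output)])
    return commands
```

Each returned list can be passed to `subprocess.run(command, check=True)` in order. Merging relies on FFmpeg, which the requirements list as optional, so it needs to be installed for the final step.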

**Convert audio format:**

    python mmvoice.py convert input.wav -o output.mp3 --format mp3

## Reference Documents

Open these when you need more detail:

| Document | When to Use |
|----------|-------------|
| `reference/voice_catalog.md` | Choosing a voice_id |
| `reference/getting-started.md` | Environment setup |
| `reference/audio-guide.md` | Audio processing |
| `reference/voice-guide.md` | Voice cloning and design |
| `reference/troubleshooting.md` | Common issues and solutions |

## Troubleshooting

Common issues:

1. **API key not set:** Run `export MINIMAX_VOICE_API_KEY="your-key"`
2. **FFmpeg missing:** Install with `brew install ffmpeg` (macOS) or `sudo apt install ffmpeg` (Ubuntu)
3. **Voice not found:** Use `python mmvoice.py list-voices` to see available voices

Run the environment check:

    python check_environment.py

See `reference/troubleshooting.md` for more solutions.
