qwen-tts

⚠Review·Scanned 2/18/2026

This skill provides a local Qwen3-based TTS CLI and optional HTTP server for generating WAV audio. It instructs running bash scripts/setup.sh, starting a server bound to 0.0.0.0:8765 and calling http://192.168.188.177:8765, and setting env vars like QWEN_TTS_REMOTE.

from clawhub.ai·vb8f6ecf·35.9 KB·0 installs

Scanned from 1.0.0 at b8f6ecf · Transparency log ↗

$ vett add clawhub.ai/paki81/qwen-ttsReview findings below

Qwen3-TTS Skill

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice model.

Installation

cd /home/brewuser/.nvm/versions/node/v24.13.0/lib/node_modules/clawdbot/skills/public/qwen-tts
bash scripts/setup.sh

This will:

Create a Python 3.12 virtual environment in ./venv
Install qwen-tts package and dependencies (~500MB)
First synthesis auto-downloads ~1.7GB model

Quick Test

scripts/tts.py "Ciao, questo è un test!" -l Italian -o test.wav

Play the audio:

aplay test.wav  # Linux
# or
ffplay test.wav  # Cross-platform

Usage

See SKILL.md for complete documentation.

Basic:

scripts/tts.py "Your text" -l Italian -o output.wav

List speakers:

scripts/tts.py --list-speakers

With emotion:

scripts/tts.py "Sono felice!" -i "Parla con entusiasmo" -l Italian

Integration with OpenClaw

The skill is automatically available to OpenClaw once installed. OpenClaw can call:

cd skills/public/qwen-tts && scripts/tts.py "Text" -l Italian -o /tmp/audio.wav

Output path is printed to stdout (last line).

Requirements

Python 3.10-3.12 (tested with 3.12)
~2.2GB disk space (500MB venv + 1.7GB model)
GPU recommended (CPU works but slower)

License

Uses Qwen3-TTS under Apache 2.0 license. Check model card for details: https://huggingface.co/Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice