walkie-talkie
✓Verified·Scanned 2/18/2026
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
from clawhub.ai·va29e83b·1.1 KB·0 installs
Scanned from 1.0.0 at 1258b13 · Transparency log ↗
$ vett add clawhub.ai/rubenfb23/walkie-talkie
Walkie-Talkie Mode
This skill automates the voice-to-voice loop on WhatsApp using local transcription and local TTS.
Workflow
-
Incoming Audio: When a user sends an audio/ogg/opus file:
- Use
tools/transcribe_voice.shto get the text. - Process the text as a normal user prompt.
- Use
-
Outgoing Response:
- Instead of a text reply, generate speech using
bin/sherpa-onnx-tts. - Send the resulting
.oggfile back to the user as a voice note.
- Instead of a text reply, generate speech using
Triggers
- User sends an audio message.
- User says "activa modo walkie-talkie" or "hablemos por voz".
Constraints
- Use local tools only (ffmpeg, whisper-cpp, sherpa-onnx-tts).
- Maintain a fast response time (RTF < 0.5).
- Always reply with BOTH text (for clarity) and audio.
Manual Execution (Internal)
To respond with voice manually:
bin/sherpa-onnx-tts /tmp/reply.ogg "Tu mensaje aquí"
Then send /tmp/reply.ogg via message tool with filePath.