Tag: text-to-speech

Type: All Skills Tools
tool ★ 4,829

local offline text-to-speech synthesis tool

This utility provides local, offline text-to-speech conversion using the sherpa-onnx engine. It synthesises audio from provided text strings, requiring pre-configured runtime and voice models.

the-open-agent/openagent tts text-to-speech offline audio
skill

Local offline text-to-speech generation skill

This skill provides local, offline text-to-speech functionality using the sherpa-onnx CLI. It allows developers to generate high-quality audio files from text without requiring cloud connectivity or external APIs.

casibase/casibase tts text-to-speech offline audio
tool

ElevenLabs TTS for Local Audio Playback

This CLI utility provides text-to-speech generation using ElevenLabs, supporting multiple models and advanced SSML-like tags for fine-grained audio control. It facilitates local playback and allows developers to define voice characters and …

casibase/casibase elevenlabs tts audio-generation cli
skill

Voice-enabled conversational agent interface

This skill facilitates natural voice interactions by integrating Speech-to-Text and Text-to-Speech capabilities via the `voicemode:converse` MCP tool. It supports advanced conversational patterns, including parallel execution of speech and …

mbailey/voice-mcp voice-interaction speech-to-text text-to-speech mcp-tool
skill ★ 5

Gemini Text-to-Speech Audio Generation

Converts text into natural, expressive spoken audio using Google's Gemini TTS model via the gemini-media MCP. It supports multiple voices and language configurations, outputting 24kHz PCM audio files.

mordor-forge/gemini-media-mcp text-to-speech gemini-tts audio-generation mcp
skill ★ 372,633

ElevenLabs Text-to-Speech Generation and Playback

This skill provides local playback for ElevenLabs Text-to-Speech, allowing developers to generate high-quality audio from text. It supports advanced delivery control via custom tags for emotion, pacing, and detailed normalization rules.

openclaw/openclaw tts elevenlabs text-to-speech audio