Tag: text-to-speech
local offline text-to-speech synthesis tool
This utility provides local, offline text-to-speech conversion using the sherpa-onnx engine. It synthesises audio from provided text strings, requiring pre-configured runtime and voice models.
Local offline text-to-speech generation skill
This skill provides local, offline text-to-speech functionality using the sherpa-onnx CLI. It allows developers to generate high-quality audio files from text without requiring cloud connectivity or external APIs.
ElevenLabs TTS for Local Audio Playback
This CLI utility provides text-to-speech generation using ElevenLabs, supporting multiple models and advanced SSML-like tags for fine-grained audio control. It facilitates local playback and allows developers to define voice characters and …
Voice-enabled conversational agent interface
This skill facilitates natural voice interactions by integrating Speech-to-Text and Text-to-Speech capabilities via the `voicemode:converse` MCP tool. It supports advanced conversational patterns, including parallel execution of speech and …
Gemini Text-to-Speech Audio Generation
Converts text into natural, expressive spoken audio using Google's Gemini TTS model via the gemini-media MCP. It supports multiple voices and language configurations, outputting 24kHz PCM audio files.
ElevenLabs Text-to-Speech Generation and Playback
This skill provides local playback for ElevenLabs Text-to-Speech, allowing developers to generate high-quality audio from text. It supports advanced delivery control via custom tags for emotion, pacing, and detailed normalization rules.