Tag: audio
Local offline text-to-speech synthesis tool
This utility provides local, offline text-to-speech conversion using the sherpa-onnx engine. It synthesises audio from text strings and requires a pre-configured runtime and voice models.
ElevenLabs TTS for Local Audio Playback
This command-line tool provides local playback access to ElevenLabs Text-to-Speech, supporting multiple models and custom voice selection. It allows advanced speech control via SSML-like tags for dramatic delivery, normalization, and struct…
BluOS CLI for media control and grouping
This skill provides a command-line interface to manage and control Bluesound/NAD streaming devices. It allows developers to perform actions such as checking device status, controlling playback, adjusting volume, and managing device groups.
Natural voice interaction for conversational AI
This skill enables natural, bi-directional voice conversations by integrating Speech-to-Text and Text-to-Speech capabilities. Developers can use the converse MCP tool to speak responses and listen for user input, supporting parallel executi…
Local offline text-to-speech generation skill
This skill provides local, offline text-to-speech functionality using the sherpa-onnx CLI. It allows developers to generate high-quality audio files from text without requiring cloud connectivity or external APIs.
BluOS CLI for media control and discovery
This tool provides a command-line interface to manage and control Bluesound/NAD streaming devices. It allows users to perform actions such as device discovery, volume adjustment, playback control, and group management.
Initiate ongoing voice conversations via MCP
This tool facilitates starting a persistent voice conversation session. It accepts optional parameters for specifying a custom voice profile and providing an initial message prompt.
Local offline text-to-speech generation
This skill provides local, offline text-to-speech functionality using the sherpa-onnx CLI. It allows developers to generate audio files from text without requiring cloud connectivity or external APIs.
ElevenLabs Text-to-Speech Generation and Playback
This skill provides local playback for ElevenLabs Text-to-Speech, allowing developers to generate high-quality audio from text. It supports advanced delivery control via custom tags for emotion, pacing, and detailed normalization rules.
Local Speech-to-Text Transcription using Whisper
This tool provides local, offline speech-to-text transcription using the Whisper CLI. It supports various audio formats, can translate speech as well as transcribe it, and lets you choose the output format.
BluOS CLI for media control and discovery
This CLI utility provides comprehensive control over Bluesound/NAD network players, enabling functions such as device discovery, playback management, volume adjustment, and advanced grouping. It supports JSON output for seamless integration…