Tag: voice-synthesis
skill
Custom Voice Cloning and Impressions for VoiceMode
This skill enables voice cloning and custom impressions for VoiceMode by synthesizing fresh speech using a local TTS service. It requires a short reference clip and operates via the mlx-audio backend on Apple Silicon.
skill
★ 5
Gemini Text-to-Speech Audio Generation
Converts text into natural, expressive spoken audio using Google's Gemini TTS model via the gemini-media MCP. It supports multiple voices and language configurations, outputting 24kHz PCM audio files.