AI/智能体/模型工具 · TypeScript
jamiepine/voicebox
The open-source AI voice studio. Clone, dictate, create.
项目解读
The open-source AI voice studio. Clone, dictate, create. 主题标签包括 ai、cuda、mlx、qwen3-tts、qwen3-tts-ui、voice-ai、voice-clone、whisper。 README 重点章节包括:What is Voicebox?、Download、Features、Multi-Engine Voice Cloning、Emotions & Paralinguistic Tags。
README / GitHub 亮点
- GitHub 描述:The open-source AI voice studio. Clone, dictate, create.
- Clone any voice. Generate speech. Dictate into any app. Talk to agents in voices you own.
- The full voice I/O stack, running locally on your machine.
- Click the image above to watch the demo video on voicebox.sh。
适用场景
适合评估 AI 应用、智能体工作流、模型工具链、RAG/提示词工程或 AI 辅助开发场景。
采用前核查
采用前仍需核查许可证、维护节奏、issue 质量、release 记录和生产适配成本。
README 摘要
The open-source AI voice studio. Clone any voice. Generate speech. Dictate into any app. Talk to agents in voices you own. The full voice I/O stack, running locally on your machine. Click the image above to watch the demo video on voicebox.sh Voicebox is a local-first AI voice studio — a free and open-source alternative to ElevenLabs and WisprFlow in one app. Clone voices from a few seconds of audio, generate speech in 23 languages across 7 TTS engines, dictate into any text field with a global hotkey, and give an…