microsoft/VibeVoice
⭐ 14.3K (+1.9K)
VibeVoice is Microsoft's open-source voice synthesis framework. It combines low-frame-rate continuous speech tokenizers, an LLM, and diffusion-based generation to target long-form multi-speaker synthesis and low-latency real-time scenarios. It is suitable for research and prototyping, but it carries compliance, licensing, and misuse risks.