FIND-20260402-008 · 2026-04-02 · Innovation Veille

Trending: microsoft/VibeVoice — Open-source frontier voice AI (TTS + ASR, 34k stars, MIT)

trending-repo LOW
Microsoft VibeVoice is a family of open-source frontier voice AI models covering text-to-speech (TTS) and automatic speech recognition (ASR). Repository stats: 34,404 stars, 144 open issues, Python, MIT license, last pushed April 1, 2026. Key innovations: continuous speech tokenizers running at an ultra-low 7.5 Hz frame rate; a next-token diffusion framework; VibeVoice-ASR, which handles 60-minute long-form audio in a single pass with speaker diarization and timestamps; and VibeVoice-Realtime-0.5B for streaming TTS. Not directly relevant to ODS's current stack (no service has a voice interface), but could become relevant for future Notification Hub voice alerts or DocSign voice annotation features.
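
To give a rough sense of why the 7.5 Hz frame rate matters for long-form audio, the sketch below counts the frames needed to cover 60 minutes. It assumes the 7.5 Hz rate applies to the audio being processed. The 50 Hz comparison rate is a hypothetical figure chosen only for illustration and does not come from the repository, and `frames_for_duration` is a made-up helper name.

```python
def frames_for_duration(frame_rate_hz: float, minutes: float) -> int:
    """Number of tokenizer frames needed to cover `minutes` of audio."""
    return round(frame_rate_hz * minutes * 60)


VIBEVOICE_FRAME_RATE_HZ = 7.5    # frame rate stated in this finding
COMPARISON_FRAME_RATE_HZ = 50.0  # assumption: illustrative comparison only

long_form = frames_for_duration(VIBEVOICE_FRAME_RATE_HZ, 60)
comparison = frames_for_duration(COMPARISON_FRAME_RATE_HZ, 60)

print(long_form)               # 27000
print(comparison)              # 180000
print(comparison / long_form)  # ratio of sequence lengths
```

Under these assumptions, an hour of audio maps to 27,000 frames, which is the kind of sequence length that makes single-pass processing plausible.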

Source

https://github.com/microsoft/VibeVoice

ODS Impact

No immediate ODS impact. SecureMail and Notification Hub could use ASR for voice-to-text secure messaging in a future phase. Low priority for the current P0-P1 roadmap.

Security Review

License: MIT | Maintenance: ACTIVE | Risk: LOW | Recommendation: SAFE_TO_USE

Tags

trending-repo voice-ai tts asr microsoft python ml