Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VibeVoice-Realtime-0.5B
like
461
Follow
Microsoft
16.9k
Text-to-Speech
Transformers
Safetensors
English
vibevoice_streaming
Realtime TTS
Streaming text input
Long-form speech generation
arxiv:
2508.19205
arxiv:
2412.08635
License:
mit
Model card
Files
Files and versions
xet
Community
11
Deploy
Use this model
main
VibeVoice-Realtime-0.5B
2.04 GB
5 contributors
History:
10 commits
unilm
mcfadyeni
Fixed tag typo (long-from -> long-form) (
#11
)
5212720
verified
about 2 hours ago
figures
add model overview
4 days ago
.gitattributes
Safe
1.57 kB
add model overview
4 days ago
README.md
9.78 kB
Fixed tag typo (long-from -> long-form) (#11)
about 2 hours ago
config.json
2.12 kB
add VibeVoice-Realtime-0.5B
4 days ago
model.safetensors
2.04 GB
xet
add VibeVoice-Realtime-0.5B
4 days ago
preprocessor_config.json
Safe
360 Bytes
add VibeVoice-Realtime-0.5B
4 days ago