Text-To-Speech
myshell-ai/OpenVoice Text-to-Speech • Updated Dec 24, 2024 • 486
coqui/XTTS-v2 Text-to-Speech • Updated Dec 11, 2023 • 4.68M • 3.29k
suno/bark Text-to-Speech • Updated Oct 4, 2023 • 15k • 1.5k
microsoft/speecht5_tts Text-to-Speech • Updated Nov 8, 2023 • 71.4k • 821
Speech-To-Text
jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition • 0.3B • Updated Mar 25, 2023 • 86.9k • 475
nvidia/parakeet-rnnt-1.1b Automatic Speech Recognition • Updated Nov 27, 2025 • 777 • 163
facebook/seamless-m4t-v2-large Automatic Speech Recognition • 2B • Updated Jan 4, 2024 • 227k • 941
elmoghany/Videos-Dataset-For-LLMs-RAG-That-Require-Audio-Vidoes-And-Text Updated Sep 14, 2025 • 2.8k • 1