ASR, TTS, translation, and audio generation models.
Hugging Face downloads among selected speech/audio model alternatives.
Ranked alternatives
Popularity-ranked entries linked back to graph nodes and deterministic sources.
| Rank | Component | Kind | Openness | Popularity metric | Tasks | Source |
|---|---|---|---|---|---|---|
| 1 | XTTS-v2model:xtts-v2 | model | open weights | 9,034,852Hugging Face downloads as of 2026-06-10 | text-to-speech | source |
| 2 | Whisper large-v3model:whisper-large-v3 | model | open weights | 5,054,098Hugging Face downloads as of 2026-06-10 | speech-recognition, speech-translation | source |
| 3 | Whisper smallmodel:whisper-small | model | open weights | 2,315,618Hugging Face downloads as of 2026-06-10 | speech-recognition | source |
| 4 | wav2vec2-base-960hmodel:wav2vec2-base-960h | model | open weights | 1,098,753Hugging Face downloads as of 2026-06-10 | speech-recognition | source |
| 5 | Distil-Whisper large-v3model:distil-large-v3 | model | open weights | 933,990Hugging Face downloads as of 2026-06-10 | speech-recognition | source |
| 6 | SeamlessM4T v2 Largemodel:seamless-m4t-v2-large | model | open weights | 410,860Hugging Face downloads as of 2026-06-10 | speech-translation | source |
| 7 | Parakeet TDT 0.6B v2model:parakeet-tdt-0.6b-v2 | model | open weights | 360,108Hugging Face downloads as of 2026-06-10 | speech-recognition | source |
| 8 | MusicGen Smallmodel:musicgen-small | model | open weights | 186,078Hugging Face downloads as of 2026-06-10 | text-to-audio | source |
| 9 | SpeechT5 TTSmodel:speecht5-tts | model | open weights | 105,237Hugging Face downloads as of 2026-06-10 | text-to-speech | source |
| 10 | Bark Smallmodel:bark-small | model | open weights | 50,282Hugging Face downloads as of 2026-06-10 | text-to-speech | source |