Component technology layer

Speech and audio models

10

alternatives

0

open source

1

node kinds

2026-06-10

retrieved

ASR, TTS, translation, and audio generation models.

Hugging Face downloads among selected speech/audio model alternatives.

Ranked alternatives

Popularity-ranked entries linked back to graph nodes and deterministic sources.

RankComponentKindOpennessPopularity metricTasksSource
1XTTS-v2model:xtts-v2modelopen weights9,034,852Hugging Face downloads as of 2026-06-10text-to-speechsource
2Whisper large-v3model:whisper-large-v3modelopen weights5,054,098Hugging Face downloads as of 2026-06-10speech-recognition, speech-translationsource
3Whisper smallmodel:whisper-smallmodelopen weights2,315,618Hugging Face downloads as of 2026-06-10speech-recognitionsource
4wav2vec2-base-960hmodel:wav2vec2-base-960hmodelopen weights1,098,753Hugging Face downloads as of 2026-06-10speech-recognitionsource
5Distil-Whisper large-v3model:distil-large-v3modelopen weights933,990Hugging Face downloads as of 2026-06-10speech-recognitionsource
6SeamlessM4T v2 Largemodel:seamless-m4t-v2-largemodelopen weights410,860Hugging Face downloads as of 2026-06-10speech-translationsource
7Parakeet TDT 0.6B v2model:parakeet-tdt-0.6b-v2modelopen weights360,108Hugging Face downloads as of 2026-06-10speech-recognitionsource
8MusicGen Smallmodel:musicgen-smallmodelopen weights186,078Hugging Face downloads as of 2026-06-10text-to-audiosource
9SpeechT5 TTSmodel:speecht5-ttsmodelopen weights105,237Hugging Face downloads as of 2026-06-10text-to-speechsource
10Bark Smallmodel:bark-smallmodelopen weights50,282Hugging Face downloads as of 2026-06-10text-to-speechsource