Using Wav2Vec 2.0 / HuBERT / WavLM and Whisper from HuggingFace with SpeechBrain
This tutorial describes how to combine (use and finetune) pretrained models coming from HuggingFace. Any wav2vec 2.0 / HuBERT / WavLM or Whisper model integrated to the transformers interface of HuggingFace can be then plugged to SpeechBrain to approach a speech-related task: automatic speech recognition, speaker recognition, spoken language understanding ...
Open in Google Colab