Skip to content
#

wavlm

Here are 19 public repositories matching this topic...

This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses HMM decoder to infer signing beats and t…

  • Updated Sep 4, 2022
  • Python

This project investigates the performance of different Machine Learning pipelines applied to the task of speech emotion recognition (SER) on the Italian Emozionalmente dataset. Pipelines: MFCC, Wav2Vec and WavLM (both as feature extractors and after fine-tuning), Audio Spectrogram Transformer, cross-linguistic evaluation with f-t WavLM on CREMA-D.

  • Updated Feb 17, 2026
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the wavlm topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wavlm topic, visit your repo's landing page and select "manage topics."

Learn more