Self-Supervised Speech Pre-training and Representation Learning Toolkit
Updated Jun 13, 2025 - Python
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework
Live speech recognition using Facebook's wav2vec 2.0 model.
PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI
Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf
wav2vec 2.0 recognition pipeline
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recognition
Speech recognition for Indic languages.
A library version of the wav2vec 2.0 framework for the automatic speech recognition task.
Create high-resolution visually dubbed videos with DINet
No API keys | local | llama3.1. For language study and live translation
Deep audio modeling
A repo to make installation and training of a wav2vec model easier
Diagnosis of the onset of Parkinson's disease, using wav2vec as a feature extractor and a random forest as a classifier. This is an easy-to-use suite; refer to the README for usage guides.
A collection of speech language models with a focus on acoustic codes
Wav2Vec STT