Stars
PyTorch-based source code for analyzing electronic health records (EHR)
Paper reproduction of Google's SCoRe (Training Language Models to Self-Correct via Reinforcement Learning)
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
A collection of papers on diffusion models for 3D generation.
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and trained at low cost, covering base models, domain-specific fine-tuning and applications, datasets, and tutorials.
Instant voice cloning by MIT and MyShell. Audio foundation model.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
✨✨Latest Advances on Multimodal Large Language Models
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama models.
Code for SpeechTokenizer, presented in "SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models". Samples are available on the project's demo page.
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
The world's largest GitHub Repository for LLMs + Robotics
A comprehensive list of Implicit Representations and NeRF papers relating to Robotics/RL domain, including papers, codes, and related websites
TidyBot: Personalized Robot Assistance with Large Language Models
Implementation of "PaLM-E: An Embodied Multimodal Language Model"
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor/tokenizer, along with MusicGen, a simple and controllable music generation model (a minimal usage sketch follows this list).
The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
SoftVC VITS Singing Voice Conversion
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
This repo provides a VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion
Source Code for "Adaptive Transfer Learning with Deep CNN for EEG Motor Imagery Classification".
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
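Since the Audiocraft entry above names MusicGen explicitly, here is a minimal usage sketch following the pattern shown in the Audiocraft README; the checkpoint name, prompt, and generation parameters are illustrative assumptions, not something this list prescribes.

```python
# Minimal sketch: text-to-music generation with Audiocraft's MusicGen.
# Checkpoint, prompt, and duration below are illustrative choices.
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

model = MusicGen.get_pretrained("facebook/musicgen-small")  # small pretrained checkpoint
model.set_generation_params(duration=8)                     # seconds of audio per prompt

prompts = ["lo-fi hip hop beat with warm piano"]
wav = model.generate(prompts)  # tensor of shape [batch, channels, samples]

for i, one_wav in enumerate(wav):
    # Write each generated sample as a loudness-normalized audio file.
    audio_write(f"musicgen_sample_{i}", one_wav.cpu(), model.sample_rate, strategy="loudness")
```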