Puyuan Peng jasonppy

🍗

302 followers · 27 following

jasonppy.github.io Public

HTML 1 Updated Oct 15, 2025
VoiceStar Public

VoiceStar: Robust, Duration-controllable TTS that can Extrapolate

Python 306 27 MIT License Updated May 31, 2025
VoiceStar_web Public

HTML Updated May 26, 2025
VoiceCraft Public

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 8,454 799 Other Updated Mar 15, 2025
VoiceCraft_web Public

JavaScript 4 Updated Jun 14, 2024
PromptingWhisper Public

Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Python 151 13 Updated Jan 16, 2024
word-discovery Public

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Jupyter Notebook 26 8 BSD 3-Clause "New" or "Revised" License Updated Dec 4, 2023
syllable-discovery Public

Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model

Python 34 6 BSD 3-Clause "New" or "Revised" License Updated Aug 27, 2023
FaST-VGS-Family Public

Transformer-based visually grounded speech models

Python 19 2 BSD 3-Clause "New" or "Revised" License Updated Sep 22, 2022
vqwordseg Public
Forked from kamperh/vqwordseg

Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.

Jupyter Notebook MIT License Updated Jun 19, 2022
MAE-AST-Public Public
Forked from AlanBaade/MAE-AST-Public

Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer

Python Updated Jun 9, 2022
moment_detr Public
Forked from jayleicn/moment_detr

[NeurIPS 2021] Moment-DETR code and QVHighlights dataset

Python 1 MIT License Updated May 20, 2022
HERO_Video_Feature_Extractor Public
Forked from linjieli222/HERO_Video_Feature_Extractor

Video Feature Extraction Code for EMNLP 2020 paper "HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training"

Python MIT License Updated Apr 12, 2022
zerospeech2021_baseline Public
Forked from kamperh/zerospeech2021_baseline

BERT and LSTM baseline models of the ZeroSpeech Challenge 2021

Python Updated Feb 22, 2022
yt-dl Public
Forked from DavidXu9000/yt-dl

Python Updated Jan 20, 2022
academicpages Public template

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript MIT License Updated Apr 6, 2021
cs61bSpring2018 Public

Java 1 Updated Aug 9, 2019
para-nmt-50m Public
Forked from jwieting/para-nmt-50m

Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions of Machine Translations"

Python Updated Nov 30, 2017
