
A library for making PyTorch models streamable

Python 56 4 Updated Dec 29, 2025

AI powered speech denoising and enhancement

Python 2,128 258 Updated Dec 3, 2024

Python library to parse ANT/Garmin .FIT files

Python 806 188 Updated Jan 28, 2025

speech self-supervised representations

Python 514 39 Updated Apr 27, 2023

Code for "Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance", NeurIPS 2022

Jupyter Notebook 17 3 Updated Feb 11, 2023

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,380 2,078 Updated Oct 21, 2025

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

1,377 148 Updated Jun 6, 2024

Chinese Mandarin Grapheme-to-Phoneme Converter. Converts Chinese text to Zhuyin (Bopomofo) or Pinyin (INTERSPEECH 2022)

Python 370 47 Updated Jun 21, 2025

Tool that generates a CK3 dna file from pictures.

Python 23 5 Updated Jul 8, 2022

3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry

Jupyter Notebook 409 61 Updated Dec 11, 2024

Benchmark Arabic text diacritization dataset

Python 77 20 Updated Jul 26, 2019

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Jupyter Notebook 598 129 Updated Sep 18, 2023

Python wrapper for OpenJTalk

Cython 241 82 Updated Apr 8, 2025

Historic Tale Construction Kit - open-source and customizable version!

JavaScript 835 72 Updated Dec 6, 2023

A fast, high-quality neural vocoder.

Python 295 51 Updated Jul 18, 2023

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

Python 171 31 Updated Jul 25, 2024

Google Drive Public File Downloader when Curl/Wget Fails

Python 5,024 402 Updated Aug 12, 2025

DDSP: Differentiable Digital Signal Processing

Python 3,176 370 Updated Sep 30, 2025

Authors' implementation of DeepSpeech Distances.

Jupyter Notebook 130 12 Updated May 5, 2020

A python package to analyze and compare voices with deep learning

Python 3,195 476 Updated Oct 12, 2023

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python 1,981 485 Updated Dec 19, 2023

WaveRNN Vocoder + TTS

Python 2,178 694 Updated Jul 2, 2022

A WaveRNN implementation

Python 201 48 Updated Oct 14, 2019

Language Savant. If your repository's language is being reported incorrectly, send us a pull request!

Ruby 13,210 4,911 Updated Dec 12, 2025

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,118 9,407 Updated Dec 15, 2025

Code for "FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models"

Python 663 143 Updated Sep 22, 2020

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 24,523 5,844 Updated Aug 14, 2024

Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits

Python 23 4 Updated Sep 23, 2017

PyTorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"

Python 916 238 Updated Jan 23, 2023