Highlights
- Pro
Stars
A library for making PyTorch models streamable
AI powered speech denoising and enhancement
Python library to parse ANT/Garmin .FIT files
speech self-supervised representations
Code for "Score-based Generative Modeling Secretly Minimizes the Wasserstein Distance", NeurIPS 2022
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)
Tool that generates a CK3 dna file from pictures.
3DV 2021: Synergy between 3DMM and 3D Landmarks for Accurate 3D Facial Geometry
Benchmark Arabic text diacritization dataset
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Historic Tale Construction Kit - opensource and customizable version !
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Google Drive Public File Downloader when Curl/Wget Fails
Authors' implementation of DeepSpeech Distances.
A python package to analyze and compare voices with deep learning
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
Language Savant. If your repository's language is being reported incorrectly, send us a pull request!
Clone a voice in 5 seconds to generate arbitrary speech in real-time
code for "FFJORD: Free-form Continuous Dynamics for Scalable Reversible Generative Models".
Code for the paper "Language Models are Unsupervised Multitask Learners"
Dump the text of the Gigaword dataset into a single file, for use with language modeling (and other!) toolkits
pytorch implementation of "Get To The Point: Summarization with Pointer-Generator Networks"




