Stars
A Conversational Speech Generation Model
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch
Tensors and dynamic neural networks in Python with strong GPU acceleration
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

