Skip to content
View simonrouard's full-sized avatar

Block or report simonrouard

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A TTS that fits in your CPU (and pocket)

Python 3,406 377 Updated Feb 27, 2026

Unofficial implementation of "Simplifying, Stabilizing & Scaling Continuous-Time Consistency Models" for MNIST

Python 87 8 Updated Mar 26, 2025

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 1,406 110 Updated Apr 15, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,743 898 Updated Feb 12, 2026

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 690 93 Updated Feb 25, 2026
Python 251 11 Updated Feb 14, 2024

Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper

Python 105 8 Updated Nov 20, 2021

The PyTorch-based audio source separation toolkit for researchers

Python 2,544 446 Updated Oct 6, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,792 1,432 Updated Apr 24, 2024

Open-Unmix - Music Source Separation for PyTorch

Python 1,465 200 Updated Jun 17, 2024

Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,802 231 Updated Nov 29, 2022