Skip to content
View haoheliu's full-sized avatar
🚩
Focusing
🚩
Focusing
  • UoSurrey, Centre for Vision, Speech and Signal Processing (CVSSP)
  • Guildford GU2 7XH Stag Hill, UK
  • 23:28 (UTC -12:00)

Block or report haoheliu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
haoheliu/README.md

Haohe's GitHub stats

Pinned Loading

  1. AudioLDM AudioLDM Public

    AudioLDM: Generate speech, sound effects, music and beyond, with text.

    Python 2.8k 249

  2. AudioLDM2 AudioLDM2 Public

    Text-to-Audio/Music Generation

    Python 2.5k 203

  3. versatile_audio_super_resolution versatile_audio_super_resolution Public

    Versatile audio super resolution (any -> 48kHz) with AudioSR.

    Python 1.6k 176

  4. SemantiCodec-inference SemantiCodec-inference Public

    Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

    Python 231 20

  5. audioldm_eval audioldm_eval Public

    This toolbox aims to unify audio generation model evaluation for easier comparison.

    Python 365 37

  6. AudioLDM-training-finetuning AudioLDM-training-finetuning Public

    AudioLDM training, finetuning, evaluation and inference.

    Python 284 56