Skip to content
View jishengpeng's full-sized avatar
🌴
On vacation
🌴
On vacation

Block or report jishengpeng

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Languagecodec Languagecodec Public

    [ACL 2025 Oral] Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

    Python 209 14

  2. TextrolSpeech TextrolSpeech Public

    [ICASSP 2024] TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models

    Python 179 5

  3. ControlSpeech ControlSpeech Public

    [ACL 2025 Main] ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

    Python 264 14

  4. WavTokenizer WavTokenizer Public

    [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

    Python 1.2k 102

  5. WavChat WavChat Public

    A Survey of Spoken Dialogue Models (60 pages)

    310 17

  6. WavReward WavReward Public

    WavReward: Spoken Dialogue Models With Generalist Reward Evaluators

    54