Skip to content
@kyutai-labs

kyutai

Kyutai - Open Science AI Lab

Popular repositories Loading

  1. moshi moshi Public

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

    Python 9.5k 864

  2. pocket-tts pocket-tts Public

    A TTS that fits in your CPU (and pocket)

    Python 2.9k 312

  3. delayed-streams-modeling delayed-streams-modeling Public

    Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

    Python 2.8k 292

  4. hibiki hibiki Public

    Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

    Rust 1.4k 108

  5. unmute unmute Public

    Make text LLMs listen and speak

    Python 1.2k 193

  6. moshi-finetune moshi-finetune Public

    Python 368 53

Repositories

Showing 10 of 25 repositories

Top languages

Loading…

Most used topics

Loading…