kyutai

Kyutai - Open Science AI Lab

Popular repositories Loading

moshi moshi Public

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9.5k 864
pocket-tts pocket-tts Public

A TTS that fits in your CPU (and pocket)

Python 2.9k 312
delayed-streams-modeling delayed-streams-modeling Public

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2.8k 292
hibiki hibiki Public

Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…

Rust 1.4k 108
unmute unmute Public

Make text LLMs listen and speak

Python 1.2k 193
moshi-finetune moshi-finetune Public

Python 368 53

Repositories

pocket-tts Public
A TTS that fits in your CPU (and pocket)

kyutai-labs/pocket-tts’s past year of commit activity

Python 2,858 MIT 312 26 (9 issues need help) 9 Updated Jan 30, 2026
delayed-streams-modeling Public
Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

kyutai-labs/delayed-streams-modeling’s past year of commit activity

Python 2,824 Apache-2.0 292 36 0 Updated Jan 26, 2026
invincible-voice Public
To bring back voice to those who lost it

kyutai-labs/invincible-voice’s past year of commit activity

TypeScript 27 MIT 2 5 (4 issues need help) 1 Updated Jan 25, 2026
unmute Public
Make text LLMs listen and speak

kyutai-labs/unmute’s past year of commit activity

Python 1,153 MIT 193 27 (3 issues need help) 0 Updated Jan 23, 2026
dora Public
Dora is an experiment management framework. It expresses grid searches as pure python files as part of your repo. It identifies experiments with a unique hash signature. Scale up to hundreds of experiments without losing your sanity.

kyutai-labs/dora’s past year of commit activity

Python 5 MIT 0 0 0 Updated Jan 22, 2026
tts_longeval Public

kyutai-labs/tts_longeval’s past year of commit activity

Python 29 MIT 2 0 0 Updated Jan 22, 2026
moshi Public
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

kyutai-labs/moshi’s past year of commit activity

Python 9,525 Apache-2.0 864 62 14 Updated Jan 19, 2026
sphn Public
python bindings for symphonia/opus - read various audio formats from python and write opus files

kyutai-labs/sphn’s past year of commit activity

Rust 77 Apache-2.0 7 1 0 Updated Jan 7, 2026
ARC-Encoder Public

kyutai-labs/ARC-Encoder’s past year of commit activity

Python 26 Apache-2.0 3 0 0 Updated Jan 5, 2026
jax-flash-attn3 Public
JAX bindings for the flash-attention3 kernels

kyutai-labs/jax-flash-attn3’s past year of commit activity

C++ 20 3 0 1 Updated Jan 2, 2026

View all repositories

People

Top languages

Loading…

Most used topics

Loading…

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kyutai

Popular repositories Loading

Repositories

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

People

Top languages

Uh oh!

Most used topics

Uh oh!