Tom Labiausse (@tom_labiausse)

Pinned

T

Tom Labiausse@tom_labiausse · May 23

Unmute is our new cascaded voice assistant: fast, accurate, and flexible. It doesn't have the full-duplex and zero latency of Moshi, but you can change the voice with a 10s sample and plug any LLM. A good playground for testing custom voice AIs.

kkyutai@kyutai_labs · May 23

Talk to unmute.sh 🔊, the most modular voice AI around. Empower any text LLM with voice, instantly, by wrapping it with our new speech-to-text and text-to-speech. Any personality, any voice. Interruptible, smart turn-taking. We’ll open-source everything within the…

2

8

65

21

5.0K

Pinned

Tom Labiausse Retweeted

k

kyutai@kyutai_labs · May 23

Talk to unmute.sh 🔊, the most modular voice AI around. Empower any text LLM with voice, instantly, by wrapping it with our new speech-to-text and text-to-speech. Any personality, any voice. Interruptible, smart turn-taking. We’ll open-source everything within the…

115

265

2.0K

260.0K

T

Tom Labiausse@tom_labiausse · Jul 11

I’m happy to share that I’ll be attending ICML 2025 in Vancouver next week to present 𝐇𝐢𝐛𝐢𝐤𝐢 [github.com/kyutai-labs/hi…] 🇫🇷🇬🇧 — Kyutai’s real-time and expressive speech translation system. I'll be presenting the poster on Wednesday, July 16 at 4:30PM, feel free to stop by! 💬

tom_labiausse's tweet image. I’m happy to share that I’ll be attending ICML 2025 in Vancouver next week to present 𝐇𝐢𝐛𝐢𝐤𝐢 [github.com/kyutai-labs/hi…] 🇫🇷🇬🇧 — Kyutai’s real-time and expressive speech translation system. I'll be presenting the poster on Wednesday, July 16 at 4:30PM, feel free to stop by! 💬

2

8

55

10

4.0K

Tom Labiausse Retweeted

k

kyutai@kyutai_labs · Jul 3

Kyutai TTS and Unmute are now open source! The text-to-speech is natural, customizable, and fast: it can serve 32 users with a 350ms latency on a single L40S. Try it out and get started on the project page: kyutai.org/next/tts

50

176

1.0K

154.0K

Tom Labiausse Retweeted

k

kyutai@kyutai_labs · Jun 19

Kyutai Speech-To-Text is now open-source! It’s streaming, supports batched inference, and runs blazingly fast: perfect for interactive applications. Check out the details here: kyutai.org/next/stt

32

116

617

421

60.0K