Pooneh Mousavi

@MousaviPooneh

Montreal, Canada

Joined January 2019

489 Following

148 Followers

Pinned

Pooneh Mousavi@MousaviPooneh · Mar 29, 2021

“Ever tried. Ever failed. No matter. Try again. Fail again. Fail better.” Samuel Beckett

Pooneh Mousavi@MousaviPooneh · Jun 11

Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025) arxiv.org/abs/2505.19937 #SLU #speech #multimodal #LLM

Beomseok LEE@beomseok_lee_ · Jun 11

Speech-language models show promise in multimodal tasks—but how well are speech & text actually aligned? 🤔 This paper arxiv.org/abs/2505.19937 proposes a new metric to measure layer-wise correlation between the two, with a focus on SLU tasks. 🔍🗣️📄

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · Jun 16

📢 Join our Conversational AI Reading Group! 📅 Thursday, June 19th | 11 AM - 12 PM EST 🎙 Speaker: Yuki Mitsufuji (@mittu1204) - SonyAI 📖 Topic: "AI for Creators: Pushing Creative Abilities to the Next Level" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

arXiv Sound@ArxivSound · Jun 13

``Discrete Audio Tokens: More Than a Survey!,'' Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi, Haibin Wu, Haici Yang, Anastasia Kuznetsova, Artem Ploujnikov, Ricard Marxer, Bhuvana Ramabhadran, Benjamin Elizalde, Loren Lugosch… ift.tt/GA4ZC6u

Pooneh Mousavi Retweeted

Gallil Maimon@GallilMaimon · Jun 13

🎵💬 If you are interested in Audio Tokenisers, you should check out our new work! We empirically analysed existing tokenisers from every angle: reconstruction, downstream, LMs and more. Grab yourself a ☕/🍺 and sit down for a read!

Pooneh Mousavi Retweeted

Gallil Maimon@GallilMaimon · Jun 13

🌟🌟 Great collaboration, with a diverse all-star team led by @MousaviPooneh - check it out👇 📄Paper - arxiv.org/abs/2506.10274 🌐Website (+updating tokeniser DB!) - poonehmousavi.github.io/dates-website/

Pooneh Mousavi@MousaviPooneh · Jun 13

🚀 We're excited to announce our latest work: "Discrete Audio Tokens: More Than a Survey!" It presents a comprehensive survey and benchmark of audio tokenizers across speech, music, and general audio. preprint: arxiv.org/pdf/2506.10274 website: poonehmousavi.github.io/dates-website/

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · Jun 9

📢 Join our Conversational AI Reading Group! 📅 Thursday, June 12th | 11 AM - 12 PM EST 🎙 Speaker: Andros Tjandra 📖 Topic: "Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · May 26

📢 Join our Conversational AI Reading Group! 📅 Thursday, May 29th | 11 AM - 12 PM EST 🎙 Speaker: Yossi Adi @adiyossLC 📖 Topic: "On The Landscape of Spoken Language Models" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

Hervé "pyannote" Bredin@hbredin · May 24

Learn about speaker diarization, the science behind it, and the future of diarization at @pyannoteAI research labs youtu.be/ECqxZgVevuI?fe…

Pooneh Mousavi@MousaviPooneh · May 19

... in which I'll talk about my decade-old love for speaker diarization and the loss functions used to train underlying neural networks

Convai_rg@convAI2024 · May 19

📢 Join our Conversational AI Reading Group! 📅 Thursday, May 22nd | 11 AM - 12 PM EST 🎙 Speaker: Hervé Bredin (@hbredin) 📖 Topic: "Speaker diarization, a (love) loss story" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

Gallil Maimon@GallilMaimon · Feb 25

🗣️🧠 Speech Language Models require lots of compute to train, right? In our new paper, we test whether it is possible to train an SLM on a single A5000 GPU in 24 hours. The results may surprise you (they even surprised us)! Tips, open source resources, full paper 👇🏻

Pooneh Mousavi@MousaviPooneh · May 16

@convAI2024 Thank you for having me, and thank you to all the listeners! I had a great time 🙌 If you missed it, here's the recording and the slides! Recording: youtube.com/watch?v=REH034… Slides: poonehmousavi.github.io/assets/slides/…

Wen-Chin Huang@unilightwf · Apr 18

🚨I am honored to give an online invited talk at the Conversational AI Reading Group, MILA @convAI2024 on 5/15 11am-12pm EDT (5/16 0-1am Japan time), titled "Automatic Quality Assessment for Speech and Beyond"! Please find more info on the website: poonehmousavi.github.io/rg

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · May 12

📢 Join our Conversational AI Reading Group! 📅 Thursday, May 15th | 11 AM - 12 PM EST 🎙 Speaker: Wen-Chin Huang (@unilightwf) 📖 Topic: "Automatic Quality Assessment for Speech and Beyond" 🔗 Details: (poonehmousavi.github.io/rg), (youtube.com/@CONVAI_RG)

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · May 5

📢 Join our Conversational AI Reading Group! 📅 Thursday, May 8th | 11 AM - 12 PM EST 🎙 Speaker: Leda Sari 📖 Topic: "The Voicebox Model and Its Applications" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi@MousaviPooneh · Apr 30

We’re really excited to have Dan Povey join us for our next Conversational AI Reading Group. He is the creator of the Kaldi toolkit and author of many well-known papers. Don’t miss his talk!

Convai_rg@convAI2024 · Apr 28

📢 Join our Conversational AI Reading Group! 📅 Thursday, May 1st | 11 AM - 12 PM EST 🎙 Speaker: Daniel Povey from Xiaomi Corp. 📖 Topic: "CR-CTC: Consistency regularization on CTC for improved speech recognition" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

Wen-Chin Huang@unilightwf · Apr 18

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · Apr 21

📢 Join our Conversational AI Reading Group! 📅 Thursday, April 24th | 11 AM - 12 PM EST 🎙 Speaker: Oriol Nieto (@urinieto) from Adobe Research 📖 Topic: "GenAI for Sound Design" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · Apr 14

📢 Join our Conversational AI Reading Group! 📅 Thursday, April 17th | 11 AM - 12 PM EST 🎙 Speaker: Titouan Parcollet from Samsung AI Center Cambridge 📖 Topic: "Unsupervised on-device adaptation of a speech recogniser and the Pitfalls of "SpeechLLM" evaluation"

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · Apr 8

📢 Join our Conversational AI Reading Group! 📅 Thursday, April 10th | 11 AM - 12 PM EST 🎙 Speaker: Karen Livescu from TTIC 📖 Topic: "Toward Understanding Sign Language in the Real World" 🔗 Details: (poonehmousavi.github.io/rg)

Pooneh Mousavi Retweeted

Convai_rg@convAI2024 · Mar 31

📢 Join our Conversational AI Reading Group! 📅 Thursday, April 3rd | 11 AM - 12 PM EST 🎙 Speaker: Min Ma from Google DeepMind 📖 Topic: "Improving Multilingual Speech Recognition and Language Identification" 🔗 Details: (poonehmousavi.github.io/rg)
