Desh Raj
@rdesh26
Research Scientist @Meta GenAI | Previously: @jhuclsp, @IITGuwahati
**Dissertation now available** 📜: arxiv.org/abs/2402.08932 📽️: desh2608.github.io/static/ppt/the… ⏯️: youtube.com/watch?v=iKnCUH… It's a 332-page tome, but I have summarized it in this thread 👇 1/n
``Listening to Multi-talker Conversations: Modular and End-to-end Perspectives,'' Desh Raj, ift.tt/iHVD9Nk
New York City is living in the future!!! Look at the Mayor announcing the city will now use trash bins instead of dumping their trash on the streets like barbarians.
📢 Introducing VERSA: our new open-source toolkit for speech & audio evaluation! - 80+ metrics in one unified interface - Flexible input support - Distributed evaluation with Slurm - ESPnet compatible Check out the details wavlab.org/activities/202… github.com/wavlab-speech/…
📣 You can now have a conversation with Meta AI using voice. It’s super fast, connected to the web, natural and conversational and even comes with celebrity voice options from Awkwafina, Kristen Bell, John Cena, and more. What voice speaks to you? (pun intended 😆)
``M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses,'' Yufeng Yang, Desh Raj, Ju Lin, Niko Moritz, Junteng Jia, Gil Keren, Egor Lakomkin, Yiteng Huang, Jacob Donley, Jay Mahadeokar, Ozlem Kalinli, ift.tt/EsxaHFL
The #ICASSP2025 paper submission deadline is 9 September 2024. No new submissions will be accepted after this deadline. However, you may update the submitted paper until September 12. The title, author list and EDICS of the submitted paper cannot be changed.
Every once in a while I find a thread on this website which makes all the spam bearable. Couldn't help sharing!
Y'all. Seriously. Hardly anyone even understands the unbelievable depth of Tolkien's linguistic genius. If ever there was an artist just straight up making jokes that only he would ever get, omg. So get this, it's epic. You know the "Brandywine River" in Lord of the Rings?
I keep seeing more and more spam posts on this app. Trying out Threads to see how that's like. Are any of y'all on it?
Huge congrats to @AIatMeta on the Llama 3.1 release! Few notes: Today, with the 405B model release, is the first time that a frontier-capability LLM is available to everyone to work with and build on. The model appears to be GPT-4 / Claude 3.5 Sonnet grade and the weights are…
Starting today, open source is leading the way. Introducing Llama 3.1: Our most capable models yet. Today we’re releasing a collection of new Llama 3.1 models including our long awaited 405B. These models deliver improved reasoning capabilities, a larger 128K token context…
I am 46 years old. Never lived in a state for 15+ yrs My father worked in the Indian Navy. Got posted all over the country. His kids don't deserve jobs in Karnataka? I build companies. Have created 25000+ jobs across India! My kids dont deserve jobs in their home city? Shame.
How to not be a gym douchebag: if there are only 2 benches available, don't occupy both of them for your "RDL setup".
I think the most immigrant South Bay trait is counting you and all your friends net worths, cribbing about who has the bigger house, saying you could move to Virginia and get a mansion while having no real intent to do so
🚨 🔔 Paper alert in Efficient Streaming ASR 🔔🚨 I'm super excited to release our paper “XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models” abs: arxiv.org/abs/2407.04439 [1/n]
black on (esp elderly) asian violence has exploded in oakland, and it makes me fucking enraged. it’s like this is my grandma here, and this young man is taking her stuff and punching her in the face if asians complaining about this is racist, where are the black voices…
Meeting up with Boston #speech folks over BBQ and beer 🍻 (minus @JonathanLeRoux)

Our paper about Canary has been accepted to @ISCAInterspeech and is now available on Arxiv. Check it out to learn how we trained one of the most powerful open-source ASR and speech translation models! arxiv.org/abs/2406.19674