Varun Gumma
@VarunGumma23
SCAI Center Fellow @MSFTResearch | Past: @iitmadras (@ai4bharat), @bitshyd | Interests: Multilinguality, Machine Translation, Efficient Methods, LLM Evaluation.
Peeps stop using IndicTrans2. Use the Rotary IndicTrans2 which supports longer context, and it just got better for inference. Or use Sarvam-translate (IndicTrans3) which can translate academic papers and documents extremely well.
[1/N] Thanks to @Adalat_AI , the IndicTrans2-RoPE models are now CT2 compatible. Big shoutout to their entire team for finding our work interesting and porting them, which definitely wasn't easy (huggingface.co/collections/ad…)
A friend of mine at Adalat AI is looking for a 3-month research intern to work on legal benchmarking—running experiments across LLMs, analyzing results, and co-authoring a paper targeting an A* conference. DM me or @orgho12 on Twitter if you're interested.
Thank you Sunayana and @kalikabali for inviting me to the NLP lunch! Good food and even better conversations with the NLP crew at @MSFTResearch India!
🚨 I'm releasing Notebook 0 a week earlier than planned! 🚨 This is the soft intro to our modern Text-to-Speech (TTS) tutorial series. It’s designed to ground you in signal processing concepts & why they matter for speech! 👉 colab.research.google.com/drive/15NQ10Lu…
✈️I will be @iclr_conf in Singapore🇸🇬 next week to present our work on attributing the cultural knowledge of a LLM to its memorization or generalization of it's pre-training corpora. Looking forward to chatting with people 🙂 #ICLR2025 📜: arxiv.org/abs/2412.20760
Request for participation from people from Kerala and Tamil Nadu in this super interesting study ⬇️
🎯 𝐂𝐚𝐥𝐥 𝐟𝐨𝐫 𝐏𝐚𝐫𝐭𝐢𝐜𝐢𝐩𝐚𝐧𝐭𝐬 (𝐊𝐞𝐫𝐚𝐥𝐚 𝐨𝐫 𝐓𝐚𝐦𝐢𝐥 𝐍𝐚𝐝𝐮 𝐨𝐧𝐥𝐲) 𝐌𝐢𝐜𝐫𝐨𝐬𝐨𝐟𝐭 𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡 𝐈𝐧𝐝𝐢𝐚 is inviting community members from Kerala or Tamil Nadu to take part in a user study to evaluate AI-generated images of cultural artifacts.
🎯 𝐂𝐚𝐥𝐥 𝐟𝐨𝐫 𝐏𝐚𝐫𝐭𝐢𝐜𝐢𝐩𝐚𝐧𝐭𝐬 (𝐊𝐞𝐫𝐚𝐥𝐚 𝐨𝐫 𝐓𝐚𝐦𝐢𝐥 𝐍𝐚𝐝𝐮 𝐨𝐧𝐥𝐲) 𝐌𝐢𝐜𝐫𝐨𝐬𝐨𝐟𝐭 𝐑𝐞𝐬𝐞𝐚𝐫𝐜𝐡 𝐈𝐧𝐝𝐢𝐚 is inviting community members from Kerala or Tamil Nadu to take part in a user study to evaluate AI-generated images of cultural artifacts.
Our team at Microsoft Research India is looking for a Research Intern for a 6 month position. The position will be on-site in BLR. You will get to work on multilingual data, modelling and evals. Please DM me with a short blurb about yourself and your CV/Resume.
TFW people with multiple A* papers are struggling to get PhD admits in the USA.
Microsoft Research India is excited to announce applications are open for our Research Fellow program (deadline 15th Feb 2025). Details of the program and the application are here: 🔗 Research Fellow program: aka.ms/msrirf @MSFTResearch
MSR India is accepting applications for the 2025 Research Fellow program
Microsoft Research India is excited to announce applications are open for our Research Fellow program (deadline 15th Feb 2025). Details of the program and the application are here: 🔗 Research Fellow program: aka.ms/msrirf @MSFTResearch
Overall, this paper was responsible for a major shift in my research work because, let's face it, compact models FTW. It greatly benefitted @VarunGumma23's master thesis and prolly will be his PhD focus. If you have not read this paper, then please do. It's a life changer and…
After consulting @VarunGumma23 and @pranjalchitale, in my free time, I have replicated their work, and I am releasing the 4 IndicTrans2 models which use RoPE instead of learned PEs. This effectively enables IndicTrans2 models to handle long context. You can now process longer…
A v2 of our work on Interchangeability of Positional Embeddings (PEs) for NMT System is out: arxiv.org/abs/2408.11382. In this paper, we mainly pitch towards inducing Document-Level abilities in NMT systems and show that you can efficiently switch out PEs to achieve this.
The #EMNLP2024 social event feels like a disaster. Terrible crowd management! I'm hungry! Feed me!
Gen AI models can generate compelling stories, but do they truly reflect all cultures? 🎉Excited to share our work, ‘Kahani’—a visual storytelling pipeline designed to create culturally grounded visual stories for non-Western cultures! 1/6