Antonis Anastasopoulos
@anas_ant
Assist. Prof at George Mason CS #nlproc MT, ASR, and documentation of endangered languages.
Check out this cool dialect-related work at #ACL2025NLP next week, if you're in Vienna!
How can we make models understand dialectal input, even in dialects with very little data available? Our work indicates that Rule-Based Normalization can significantly help. If you're at #ACL2025, check out our poster on Monday at 6pm! aclanthology.org/2025.findings-… 1/4
Looking forward to this year's edition! With great speakers: Ryan McDonald @yulanhe Vlad Niculae @anas_ant @raquel_dmg @annargrs @preslav_nakov @mohitban47 @eunsolc @MarieMarneffe !
📢 10 Days Left to apply for the AthNLP - Athens Natural Language Processing Summer School! ✍ Get your applications in before June 15th! athnlp.github.io/2025/cfp.html
📣 Call for Participation EXTENDED to Sunday 15 June for the @AthensNLP 2025! 📍 4–10 September 2025 | NCSR Demokritos, Athens 📄 Apply & view the full program: athnlp.github.io/2025/cfp.html #NLP #AI #SummerSchool #NaturalLanguageProcessing #AthNLP2025
📢 CFP announced for the Athens #NaturalLanguageProcessing #SummerSchool! Are you passionate about #NLP and #MachineLearning? Join us in Athens 4–10 Sept @NCSR_Demokritos for talks, hands-on labs & top domain experts! 📅Apply by 30 May! athnlp.github.io/2025/cfp.html #AthNLP2025 #AI
📢 I am looking for a postdoc for the next academic year! (Due to the funding source, US persons preferred) Interested in multimodal LLMs and their application to education domains (plus multilinguality, cross-lingual, and low-resource learning)? Contact me here/email if yes!
Hi all, Do you have a reviewed ARR paper on speech translation to commit to @iwslt ? @iwslt has enabled paper commitment for fully reviewed papers from ARR for 2025. If you'd like to commit your paper, please fill out this form by May 17, 2025: forms.gle/1QtVrHXyCGoEq3…
Congratulations to @aarsri21 on winning the Best Paper Award at W-NUT at NAACL 2025! This paper applies various interventions simulating noisy text or dialectal variation to discover how different interventions have different effects. arxiv.org/abs/2404.07304
Presenting this at #WACV2025 today + oral session tomorrow. Come say hi if you are around!
🧵1/ New Paper! 🚀 Our latest work with @anas_ant and @ZzwWilliam is now on arXiv. arxiv.org/pdf/2407.02067! We look at cultural understanding in LLMs and develop an approach for automatically updating the culture represented in images. Thread 🧵⬇️
Why participate? ✨Low-resource is truly the frontier, and there are many ways to making an impact for a language community: from providing a dataset for a language that is missing one, to exploring the limits of different models and techniques.
🆕 This year includes new and more diverse language pairs (e.g. Fongbe to French, Estonian to English) in addition to continuing pairs 🆕🔊 Also new this year is a *data track* which encourages the creation of new speech translation datasets for less-supported languages
Today's task: Low-resource ST! 🎯 Goal: Building speech translation models for currently underserved, mostly low-resource languages and varieties 🗓️ This is the 5th iteration, with new and continuing language pairs (10 total!) 🔗: iwslt.org/2025/low-resou…
.@Mason_CEC professor receives an NSF CAREER Award for his work on privacy and data security. Discover more⬇️ #MasonNation gmu.edu/news/2025-01/u…
We are excited to announce the launch of ML SUPERB 2.0 (multilingual.superbbenchmark.org) as part of the Interspeech 2024 official challenge! We hope this upgraded version of ML SUPERB advances universal access to speech processing worldwide. Please join it! #Interspeech2025
The curtains are about to rise in just 2 hours! 🎉 Get ready for an exciting lineup at today's WiNLP Workshop. Stay tuned !
Stop by Jasmine #20 right now to see how @iamshnoo and @chahatsaidit explore multimodal LLMs to uncover the implicit associations they make in their BiasDora work

Epidemiological data is crucial for public health, but extracting and geotagging it from documents is challenging. Our work to be presented at #EMNLP2024’s 3rd NLP4PI Workshop, “From Text to Maps: LLM-Driven Extraction and Geotagging of Epidemiological Data”, tackles this. 🧵