Amir H. Kargaran at ACL ๐ฆ๐น
@amir_nlp
๐ค PhD student @CisLmu/ ๐ ๏ธ Multilingual NLP / Previous: Intern @huggingface
I'll be @aclmeeting in Vienna to present our recent agent papers SMART and CoALM! ๐ฆ๐น๐ค #acl2025 Feel free to stop by our posters to exchange ideas and discuss agents together!
โฐ Reminder: Conference Track Deadline is July 25! Have a paper accepted to @COLM_conf that aligns with linguistic & cultural inclusion in LLMs? Submit it to be featured at MELT 2025! ๐ openreview.net/group?id=colmwโฆ ๐ Deadline: July 25, 2025 ย #MELTWorkshop2025 #COLM2025
FineWeb2 ๐ฅ has been accepted to @COLM_conf See you in October ๐จ๐ฆ
We have finally released the ๐paper for ๐ฅFineWeb2, our large multilingual pre-training dataset. Along with general (and exhaustive) multilingual work, we introduce a concept that can also improve English performance: deduplication-based upsampling, which we call rehydration.
We have finally released the ๐paper for ๐ฅFineWeb2, our large multilingual pre-training dataset. Along with general (and exhaustive) multilingual work, we introduce a concept that can also improve English performance: deduplication-based upsampling, which we call rehydration.
๐งโ๐ป Call for Reviewers โ Melt Workshop @COLM_conf 2025 ๐ We're looking for researchers passionate about multilingual, multicultural, and inclusive NLP to join our reviewer team! ๐ Fill out the interest form: forms.gle/MYcXED7RLJDSqiโฆ #MeltWorkshop2025
Are you working on multilingual, multicultural #LLM? Interested in diverse & inclusive language modeling? ๐ Stay tuned at our MELT workshop collocated with #COLM2025 ๐ melt-workshop.github.io ๐ซถ We welcome 2p (EA), 4p (short), 8p (long) papers as well as talented reviewers!
๐ โจ Introducing Melt Workshop 2025: Multilingual, Multicultural, and Equitable Language Technologies A workshop on building inclusive, culturally-aware LLMs! ๐ง Bridging the language divide in AI ๐ October 10, 2025 | Co-located with @COLM_conf ๐ melt-workshop.github.io
Consider submitting your multilingual NLP work to the MELT workshop @ COLM 2025: melt-workshop.github.io Deadline: June 23
The full list of COLM 2025 workshops is now online! Most deadlines are June 23, but check the specific CFP of each workshop for the details
Tracing Multilingual Factual Knowledge Acquisition in Pretraining. arxiv.org/abs/2505.14824
I really wanted to see the review details. It's clearly above the acceptance threshold of findings for me. When you fall into the cycle of rejection from ARR, it's hard to come out.

I'm embarassed to admit that I have just grokked how amazing Python coroutines and asyncio are. I want to rewrite every single piece of code with threads I have every written! But the learning curve is steep. This great blog opened my eyes: tenthousandmeters.com/blog/python-beโฆ