Lucas Caccia
@LucasPCaccia
Sr Researcher @ MSR Montréal. PhD from MILA / McGill
RAG and in-context learning are the go-to approaches for integrating new knowledge into LLMs, but they make inference very inefficient. We propose instead 𝗞𝗻𝗼𝘄𝗹𝗲𝗱𝗴𝗲 𝗠𝗼𝗱𝘂𝗹𝗲𝘀: lightweight LoRA modules trained offline that can match RAG performance without the drawbacks
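For readers unfamiliar with the mechanics, a LoRA-style knowledge module adds a low-rank update to a frozen weight matrix. The sketch below is illustrative only (names like `alpha` and rank `r` follow common LoRA conventions, not the paper's code):

```python
import numpy as np

# Minimal sketch of a LoRA-style "knowledge module": a frozen base weight W
# plus a low-rank update B @ A that could be trained offline on a document.
# Hypothetical toy dimensions; the real module sits inside a transformer layer.
rng = np.random.default_rng(0)
d_in, d_out, r = 16, 16, 4                   # r << d keeps the module lightweight
W = rng.standard_normal((d_out, d_in))       # frozen base weight
A = rng.standard_normal((r, d_in)) * 0.01    # trainable down-projection
B = np.zeros((d_out, r))                     # trainable up-projection (zero init)
alpha = 8.0                                  # conventional LoRA scaling factor

def forward(x):
    # Base output plus the scaled low-rank knowledge update.
    return W @ x + (alpha / r) * (B @ (A @ x))
```

Because `B` starts at zero, the module is a no-op before training, and only the small `A`/`B` matrices need to be stored per document or domain.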
If you are working on merging / MoEfication of models, we wrote a survey mapping out the current research landscape. Please check it out :)
We just released our survey on "Model MoErging". But what is MoErging? 🤔 Read on! Imagine a world where fine-tuned models, each specialized in a specific domain, can collaborate and "compose/remix" their skills using some routing mechanism to tackle new tasks and queries! 🧵👇…
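The core idea can be sketched in a few lines: a router weights the outputs of several specialist experts per query. This is a generic illustration (toy linear experts, a softmax router over made-up key embeddings), not any specific MoErging method from the survey:

```python
import numpy as np

# Toy sketch of the MoErging idea: specialist "experts" (here, linear maps)
# are combined per-query by a router. All names are illustrative.
rng = np.random.default_rng(3)
d, n_experts = 8, 3
experts = [rng.standard_normal((d, d)) for _ in range(n_experts)]
expert_keys = rng.standard_normal((n_experts, d))   # hypothetical routing embeddings

def softmax(z):
    z = z - z.max()           # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def moerge(x):
    # Route: weight each specialist by how well its key matches the query,
    # then mix the experts' outputs with those weights.
    w = softmax(expert_keys @ x)
    return sum(wi * (E @ x) for wi, E in zip(w, experts))
```

The survey's taxonomy covers many ways to obtain the routing signal (learned, unsupervised, zero-shot); the softmax-over-keys router above is just the simplest stand-in.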
Great work led by our past intern Samin. TL;DR sparse masks are a great PEFT method + they merge well!
Happy to share our paper "Exploring Sparse Adapters for Scalable Merging of Parameter-Efficient Experts" has been accepted at #COLM 2025!
- paper: arxiv.org/abs/2507.07140
- authors: @zhansu9 @kim__minseon Oleksiy @OhibRiyasat @TheEsraaSaleh Doina Precup @LucasPCaccia @murefil
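To make the "sparse masks merge well" intuition concrete, here is a hedged toy sketch: each expert trains a sparse delta over a shared base weight, and merging averages the deltas wherever their supports overlap. This is an illustration of the general idea, not the paper's implementation:

```python
import numpy as np

# Toy sketch: experts as sparse deltas over a shared base weight.
rng = np.random.default_rng(1)
base = rng.standard_normal((8, 8))   # shared base weight

def sparse_delta(density=0.1):
    # A sparse update: most entries are exactly zero (the mask).
    mask = rng.random((8, 8)) < density
    return rng.standard_normal((8, 8)) * mask

deltas = [sparse_delta() for _ in range(3)]

# Per entry, count how many experts touched it, then average overlapping
# updates so no single expert dominates shared coordinates.
counts = sum((d != 0).astype(float) for d in deltas)
merged = base + sum(deltas) / np.maximum(counts, 1.0)
```

Because the masks are sparse, different experts mostly update disjoint coordinates, so merging causes little interference, which is one plausible reading of why sparse adapters compose well.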
CFP of the Wordplay 2025 (EMNLP) is live! wordplay-workshop.github.io
Announcing the 5th Wordplay Workshop at EMNLP 2025 (Suzhou, China). We are co-organizing the CPDC Challenge (total prize value USD 20K!!!), the warm-up round is starting now! wordplay-workshop.github.io
If you are looking to explore LLMs for debugging, please check this out!
Developers spend a lot of time debugging code. Learn how debug-gym can equip AI agents to help, enabling them to set breakpoints, navigate the codebase, and print runtime variable values on demand, so they better understand the code and its execution flow: msft.it/6017qF6RT
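The primitives described (set a breakpoint, inspect runtime variables) can be sketched with Python's standard-library `bdb` debugger framework. This is not the debug-gym API, only a minimal illustration of the kind of capability an agent gets:

```python
import bdb

class VarWatcher(bdb.Bdb):
    """Records the value of one variable each time a line in a target
    function is about to execute - a toy stand-in for an agent that sets
    a breakpoint and prints a runtime variable on demand."""

    def __init__(self, func_name, var):
        super().__init__()
        self.func_name, self.var = func_name, var
        self.values = []

    def user_line(self, frame):
        # Called before each traced line; snapshot the watched local.
        if frame.f_code.co_name == self.func_name and self.var in frame.f_locals:
            self.values.append(frame.f_locals[self.var])

def buggy_sum(xs):
    total = 0
    for x in xs:
        total += x      # an agent might watch `total` evolve here
    return total

watcher = VarWatcher("buggy_sum", "total")
result = watcher.runcall(buggy_sum, [1, 2, 3])
```

`watcher.values` then holds the successive runtime values of `total`, which is exactly the kind of execution-flow evidence a debugging agent can reason over.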
We are looking for interns to work on LLM modularization, please consider applying 🚀
We have a few intern positions open in our ML team @ MSR Montreal, come work with @Cote_Marc @kim__minseon @LucasPCaccia @mathe_per @ericxyuan on reasoning, interactive envs/coding and LLM modularization.. 🤯 @mathe_per and I will also be at #NeurIPS2024 so we can chat about this…
If you're interested in MoErging methods, here's an easy tutorial to get you started!
Explore zero-shot routing of parameter-efficient experts with Phatgoose arxiv.org/abs/2402.05859 and Arrow arxiv.org/abs/2405.11157 w. github.com/microsoft/mttl 👉 github.com/sordonia/pg_mb… Part of "Dynamic Sparsity in ML" tuto #neurips2024, join for discussions! 😊 thx @zhansu9
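Arrow-style routing can be sketched compactly: each LoRA expert's update B @ A is summarized by its top right singular vector (its "arrow"), and a token is routed to the experts whose arrow best aligns with its hidden state. The code below is an illustrative approximation under that description, not the mttl implementation:

```python
import numpy as np

# Hedged sketch of Arrow-style zero-shot routing over a library of LoRAs.
rng = np.random.default_rng(2)
d, r, n_experts, top_k = 32, 4, 5, 2
# Each expert is a (B, A) LoRA pair; illustrative random weights.
experts = [(rng.standard_normal((d, r)), rng.standard_normal((r, d)))
           for _ in range(n_experts)]

def arrow(B, A):
    # Top right singular vector of the low-rank update B @ A.
    _, _, Vt = np.linalg.svd(B @ A)
    return Vt[0]

arrows = np.stack([arrow(B, A) for B, A in experts])

def route(x, k=top_k):
    # Score by |x . arrow|; the singular vector's sign is arbitrary,
    # so take the absolute value, then keep the top-k experts.
    scores = np.abs(arrows @ x)
    return np.argsort(scores)[::-1][:k]
```

No router training is needed: the routing signal is computed purely from the experts' own weights, which is what makes the approach zero-shot.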
I'm on the job market! Please reach out if you are looking to hire someone to work on
- RLHF
- Efficiency
- MoE/Modular models
- Synthetic Data
- Test time compute
- other phases of pre/post-training.
If you are not hiring then I would appreciate a retweet! More details👇
We are hiring a Senior Researcher in Montréal! Please consider applying :) More info below
The ML team at @MSFTResearch Montréal 🍁 is hiring a Senior Researcher with a background in ML / NLP!!! Come work with us at the intersection of interactivity, modularity and reasoning in foundation models 😊 MSR is a highly collaborative environment where risky ideas are…
Great opportunity for potential students!
Come study with us at Mila! I will be looking for new students to work with. Our current projects explore continual learning, modularity, scrutability, algorithm discovery, AI for law (reasoning), invariances, and decision-making...
We have a Principal ML Engineer role opening at MSR Montreal. Come and do research with us :) jobs.careers.microsoft.com/global/en/job/…
Presenting this today at 1:30 Vienna time!
[3/3] Towards Modular LLMs by Building and Reusing a Library of LoRAs @LucasPCaccia x.com/_akhaliq/statu…
Made it to Vienna for ICML. Please reach out if you wanna chat!