Baharan Mirzasoleiman
@baharanm
Assistant Professor @UCLAComSci. Better ML via better data; machine learning; optimization
We’re thrilled by the amazing response to our #ICML2024 tutorial on “Foundations of data-efficient learning”! Over 1000 attendees joined us. Thank you all! 🙌🌱🌱🌱 ➡️ Slides: baharanm.github.io/ICML24_tutoria… ➡️ Recording: will be available on Aug 22 🎊🎊
I'll be giving a 2-hour tutorial on data-efficient learning with my PhD student @sjoshi804 on Monday July 22 at #ICML2024. Join us to learn more about this cool topic! ➡️ We can learn better from better data! ⬅️🙌🌱
The Adversarial Machine Learning Rising Star Awards deadline is in two weeks! Submit your application and help us promote your work and research vision! @trustworthy_ml @LLMSecurity @ml_safety @safe_paper
🚩(1/2) Please help forward the Call for the 2024 Adversarial Machine Learning (AdvML) Rising Star Awards! We promote junior researchers in AI safety, robustness, and security. Award events are hosted at the AdvML'Frontiers workshop @NeurIPSConf 2024. Info: sites.google.com/view/advml/adv…
Can weak LLMs supervise strong LLMs to obtain superior performance? 🤔 Yes!! 🤩 Which weak models are better supervisors? 🤔 Check out @xue_yihao65785’s awesome #ICML2025 paper to learn how to identify the best weak supervisors without having to collect labels! 🎉🌱
🎉 Our paper “Representations Shape Weak-to-Strong Generalization” is accepted at #ICML2025! We study weak-to-strong generalization (W2SG)—a core problem in superalignment—and offer new insights into the role of models' internal representations in W2SG. 1/
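For readers new to the setup, here is a toy sketch of the W2SG protocol on synthetic data; the sklearn models and the restricted feature view standing in for the "weak" supervisor are illustrative assumptions, not the paper's setup:

```python
# Weak-to-strong generalization (W2SG), minimal toy version: a weak model is
# trained on ground truth, its noisy predictions then supervise a stronger
# model, and we check whether the student surpasses its teacher.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = make_classification(n_samples=6000, n_features=40, n_informative=30, random_state=0)
X_weak, X_rest, y_weak, y_rest = train_test_split(X, y, train_size=1000, random_state=0)
X_transfer, X_test, y_transfer, y_test = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

# Weak supervisor: a simple model with a deliberately restricted feature view,
# trained on limited ground-truth labels.
weak = LogisticRegression(max_iter=1000).fit(X_weak[:, :5], y_weak)
weak_labels = weak.predict(X_transfer[:, :5])

# Strong student: higher capacity, sees all features, but learns only from
# the weak model's labels (no ground truth).
strong = MLPClassifier(hidden_layer_sizes=(128, 64), max_iter=300, random_state=0)
strong.fit(X_transfer, weak_labels)

print("weak teacher acc  :", weak.score(X_test[:, :5], y_test))
print("strong student acc:", strong.score(X_test, y_test))
```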
🚨 Join us at the Workshop on Spurious Correlation & Shortcut Learning (SCSL) at #ICLR2025! @iclr_conf 🗓️ April 28, 2025 📍 Garnet 214-215, Singapore EXPO 🌐 More info: scslworkshop.github.io #ICLR2025
Can we pretrain deep models with small synthetic data? Dataset Distillation via Knowledge Distillation is the way to go! Check out @sjoshi804’s #ICLR2025 paper this Saturday April 26 at 9am, Poster #307 🎉🌱
#ICLR2025 Can you pre-train deep models with small, synthetic datasets? 🤯 We introduce the first effective dataset distillation method for self-supervised learning (SSL) — boosting downstream accuracy by up to 13% over baselines. 🧪 Poster #307, Sat Apr 26, 9am
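For a sense of how a knowledge-distillation objective can drive dataset distillation, here is a simplified one-step-unrolled sketch in PyTorch; the tiny architectures, the MSE-to-teacher loss, and the single inner update are assumptions for illustration, not the paper's algorithm:

```python
# Toy dataset distillation with a knowledge-distillation (KD) objective:
# learn a tiny synthetic set such that a student trained on it matches a
# frozen teacher's representations on real data.
import torch
import torch.nn as nn

torch.manual_seed(0)
D_IN, D_REP, N_SYN, N_REAL = 32, 16, 10, 256

teacher = nn.Sequential(nn.Linear(D_IN, 64), nn.ReLU(), nn.Linear(64, D_REP))
for p in teacher.parameters():
    p.requires_grad_(False)  # stands in for a frozen SSL-pretrained teacher

real = torch.randn(N_REAL, D_IN)                      # stand-in for real data
syn = torch.randn(N_SYN, D_IN, requires_grad=True)    # learnable synthetic set
opt_syn = torch.optim.Adam([syn], lr=1e-2)

for step in range(200):
    student = nn.Linear(D_IN, D_REP)                  # fresh student each meta-step
    # Inner step: one gradient update of the student on the KD loss over syn.
    inner_loss = ((student(syn) - teacher(syn)) ** 2).mean()
    grads = torch.autograd.grad(inner_loss, list(student.parameters()), create_graph=True)
    lr_inner = 0.1
    w = student.weight - lr_inner * grads[0]
    b = student.bias - lr_inner * grads[1]
    # Outer step: the updated student should match the teacher on real data;
    # the gradient flows back into the synthetic examples.
    outer_loss = ((real @ w.T + b - teacher(real)) ** 2).mean()
    opt_syn.zero_grad()
    outer_loss.backward()
    opt_syn.step()
```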
Big congrats @YuYang_i on your graduation!! 🎉🎉🎉 A very nice PhD thesis with great contributions 🌱 I’m proud of all you’ve done, and I wish you the best! 💝
Sharing a little late update (before it’s no longer news): I wrapped up my PhD at the end of last year and recently joined @OpenAI’s reasoning team 🍓✨!
(2/2) Not at UCLA but interested in this work? Check arxiv.org/abs/2502.02407. Thanks to our fantastic intern @unregularized (soon to return full-time!) for leading this project, along with collaborators @ynd and Atish. Thanks to my UCLA host @baharanm for the seminar invitation!
We are delighted that our proposal for the Workshop on “Spurious Correlation and Shortcut Learning: Foundations and Solutions” has been accepted at @iclr_conf 2025. The workshop will host many brilliant keynote speakers and panelists. Stay tuned: scslworkshop.github.io @SCSLWorkshop 1/
At NeurIPS? Check out the 2nd workshop on Attributing Model Behavior at Scale (ATTRIB)! Meeting Rm 205-207, starting @ 9am - amazing talks from @SurbhiGoel_ @sanmikoyejo @baharanm, Robert Geirhos, and @coallaoh + exciting contributed talks! More info: attrib-workshop.cc
I’ll help present our #NeurIPS2024 posters tomorrow (Friday): 🌱 1) Changing the training data distribution to improve in-distribution performance (11am @ West #7106) w. @dangnth97 2) Data selection for fine-tuning LLMs with superior performance (16:30 @ West #5401) w. @YUYANG_UCLA
Attending NeurIPS'24? Please mark your calendar for our special event "SFU@NeurIPS 2024" sites.google.com/view/sfu-at-ne… 9 speakers from both academia & industry! Only a 10-min walk from the convention center! Let’s enjoy exciting talks and open discussions!
Does the same training and test distribution yield optimal in-distribution performance? @dangnth97 showed in his #NeurIPS2024 paper that this is not true when training with gradient methods!! 😮🙃 Changing the training data distribution yields SOTA! 🎊 Check it out Fri Dec 13, 11am, PS#5
Smaller high-quality subsets of language data not only improve LLMs’ training efficiency, but also yield considerably better performance! 🙌🎉🌱 @YUYANG_UCLA has a theoretically rigorous method for this in her #NeurIPS2024 paper! Check it out on Fri, Dec 13, 16:30, PS #6
1/ I'll be at #NeurIPS2024 presenting our work SmallToLarge (S2L): Data-efficient Fine-tuning of LLMs! 🚀 What’s S2L? It’s a scalable data selection method that trains a small proxy model to guide fine-tuning for larger models, reducing costs while preserving performance. 👇
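A minimal sketch of the proxy-guided selection idea: log each example's loss trajectory while training the small proxy model, cluster the trajectories, and sample evenly across clusters; the random stand-in trajectories and the specific clustering settings below are placeholders, not the paper's configuration:

```python
# S2L-style selection sketch: examples whose loss trajectories on a small
# proxy model look alike are grouped, and the fine-tuning subset is drawn
# evenly across groups so all learning behaviors are covered.
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)
n_examples, n_checkpoints, budget, k = 10_000, 8, 1_000, 50

# trajectories[i, t] = loss of example i at proxy-training checkpoint t.
# Random data stands in for losses actually logged from the proxy model.
trajectories = rng.random((n_examples, n_checkpoints))

labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(trajectories)

selected = []
per_cluster = budget // k
for c in range(k):
    members = np.flatnonzero(labels == c)
    take = min(per_cluster, len(members))
    selected.extend(rng.choice(members, size=take, replace=False))
selected = np.array(selected)
print(f"selected {len(selected)} examples to fine-tune the large model on")
```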
Assist. Prof. Baharan Mirzasoleiman @baharanm of @UCLAComSci & her large-scale machine learning research group @UCLA is part of the new @NSF-@SimonsFdn Institute for Cosmic Origins at @UTAustin that aims to use AI to research the mysteries of the cosmos. cns.utexas.edu/news/announcem…
📢 @UCLAComSci is hiring! Open to all CS areas! - Multiple Tenure-track Assistant Professor Positions: recruit.apo.ucla.edu/JPF09799 - Open Rank Teaching Professor Position: recruit.apo.ucla.edu/JPF09800 (We hired 11 Assistant Professors in the past two years ...)
I’ll also present “SafeClip” on behalf of @WenhanYang0315 tomorrow at 1:30pm (poster session 6) #814. See you there! 🙌
CLIP is highly sensitive to data poisoning and backdoor attacks. In this #ICML2024 paper, @WenhanYang0315 proposed an interesting way to pretrain CLIP that is robust to such attacks without compromising performance! 🌱🌱 🔗arxiv.org/pdf/2310.05862 Thu, July 25, Poster session 6, #814
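As a loose illustration of one ingredient such a defense could use (a simplified guess at a filtering step, not necessarily the paper's exact procedure): score image-text pairs by cosine similarity under the current encoders and restrict the CLIP loss to the most-aligned pairs:

```python
# Poisoned/backdoored pairs tend to have mismatched image-text semantics, so
# one defensive ingredient is to apply the contrastive (CLIP) loss only to
# "safe" pairs whose embeddings already align well. Simplified sketch.
import torch
import torch.nn.functional as F

def split_safe_pairs(img_emb: torch.Tensor, txt_emb: torch.Tensor, safe_frac: float = 0.7):
    """Return indices of the top safe_frac pairs by image-text cosine similarity."""
    img = F.normalize(img_emb, dim=-1)
    txt = F.normalize(txt_emb, dim=-1)
    sim = (img * txt).sum(dim=-1)   # per-pair cosine similarity
    k = int(safe_frac * len(sim))
    return sim.topk(k).indices      # likely-clean pairs

# Example with random embeddings standing in for encoder outputs:
img_emb, txt_emb = torch.randn(512, 256), torch.randn(512, 256)
safe_idx = split_safe_pairs(img_emb, txt_emb)
# The CLIP contrastive loss would then be computed on img_emb[safe_idx] and
# txt_emb[safe_idx] only, with the remaining pairs handled more cautiously
# (e.g., by a unimodal objective).
```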
I’ll present “MixPro” on behalf of @xue_yihao65785 tomorrow at 11:30 (poster session 5) poster #800. Come check it out 🙌
ML models are sensitive to distribution shift. Can we adapt a model with only a few examples from the target domain? In this #ICML2024 paper, @xue_yihao65785 proposes an effective way, with nice theoretical analysis🌱 🔗arxiv.org/pdf/2305.14521 Thu, July 25, Poster session 5, #800
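For context on the setting itself (explicitly not MixPro's method), here is a generic sketch of few-shot adaptation: a probe trained on abundant source data is refit with a handful of up-weighted target-domain examples:

```python
# Few-shot adaptation setting, toy version: the target domain has a shifted
# labeling rule, and only 16 labeled target examples are available. A naive
# baseline up-weights those few examples when refitting the probe.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
d = 10
w_src, w_tgt = rng.normal(size=d), rng.normal(size=d)    # shifted labeling rules
src_X = rng.normal(size=(5000, d)); src_y = (src_X @ w_src > 0).astype(int)
few_X = rng.normal(size=(16, d));   few_y = (few_X @ w_tgt > 0).astype(int)
test_X = rng.normal(size=(2000, d)); test_y = (test_X @ w_tgt > 0).astype(int)

source_only = LogisticRegression(max_iter=1000).fit(src_X, src_y)

X = np.vstack([src_X, few_X]); y = np.concatenate([src_y, few_y])
weights = np.concatenate([np.ones(len(src_y)), 300.0 * np.ones(len(few_y))])
adapted = LogisticRegression(max_iter=1000).fit(X, y, sample_weight=weights)

print("source-only acc on target:", source_only.score(test_X, test_y))
print("few-shot adapted acc     :", adapted.score(test_X, test_y))
```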
📢 We're back with a new edition, this year at @NeurIPSConf in Vancouver! Paper deadline is August 30th, we are looking forward to your submissions!