Daria Lioubashevski
@DariaLioub
CS MSc student @CseHuji | Student Researcher @Google
📢Paper release📢 What computation is the Transformer performing in the layers after the top-1 becomes fixed (a so-called "saturation event")? We show that the next highest-ranked tokens also undergo saturation *in order* of their ranking. Preprint: arxiv.org/abs/2410.20210 1/4

Ever wondered how Transformers refine their top-k predictions over their layers? 📊 Is there an order to the madness? Come find out at my poster presentation tomorrow at @icmlconf 📍East Exhibition Hall E-2512, 11:00-13:30

I'll be at #ICML2025 next week and would love to chat about mechanistic interpretability, neuroAI, or cognitive computational neuroscience. BTW, if you're already in Vancouver, I highly recommend the Aquarium! (second image is me with jet lag 😂)


Can (A)I change your mind? New study finds LLMs can be as persuasive as humans — even in real-world, ecological conversations on controversial topics. arxiv.org/abs/2503.01844 #AI #LLM #Persuasion #CogSci2025 @cogsci_soc @timnaWK @GoldsteinYAriel @yanivdover @morankor
Accepted at #icml2025🥳 Camera ready version (with newer models like Llama-3 and Qwen-Audio) coming soon!
✨ Ever tried generating an image from a prompt but ended up with unexpected outputs? Check out our new paper #FollowTheFlow - tackling T2I issues like bias, failed binding, and leakage from the textual encoding side! 💼🔍 arxiv.org/pdf/2504.01137 guykap12.github.io/guykap12.githu… 🧵[1/7]
New Preprint 🎉 LLM self-assessment unlocks efficient decoding ✅ Our Confidence-Informed Self-Consistency (CISC) method cuts compute without losing accuracy. We also rethink confidence evaluation & contribute to the debate on self-verification. arxiv.org/abs/2502.06233 1/8👇
Very excited to share our new paper published in Nature Communications @NatureComms (link below). This work is part of my PhD research under the supervision of @roireichart (Technion), @HassonUri (@HassonLab), and @ArielYGoldstein, in collaboration with @YoavMeiri.
Look at the CRAZY domain gap we found in summarization datasets: while English resources are diverse, other languages are mostly restricted to news. Presenting our survey covering 130+ datasets in 100+ languages! Explore: github.com/edahanoam/Awes… @GabiStanovsky, @nlphuji 1/6
In which layers does information flow from previous tokens to the current token? Presenting our new @BlackboxNLP paper: “Attend First, Consolidate Later: On the Importance of Attention in Different LLM Layers” arxiv.org/abs/2409.03621 1/n
Hallucinations are a subject of much interest, but how much do we know about them? In our new paper, we found that the internals of LLMs contain far more information about truthfulness than we knew! 🧵 Project page >> llms-know.github.io Arxiv >> arxiv.org/abs/2410.02707
📢 New paper alert! 📢 Thrilled to announce "Instructed to Bias: Instruction-Tuned Language Models Exhibit Emergent Cognitive Bias". Do instruction tuning and RLHF amplify biases in LMs? 🧵 Check it out arxiv.org/abs/2308.00225 With @boknilev, @GabiStanovsky, and N. Rosenfeld.