Gili Lior
@GiliLior
PhD student at @CSEhuji
🚨New paper alert🚨 🧠 Instruction-tuned LLMs show amplified cognitive biases — but are these new behaviors, or pretraining ghosts resurfacing? Excited to share our new paper, accepted to CoLM 2025🎉! See thread below 👇 #BiasInAI #LLMs #MachineLearning #NLProc
The longer a reasoning LLM thinks, the more likely it is to be correct, right? Apparently not. Presenting our paper: “Don’t Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning”. Link: arxiv.org/abs/2505.17813 1/n
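Rough sketch of the intuition (our illustration, not the exact recipe from the paper): sample k thinking chains, keep the m shortest, and majority-vote their answers. `generate_chain` is a hypothetical stand-in for your sampling call.

```python
from collections import Counter

def shortest_majority_answer(generate_chain, prompt, k=5, m=3):
    """Sample k reasoning chains and majority-vote over the m shortest.

    `generate_chain` is a hypothetical callable returning
    (thinking_tokens: list, answer: str) for one sampled chain.
    """
    chains = [generate_chain(prompt) for _ in range(k)]
    # Prefer shorter thinking: sort chains by thinking length.
    chains.sort(key=lambda chain: len(chain[0]))
    answers = [answer for _, answer in chains[:m]]
    # Counter breaks count ties by insertion order, i.e. toward
    # the answer backed by the shortest chain.
    return Counter(answers).most_common(1)[0][0]
```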
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation
Care about LLM evaluation? 🤖 🤔 We bring you 🕊️ DOVE, a massive (250M!) collection of LLM outputs on different prompts, domains, tokens, models... Join our community effort to expand it with YOUR model predictions & become a co-author!
Really cool work by Daria!
📢Paper release📢 What computation is the Transformer performing in the layers after the top-1 prediction becomes fixed (a so-called "saturation event")? We show that the next highest-ranked tokens also undergo saturation *in order* of their ranking. Preprint: arxiv.org/abs/2410.20210 1/4
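A toy logit-lens-style check of the idea (our own illustration; `layer_logits` is a hypothetical per-layer projection of one position's hidden state onto the vocabulary, not code from the paper):

```python
import torch

def saturation_layer(layer_logits: torch.Tensor, rank: int = 0) -> int:
    """Earliest layer after which the token occupying `rank` never
    changes again (its "saturation event").

    layer_logits: hypothetical (num_layers, vocab_size) tensor of
    logit-lens projections for a single position.
    """
    # Token id holding the given rank at each layer.
    ranked = layer_logits.argsort(dim=-1, descending=True)[:, rank]
    final = ranked[-1]
    sat = len(ranked) - 1
    # Walk backwards while the token at this rank already equals the final one.
    while sat > 0 and ranked[sat - 1] == final:
        sat -= 1
    return sat
```

Running this for rank = 0, 1, 2, ... lets you check whether saturation layers increase with rank, i.e. whether saturation happens in order of ranking.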
📢Paper release📢: 🔍 Ever wondered how LLMs understand words when all they see are tokens? 🧠 Our latest study uncovers how LLMs reconstruct full words from sub-word tokens, even when the words are misspelled or previously unseen. arxiv.org/pdf/2410.05864 (preprint) 👀 👇 [1/7]
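To see the setup: a sub-word tokenizer shreds both the correct and the misspelled form into pieces, so the model never sees the whole word (GPT-2's tokenizer here is just an illustrative choice, not the paper's model):

```python
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
for word in ["hippopotamus", "hipopotamus"]:  # correct vs. misspelled
    # Both forms split into several sub-word tokens.
    print(word, "->", tok.tokenize(word))
```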
1/ Into image captioning? Don’t miss this! Struggling to keep up with the influx of new metrics but still seeing the same 5 (BLEU, METEOR, ROUGE, CIDEr, SPICE) leading? Read our recent captioning evaluation survey! arxiv.org/abs/2408.04909 w/ @GabiStanovsky @AbendOmri @leafrermann >