Michael Hassid
@MichaelHassid
PhD candidate @HebrewU; Research Assistant @AIatMeta (FAIR)
The longer a reasoning LLM thinks, the more likely it is to be correct, right? Apparently not. Presenting our paper: “Don’t Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning”. Link: arxiv.org/abs/2505.17813 1/n
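To make the preference concrete, here is a minimal illustrative sketch (not the paper's exact procedure): sample several thinking chains, keep the shortest ones, and majority-vote their answers. `generate_chain` is a hypothetical helper returning (thinking_tokens, final_answer).

```python
# Illustrative sketch only - not the paper's exact method.
# generate_chain(prompt) is assumed to return (thinking_tokens, final_answer)
# for one independently sampled reasoning chain.
from collections import Counter

def answer_preferring_short_chains(prompt, generate_chain, k=8, m=3):
    """Sample k chains, keep the m shortest, majority-vote their answers."""
    chains = [generate_chain(prompt) for _ in range(k)]   # k independent samples
    chains.sort(key=lambda c: len(c[0]))                  # shortest thinking first
    short_answers = [answer for _, answer in chains[:m]]  # keep m shortest chains
    return Counter(short_answers).most_common(1)[0][0]    # majority vote among them
```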

Nice to see that TOVA is still one of the leading KV compression methods, even after more than a year and a half, especially for high-compression regimes. Paper: arxiv.org/abs/2401.06104
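For context, a rough sketch of the general attention-based eviction idea behind TOVA-style KV compression (keep the cached tokens the newest query attends to most, under a fixed cache budget); see the paper for the actual algorithm. Shapes and names below are illustrative.

```python
# Rough sketch of attention-based KV cache eviction (the general idea behind
# TOVA-style policies); see arxiv.org/abs/2401.06104 for the real algorithm.
import torch

def evict_kv(keys, values, attn_weights, budget):
    """Keep at most `budget` cached tokens for one head.

    keys, values: [seq_len, head_dim] cached states
    attn_weights: [seq_len] attention of the newest token over the cache
    """
    if keys.shape[0] <= budget:
        return keys, values
    keep = torch.topk(attn_weights, k=budget).indices.sort().values  # preserve order
    return keys[keep], values[keep]
```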
🏆 Our @nvidia KV Cache Compression Leaderboard is now live! Compare state-of-the-art compression methods side-by-side with KVPress. See which techniques are leading in efficiency and performance. 🥇 huggingface.co/spaces/nvidia/…
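A minimal usage sketch of plugging a compression method into KVPress, based on the library's documented text-generation pipeline pattern; the model name and compression ratio are placeholders, and the exact interface may differ from what is shown here.

```python
# Sketch based on the KVPress text-generation pipeline pattern;
# check https://github.com/NVIDIA/kvpress for the current interface.
from transformers import pipeline
from kvpress import ExpectedAttentionPress

pipe = pipeline(
    "kv-press-text-generation",                      # pipeline registered by kvpress
    model="meta-llama/Meta-Llama-3.1-8B-Instruct",   # placeholder model
    device="cuda:0",
)

context = "A long document whose KV cache we want to compress."
question = "What is this document about?"

press = ExpectedAttentionPress(compression_ratio=0.5)  # drop ~50% of the cache
answer = pipe(context, question=question, press=press)["answer"]
print(answer)
```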
🚨New paper alert🚨 🧠 Instruction-tuned LLMs show amplified cognitive biases — but are these new behaviors, or pretraining ghosts resurfacing? Excited to share our new paper, accepted to CoLM 2025🎉! See thread below 👇 #BiasInAI #LLMs #MachineLearning #NLProc
🎉Thrilled that our paper on "scaling analysis of interleaved speech-text LMs" was accepted to #CoLM2025! It leaves room for optimism when scaling SpeechLMs *right* - with large TextLMs (in place of more data), interleaving, and synthetic training data💪
Excited to share our recent work on corrector sampling in language models! A new sampling method that mitigates error accumulation by iteratively revisiting tokens in a window of previously generated text. With: @shaulneta @urielsinger @lipmanya Link: arxiv.org/abs/2506.06215
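A toy sketch of the window-revisiting idea described above (not the exact corrector-sampling algorithm from the paper); `sample_next` and `resample_at` are hypothetical model interfaces.

```python
# Illustrative sketch of revisiting tokens in a window of recent text,
# not the exact corrector-sampling algorithm (arxiv.org/abs/2506.06215).
# sample_next(prefix) -> token and resample_at(prefix, position) -> token
# are assumed model interfaces.
import random

def generate_with_revisits(prompt, sample_next, resample_at,
                           max_new_tokens=128, window=16, p_revisit=0.25):
    tokens = list(prompt)
    for _ in range(max_new_tokens):
        tokens.append(sample_next(tokens))               # usual left-to-right step
        if random.random() < p_revisit:
            lo = max(len(prompt), len(tokens) - window)  # only revisit recent tokens
            pos = random.randrange(lo, len(tokens))
            tokens[pos] = resample_at(tokens, pos)       # revisit a token in the window
    return tokens
```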
Which modeling to choose for text-to-music generation? We run a head-to-head comparison to figure it out. Same data, same architecture - autoregressive (AR) vs. flow matching (FM). 👇 If you care about fidelity, speed, control, or editing, see this thread. 🔗huggingface.co/spaces/ortal16… 📄arxiv.org/abs/2506.08570 1/6
Thanks @_akhaliq for sharing our work!
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning
Great work led by @GallilMaimon about scaling Speech LMs. Check it out!
Many modern SpeechLMs are trained with Speech-Text interleaving. How does this impact scaling trends? In our new paper, we train several dozen SLMs, and show - quite a lot! So there is room for optimism 😊 Key insights, code, models, full paper 👇🏻
"Scaling Analysis of Interleaved Speech-Text Language Models," Gallil Maimon, Michael Hassid, Amit Roth, Yossi Adi, ift.tt/gvRXTOe
Care about LLM evaluation? 🤖 🤔 We bring you 🕊️ DOVE, a massive (250M!) collection of LLM outputs on different prompts, domains, tokens, models... Join our community effort to expand it with YOUR model predictions & become a co-author!
While TAing the advanced NLP course at HUJI last year, I mentored students on their projects. I'm excited to share that one of those projects has developed into a (preprint) paper. Check it out :)
Can RAG performance get *worse* with more relevant documents?📄 We put the number of retrieved documents in RAG to the test! 💥Preprint💥: arxiv.org/abs/2503.04388 1/3
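Roughly, this kind of study comes down to a sweep like the sketch below; `retrieve` and `answer` are hypothetical retriever/reader interfaces, not the paper's setup.

```python
# Sketch of sweeping the number of retrieved documents in a RAG pipeline.
# retrieve(query, k) and answer(query, docs) are hypothetical interfaces.
def sweep_num_docs(eval_set, retrieve, answer, ks=(1, 2, 5, 10, 20)):
    results = {}
    for k in ks:
        correct = 0
        for query, gold in eval_set:
            docs = retrieve(query, k)             # top-k retrieved documents
            correct += int(answer(query, docs) == gold)
        results[k] = correct / len(eval_set)      # accuracy as a function of k
    return results
```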