Adi Haviv
@adihaviv
CS Ph.D. Candidate at @TelAvivUni. Researching #NLProc and Computer Vision.
Excited to present our latest work at #GenLaw #ICML2024! Interested in whether T2I Stable Diffusion models can create original content, how to measure originality, and how this relates to copyright infringement? Join me at the poster session today at 2pm in Lehar 2! 👩🏫🧵

{1/8} 🧵 When you click a link, have you ever wondered: “Which webpage is actually important?” Google answered that with PageRank—treating the web as a Markov chain. Now imagine doing the same… but for transformer attention.👇 🔗 yoterel.github.io/attention_chai…
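To make the analogy concrete, here is a minimal sketch (my own toy code, not the project's implementation) of PageRank-style power iteration over a row-stochastic attention matrix, treating each attention weight as a Markov transition probability between tokens:

```python
import numpy as np

def attention_pagerank(attn: np.ndarray, damping: float = 0.85, iters: int = 100) -> np.ndarray:
    """attn: [seq_len, seq_len] attention weights, each row summing to 1."""
    n = attn.shape[0]
    rank = np.full(n, 1.0 / n)       # start from the uniform distribution
    teleport = np.full(n, 1.0 / n)   # uniform "teleport" term, as in classic PageRank
    for _ in range(iters):
        rank = damping * (rank @ attn) + (1 - damping) * teleport
    return rank                      # stationary importance score per token

# Toy example: 3 tokens, most attention mass flows to the last token.
attn = np.array([[0.1, 0.2, 0.7],
                 [0.3, 0.1, 0.6],
                 [0.2, 0.3, 0.5]])
print(attention_pagerank(attn))
```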
1/ Can we teach a motion model to "dance like a chicken"? Or better: can LoRA help motion diffusion models learn expressive, editable styles without forgetting how to move? Led by @HSawdayee and @chuan_guo92603, we explore this in our latest work. 🎥 haimsaw.github.io/LoRA-MDM/ 🧵👇
A Vision-Language Model can answer questions about Robin Williams. It can also recognize him in a photo. So why does it FAIL when asked the same questions using his photo instead of his name? A thread on our new #acl2025 paper that explores this puzzle 🧵
Really impressive results for human-object interaction. They use a two-phase process where they optimize the diffusion noise, instead of the motion itself, to reach sub-centimeter precision while staying on the motion manifold 🧠 HOIDiNi - hoidini.github.io
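For intuition, a hedged sketch of what noise-space optimization generally looks like (toy code under my own assumptions; `denoiser` and `objective` are placeholders, not HOIDiNi's API): the noise is the optimization variable, and the frozen denoiser keeps the resulting motion on the learned manifold.

```python
import torch

def optimize_noise(denoiser, objective, noise_shape, steps=200, lr=1e-2):
    """Optimize the diffusion noise rather than the motion itself."""
    noise = torch.randn(noise_shape, requires_grad=True)
    opt = torch.optim.Adam([noise], lr=lr)
    for _ in range(steps):
        motion = denoiser(noise)      # frozen model maps noise onto the motion manifold
        loss = objective(motion)      # e.g., a hypothetical hand-object contact loss
        opt.zero_grad()
        loss.backward()
        opt.step()
    return noise.detach()

# Toy stand-ins so the sketch runs end to end.
denoiser = torch.nn.Linear(8, 8).requires_grad_(False)
objective = lambda motion: motion.pow(2).mean()
z = optimize_noise(denoiser, objective, (1, 8))
```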
Excited to share that our new work, Be Decisive, has been accepted to SIGGRAPH! We improve multi-subject generation by extracting a layout directly from noise, resulting in more diverse and accurate compositions. Website: omer11a.github.io/be-decisive/ Paper: arxiv.org/abs/2505.21488
🔔Excited to announce that #AnyTop has been accepted to #SIGGRAPH2025!🥳 ✅ A diffusion model that generates motion for arbitrary skeletons ✅ Using only a skeletal structure as input ✅ Learns semantic correspondences across diverse skeletons 🌐 Project: anytop2025.github.io/Anytop-page
Excited to share that "TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space" got accepted to SIGGRAPH 2025! It tackles disentangling complex visual concepts from as little as a single image and re-composing concepts across multiple images into a coherent…
🔔just landed: IP Composer🎨 Semantically mix & match visual concepts from images ❌ text prompts can't always capture visual nuances ❌ visual-input-based methods often need training / don't allow fine-grained control over *which* concepts to extract from our input images So👇
pretty mind-blowing fact I just learned about transformer language models: the positional embeddings don't really do anything. you can just get rid of them and the model still works just as well. sounds impossible, doesn't it? turns out standard LLMs aren't actually…
Transformers can work without using positional embeddings at all. Llama 4 uses positional embs for local attn but not globally. Our paper from 2022 shows why this works: the causal mask allows transformers to infer positions. arxiv.org/pdf/2203.16634
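A minimal sketch illustrating the setup (toy code, not the paper's): causal self-attention with no positional embedding added to the inputs, so the causal mask is the only source of order information.

```python
import torch
import torch.nn.functional as F

def causal_attention_no_pos(x, w_q, w_k, w_v):
    """x: [seq, dim] token embeddings with no positional term added."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / k.shape[-1] ** 0.5
    mask = torch.triu(torch.ones_like(scores, dtype=torch.bool), diagonal=1)
    scores = scores.masked_fill(mask, float("-inf"))  # token i only attends to tokens <= i
    return F.softmax(scores, dim=-1) @ v

x = torch.randn(5, 16)                                # 5 tokens, no positional embedding
w = [torch.randn(16, 16) * 0.1 for _ in range(3)]
print(causal_attention_no_pos(x, *w).shape)           # torch.Size([5, 16])
```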
Wanna check how well a model can share knowledge between languages? Of course you do! 🤩 But can you do it without access to the model’s weights? Now you can with ECLeKTic 🤯
New #ICLR2024 paper! The KoLMogorov Test: can CodeLMs compress data by code generation? The optimal compression for a sequence is the shortest program that generates it. Empirically, LMs struggle even on simple sequences, but can be trained to outperform current methods! 🧵1/7
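A toy illustration of the idea (my own hypothetical example, not from the paper): the best "compression" of a sequence is the shortest program that reproduces it, so a code LM that emits a short generator beats storing the raw data.

```python
# A sequence and two encodings of it: the literal list vs. a short program.
seq = [1, 2, 4, 8, 16, 32, 64, 128, 256, 512]

literal = repr(seq)                           # store every element verbatim
program = "print([2**i for i in range(10)])"  # a short generator a code LM could emit

print(len(literal), len(program))             # the program encoding is shorter
```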
Ever stared at a set of shapes and thought: 'These could be something… but what?' Designed for visual ideation, PiT takes a set of concepts and interprets them as parts within a target domain, assembling them together while also sampling missing parts. eladrich.github.io/PiT/
🚀 New preprint! 🚀 Check out AnyTop 🤩 ✅ A diffusion model that generates motion for arbitrary skeletons 🦴 ✅ Using only a skeletal structure as input ✅ Learns semantic correspondences across diverse skeletons 🦅🐒🪲 🔗 Arxiv: arxiv.org/abs/2502.17327
Excited to introduce our new work: ImageRAG 🖼️✨ rotem-shalev.github.io/ImageRAG We enhance off-the-shelf generative models with Retrieval-Augmented Generation (RAG) for unknown concept generation, using a VLM-based approach that’s easy to integrate with new & existing models! [1/3]
🚀 Meet DiP: our newest text-to-motion diffusion model! ✨ Ultra-fast generation ♾️ Creates endless, dynamic motions 🔄 Seamlessly switch prompts on the fly Best of all, it's now available in the MDM codebase: github.com/GuyTevet/motio… [1/3]
VideoJAM is our new framework for improved motion generation from @AIatMeta. We show that video generators struggle with motion because the training objective favors appearance over dynamics. VideoJAM directly addresses this **without any extra data or scaling** 👇🧵
What if you could compose videos: merging multiple clips, even capturing complex athletic moves where video models struggle, all while preserving motion and context? And yes, you can still edit them with text after! Stay tuned for more results. #AI #VideoGeneration #SnapResearch
How can we interpret LLM features at scale? 🤔 Current pipelines use activating inputs, which is costly and ignores how features causally affect model outputs! We propose efficient output-centric methods that better predict how steering a feature will affect model outputs. New…
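For context, a hedged sketch of the generic feature-steering operation the tweet refers to (toy code with placeholder tensors, not the paper's method): add a scaled feature direction to a hidden state and measure how the output logits move.

```python
import torch

def steer(hidden, direction, scale=5.0):
    """Add a scaled, normalized feature direction to the hidden states."""
    return hidden + scale * direction / direction.norm()

d = 32
hidden = torch.randn(4, d)       # toy hidden states for 4 tokens
direction = torch.randn(d)       # a hypothetical feature direction
unembed = torch.randn(d, 100)    # toy unembedding to a 100-token vocabulary

logits_before = hidden[-1] @ unembed
logits_after = steer(hidden, direction)[-1] @ unembed
print((logits_after - logits_before).abs().mean())  # how much the output shifted
```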
Text prompts have shaped how we compose images with foundation models. But what if we could simply inject Visual Prompts instead? We introduce 🌟Visual Composer🌟 which achieves high-fidelity compositions of subjects and backgrounds with visual prompts! snap-research.github.io/visual-compose…
[1/4] Ever wondered what it would be like to use images—rather than text—to generate object and background compositions? We introduce VisualComposer, a method for compositional image generation with object-level visual prompts.