Rotem Shalev-Arkushin
@rotemsh3
CS PhD student @ Tel-Aviv University
Excited to introduce our new work: ImageRAG 🖼️✨ rotem-shalev.github.io/ImageRAG We enhance off-the-shelf generative models with Retrieval-Augmented Generation (RAG) for unknown concept generation, using a VLM-based approach that’s easy to integrate with new & existing models! [1/3]
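A rough sketch of the retrieval half of an ImageRAG-style pipeline, using off-the-shelf CLIP for text-to-image retrieval (function names and the index layout are mine for illustration; the paper's VLM-driven selection step is not shown):

```python
# Minimal retrieval sketch for an ImageRAG-style pipeline (hypothetical
# helper names; the actual method uses a VLM to decide what to retrieve).
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")

def embed_images(paths):
    """Embed a list of image files into normalized CLIP space."""
    images = [Image.open(p).convert("RGB") for p in paths]
    inputs = processor(images=images, return_tensors="pt")
    with torch.no_grad():
        feats = model.get_image_features(**inputs)
    return feats / feats.norm(dim=-1, keepdim=True)

def retrieve_references(prompt, index_feats, index_paths, k=3):
    """Return the k images whose CLIP embedding best matches the prompt."""
    inputs = processor(text=[prompt], return_tensors="pt", padding=True)
    with torch.no_grad():
        text_feat = model.get_text_features(**inputs)
    text_feat = text_feat / text_feat.norm(dim=-1, keepdim=True)
    scores = (text_feat @ index_feats.T).squeeze(0)  # cosine similarity
    top = scores.topk(k).indices
    return [index_paths[i] for i in top]

# The retrieved images would then be handed to an off-the-shelf generator
# as visual references (e.g., via image-conditioning adapters).
```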
1/ Can we teach a motion model to "dance like a chicken"? Or better: can LoRA help motion diffusion models learn expressive, editable styles without forgetting how to move? Led by @HSawdayee, @chuan_guo92603, we explore this in our latest work. 🎥 haimsaw.github.io/LoRA-MDM/ 🧵👇
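For context, the LoRA idea in one snippet: the pretrained layer is frozen and only a low-rank update is trained, so the new style lives in a few small matrices while the base motion prior stays intact (standard LoRA formulation, not the paper's code):

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer plus a trainable low-rank update B @ A."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # freeze the pretrained weights
            p.requires_grad = False
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))  # zero init: starts as identity update
        self.scale = alpha / rank

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T) @ self.B.T
```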
Really impressive results for human-object interaction. They use a two-phase process where they optimize the diffusion noise, instead of the motion itself, to reach sub-centimeter precision while staying on the manifold 🧠 HOIDiNi - hoidini.github.io
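My reading of the noise-optimization trick, as a sketch (`sampler` and `loss_fn` are assumed stand-ins, not the authors' code): because the free variable is the initial noise and a frozen sampler maps it to a motion, every iterate is still a sample from the model, which is what keeps results on the manifold:

```python
import torch

def optimize_noise(sampler, loss_fn, shape, steps=100, lr=1e-2):
    """Optimize the initial diffusion noise z instead of the motion itself."""
    z = torch.randn(shape, requires_grad=True)  # initial diffusion noise
    opt = torch.optim.Adam([z], lr=lr)
    for _ in range(steps):
        motion = sampler(z)      # differentiable denoising, weights frozen
        loss = loss_fn(motion)   # e.g., hand-object contact / penetration error
        opt.zero_grad()
        loss.backward()          # gradients flow back through the sampler to z
        opt.step()
    return sampler(z.detach())
```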
Excited to share that our new work, Be Decisive, has been accepted to SIGGRAPH! We improve multi-subject generation by extracting a layout directly from noise, resulting in more diverse and accurate compositions. Website: omer11a.github.io/be-decisive/ Paper: arxiv.org/abs/2505.21488
Excited to share that "IP-Composer: Semantic Composition of Visual Concepts" got accepted to #SIGGRAPH2025!🥳 We show how to combine visual concepts from multiple input images by projecting them into CLIP subspaces - no training, just neat embedding math✨ Really enjoyed working…
🔔just landed: IP Composer🎨 semantically mix & match visual concepts from images ❌ text prompts can't always capture visual nuances ❌ visual-input-based methods often need training / don't allow fine-grained control over *which* concepts to extract from our input images. So👇
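The kind of embedding math this implies, as a hedged sketch (helper names are hypothetical): estimate a concept subspace from CLIP embeddings of texts that vary only in that concept, then swap one image's component inside that subspace for another image's:

```python
import torch

def concept_subspace(text_embeds: torch.Tensor, rank: int = 10) -> torch.Tensor:
    """text_embeds: (n, d) CLIP embeddings of concept-varying descriptions."""
    _, _, Vt = torch.linalg.svd(text_embeds, full_matrices=False)
    return Vt[:rank]  # (rank, d) orthonormal basis spanning the concept

def swap_concept(base: torch.Tensor, ref: torch.Tensor, basis: torch.Tensor):
    """Replace base's component in the concept subspace with ref's."""
    P = basis.T @ basis            # (d, d) projector onto the subspace
    return base - base @ P + ref @ P

# The composed embedding then conditions an image-prompt-style generator;
# no training is involved, only projections in CLIP space.
```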
🔔Excited to announce that #AnyTop has been accepted to #SIGGRAPH2025!🥳 ✅ A diffusion model that generates motion for arbitrary skeletons ✅ Using only a skeletal structure as input ✅ Learns semantic correspondences across diverse skeletons 🌐 Project: anytop2025.github.io/Anytop-page
RefVNLI: Towards Scalable Evaluation of Subject-driven Text-to-image Generation
How well do LLMs memorize obscure details from scientific papers? I created a benchmark for that! Full code, dataset, and data creation method included. tl;dr: GPT-4.5 is a major jump in memorization of scientific facts. Thread below 👇
🚀 Meet DiP: our newest text-to-motion diffusion model! ✨ Ultra-fast generation ♾️ Creates endless, dynamic motions 🔄 Seamlessly switch prompts on the fly Best of all, it's now available in the MDM codebase: github.com/GuyTevet/motio… [1/3]
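A conceptual sketch of how on-the-fly prompt switching can work in a streaming setup (hypothetical interface, not the actual MDM codebase API): motion comes out in short chunks, each conditioned on the tail of the previous one, so the text prompt can change between chunks without a visible seam:

```python
def stream_motion(model, prompts, chunk_len=40, overlap=10):
    """Yield motion chunks, switching the text prompt between chunks."""
    history = None
    for prompt in prompts:  # e.g., ["walk", "run", "dance"]
        chunk = model.generate(text=prompt, prefix=history, length=chunk_len)
        yield chunk
        history = chunk[-overlap:]  # condition the next chunk on the tail
```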
🔔🔔Thrilled to share #MoMo [@SIGGRAPHAsia 2024 🥳🎉]: Exploring the attention space of #MotionDiffusionModels. Our training-free method enables cool applications like this motion transfer 🐒🐒. monkeyseedocg.github.io