Omer Dahary
@OmerDahary
Excited to share that our new work, Be Decisive, has been accepted to SIGGRAPH! We improve multi-subject generation by extracting a layout directly from noise, resulting in more diverse and accurate compositions. Website: omer11a.github.io/be-decisive/ Paper: arxiv.org/abs/2505.21488
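One hedged reading of "extracting a layout directly from noise", as a toy sketch: partially denoise, then cluster the intermediate prediction into per-subject regions that can later guide generation. Everything below (`toy_denoiser`, the k-means choice) is an illustrative assumption, not the paper's actual method.

```python
# Hedged sketch: derive a coarse per-subject layout from an intermediate
# denoising prediction by clustering spatial features. Toy stand-ins only.
import torch

def toy_denoiser(z_t: torch.Tensor) -> torch.Tensor:
    # Stand-in for the diffusion model's predicted clean latent x0.
    return torch.tanh(z_t)

def layout_from_noise(z_t: torch.Tensor, num_subjects: int = 2, iters: int = 10) -> torch.Tensor:
    """Cluster the predicted latent's spatial features into subject masks."""
    x0 = toy_denoiser(z_t)                        # (C, H, W)
    c, h, w = x0.shape
    feats = x0.reshape(c, h * w).T                # (H*W, C): one feature per pixel
    # Simple k-means over pixel features (a plausible, not official, choice).
    centers = feats[torch.randperm(h * w)[:num_subjects]]
    for _ in range(iters):
        dists = torch.cdist(feats, centers)       # (H*W, K)
        assign = dists.argmin(dim=1)
        for k in range(num_subjects):
            if (assign == k).any():
                centers[k] = feats[assign == k].mean(dim=0)
    return assign.reshape(h, w)                   # integer mask: subject id per pixel

layout = layout_from_noise(torch.randn(4, 32, 32), num_subjects=3)
print(layout.shape, layout.unique())
```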

1/ Can we teach a motion model to "dance like a chicken"? Or better: can LoRA help motion diffusion models learn expressive, editable styles without forgetting how to move? Led by @HSawdayee and @chuan_guo92603, we explore this in our latest work. 🎥 haimsaw.github.io/LoRA-MDM/ 🧵👇
Really impressive results for human-object interaction. They use a two-phase process where they optimize the diffusion noise, instead of the motion itself, to reach sub-centimeter precision while staying on the manifold 🧠 HOIDiNi - hoidini.github.io
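My reading of "optimize the noise, not the motion": keep the pretrained denoiser frozen, treat the input noise as the free variable, and backprop a precision/contact loss through the sampler. A minimal, hypothetical sketch (toy denoiser and loss, not the HOIDiNi code):

```python
# Hedged sketch of noise-space optimization: the latent noise is the only
# trainable variable, the (frozen) denoiser maps it to a motion, and a task
# loss (here: a toy "reach the target point" objective) is minimized.
import torch

torch.manual_seed(0)

def frozen_denoiser(noise: torch.Tensor) -> torch.Tensor:
    # Stand-in for a pretrained motion diffusion sampler (kept fixed).
    return torch.cumsum(0.1 * torch.tanh(noise), dim=0)  # (T, 3) toy trajectory

target = torch.tensor([0.5, 0.2, 0.0])          # e.g. an object contact point
noise = torch.randn(60, 3, requires_grad=True)  # optimize the noise, not the motion
opt = torch.optim.Adam([noise], lr=0.05)

for step in range(200):
    motion = frozen_denoiser(noise)
    loss = (motion[-1] - target).pow(2).sum()   # toy precision objective
    opt.zero_grad()
    loss.backward()
    opt.step()

print(f"final end-effector error: {(frozen_denoiser(noise)[-1] - target).norm():.4f}")
```

Because only the noise is updated, the sample never leaves the set of outputs the frozen model can produce, which is the "staying on the manifold" part.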
LLMs Don’t Think Like Developers - Until Now. Together with @KatzShachar and @liorwolf, we made LLMs execute their code while generating it, just like a human developer. Meet EG-CFG: a new inference-time method that injects real-time execution feedback into the generation loop.…
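The core loop as I understand it: generate a candidate continuation, actually execute the code so far, and feed the runtime result back into the next generation step. A toy, hedged sketch with a stubbed model (`fake_llm` is a placeholder, not the EG-CFG implementation):

```python
# Hedged sketch: inference-time loop that executes partial code and injects
# the observed output/error back into the next generation step.
import io, contextlib, traceback

def fake_llm(prompt: str) -> str:
    # Placeholder for a real code LLM call; returns the next code line.
    return "print(sum(range(5)))"

def run_partial(code: str) -> str:
    """Execute the code so far and capture stdout or the error message."""
    buf = io.StringIO()
    try:
        with contextlib.redirect_stdout(buf):
            exec(code, {})                     # real execution, like a developer would
        return f"OK, stdout: {buf.getvalue()!r}"
    except Exception:
        return f"ERROR: {traceback.format_exc(limit=1).splitlines()[-1]}"

prompt, code = "Write code that prints the sum of 0..4.", ""
for _ in range(3):                             # a few generate -> execute -> feed-back rounds
    line = fake_llm(prompt)
    code += line + "\n"
    feedback = run_partial(code)
    prompt += f"\n# so far:\n{code}# execution feedback: {feedback}"
    print(feedback)
```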
Excited to share that "IP-Composer: Semantic Composition of Visual Concepts" got accepted to #SIGGRAPH2025!🥳 We show how to combine visual concepts from multiple input images by projecting them into CLIP subspaces - no training, just neat embedding math✨ Really enjoyed working…
🔔just landed: IP Composer🎨 semantically mix & match visual concepts from images ❌ text prompts can't always capture visual nuances ❌ visual input based methods often need training / don't allow fine grained control over *which* concepts to extract from our input images So👇
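A hedged sketch of the "neat embedding math" idea: build a concept subspace from a few direction vectors, then replace the base image's component in that subspace with the concept image's component. Random vectors stand in for real CLIP embeddings; this is not the IP-Composer code.

```python
# Swap the part of one embedding that lies in a concept subspace with the
# corresponding part of another embedding. Illustrative only.
import torch

def subspace_projector(directions: torch.Tensor) -> torch.Tensor:
    """Orthonormal projection matrix onto span(directions), via QR."""
    q, _ = torch.linalg.qr(directions.T)        # (D, k) orthonormal basis
    return q @ q.T                              # (D, D) projector P

d = 512                                          # CLIP-like embedding size
concept_dirs = torch.randn(8, d)                 # e.g. embedding differences describing one concept
P = subspace_projector(concept_dirs)

base_img = torch.randn(d)                        # embedding of the image we keep overall
concept_img = torch.randn(d)                     # embedding of the image carrying the concept

# Keep everything of the base image outside the concept subspace,
# take the concept image's component inside it.
composed = (torch.eye(d) - P) @ base_img + P @ concept_img
print(composed.shape)
```

The fine-grained control comes from choosing which subspace(s) to project onto: each concept gets its own projector, so you decide per concept which source image contributes it.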
🔔Excited to announce that #AnyTop has been accepted to #SIGGRAPH2025!🥳 ✅ A diffusion model that generates motion for arbitrary skeletons ✅ Using only a skeletal structure as input ✅ Learns semantic correspondences across diverse skeletons 🌐 Project: anytop2025.github.io/Anytop-page
Excited to share that "TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space" got accepted to SIGGRAPH 2025! It tackles disentangling complex visual concepts from as little as a single image and re-composing concepts across multiple images into a coherent…
Ever stared at a set of shapes and thought: 'These could be something… but what?' Designed for visual ideation, PiT takes a set of concepts and interprets them as parts within a target domain, assembling them together while also sampling missing parts. eladrich.github.io/PiT/
Vectorization into a neat SVG!🎨✨ Instead of generating a messy SVG (left), we produce a structured, compact representation (right), enhancing usability for editing and modification. Accepted to #CVPR2025!
“Tight Inversion” uses an IP-Adapter during DDIM inversion to preserve the original image better when editing. arxiv.org/abs/2502.20376
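A hedged sketch of what "IP-Adapter during DDIM inversion" could look like: the standard inversion recursion, except epsilon is predicted with the source image itself as an extra condition (the role the IP-Adapter plays). Toy predictor and fixed alphas; not the paper's or diffusers' actual code.

```python
# DDIM inversion with image conditioning (illustrative stand-ins only).
import torch

def eps_pred(z: torch.Tensor, t: int, image_cond: torch.Tensor) -> torch.Tensor:
    # Stand-in for a UNet + IP-Adapter call conditioned on the input image.
    return 0.1 * torch.tanh(z + image_cond)

def ddim_invert(x0: torch.Tensor, alphas: torch.Tensor, image_cond: torch.Tensor) -> torch.Tensor:
    """Map a clean latent x0 to a noisy latent by running DDIM backwards."""
    z = x0
    for t in range(len(alphas) - 1):
        a_t, a_next = alphas[t], alphas[t + 1]
        eps = eps_pred(z, t, image_cond)
        pred_x0 = (z - (1 - a_t).sqrt() * eps) / a_t.sqrt()
        z = a_next.sqrt() * pred_x0 + (1 - a_next).sqrt() * eps
    return z

x0 = torch.randn(4, 64, 64)
alphas = torch.linspace(0.999, 0.1, 50)   # decreasing alpha_bar: inversion toward noise
z_T = ddim_invert(x0, alphas, image_cond=x0)
print(z_T.std())
```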
🚀 New preprint! 🚀 Check out AnyTop 🤩 ✅ A diffusion model that generates motion for arbitrary skeletons 🦴 ✅ Using only a skeletal structure as input ✅ Learns semantic correspondences across diverse skeletons 🦅🐒🪲 🔗 Arxiv: arxiv.org/abs/2502.17327
Excited to introduce our new work: ImageRAG 🖼️✨ rotem-shalev.github.io/ImageRAG We enhance off-the-shelf generative models with Retrieval-Augmented Generation (RAG) for unknown concept generation, using a VLM-based approach that’s easy to integrate with new & existing models! [1/3]
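A hedged sketch of the loop as described in the tweet: generate, ask a VLM which prompt concepts are missing, retrieve reference images for them, and regenerate with the references as extra conditioning. Every function below is a stub/assumption, not the project's API.

```python
# Retrieval-augmented image generation loop (illustrative stubs only).
from typing import List, Optional

def generate(prompt: str, references: Optional[List[str]] = None) -> str:
    # Placeholder for an off-the-shelf generative model call.
    return f"image({prompt}, refs={references})"

def vlm_missing_concepts(image: str, prompt: str) -> List[str]:
    # Placeholder: a VLM would compare the generated image against the prompt.
    return ["rare bird"] if "refs=None" in image else []

def retrieve(concept: str, top_k: int = 1) -> List[str]:
    # Placeholder: nearest-neighbor search in an image database.
    return [f"db_image_of_{concept.replace(' ', '_')}"]

prompt = "a rare bird wearing a top hat"
image = generate(prompt)
missing = vlm_missing_concepts(image, prompt)
if missing:                                   # retrieval-augmented second pass
    refs = [img for c in missing for img in retrieve(c)]
    image = generate(prompt, references=refs)
print(image)
```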
🚀 Meet DiP: our newest text-to-motion diffusion model! ✨ Ultra-fast generation ♾️ Creates endless, dynamic motions 🔄 Seamlessly switch prompts on the fly Best of all, it's now available in the MDM codebase: github.com/GuyTevet/motio… [1/3]
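A hedged guess at what "switch prompts on the fly" implies structurally: motion is produced in short autoregressive segments, each conditioned on whatever the current prompt is. Toy generator below; not the DiP/MDM code.

```python
# Streaming motion with mid-stream prompt switching (toy stand-in).
import torch

def generate_segment(prev_pose: torch.Tensor, prompt: str, length: int = 20) -> torch.Tensor:
    # Stand-in for one diffusion call conditioned on the motion prefix + text prompt.
    drift = 0.01 * (hash(prompt) % 7 - 3)
    steps = drift + 0.02 * torch.randn(length, prev_pose.shape[-1])
    return prev_pose + torch.cumsum(steps, dim=0)

prompts = ["walk forward"] * 3 + ["jump"] * 2 + ["wave hands"] * 3   # switch mid-stream
motion = [torch.zeros(1, 6)]
for p in prompts:
    motion.append(generate_segment(motion[-1][-1:], p))
motion = torch.cat(motion, dim=0)
print(motion.shape)   # one continuous, prompt-conditioned stream
```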
What if you could compose videos: merging multiple clips, even capturing complex athletic moves where video models struggle, all while preserving motion and context? And yes, you can still edit them with text after! Stay tuned for more results. #AI #VideoGeneration #SnapResearch
[1/5] Rethinking SVGs, the implicit way — meet NeuralSVG! 🎨✨ An implicit neural representation for generating layered SVGs from text prompts. 💧 Powered by SDS and nested-dropout for ordered shapes 🖌️ Enables inference-time editing like color palette & aspect ratio Read more on…
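A hedged sketch of how nested dropout can induce the "ordered shapes" property: during each training step only a random prefix of the shape list is kept, so earlier shapes learn to carry the coarse structure. Toy renderer and loss; not the NeuralSVG implementation.

```python
# Nested dropout over an ordered shape list (toy objective).
import torch

num_shapes, shape_dim = 16, 8
shapes = torch.randn(num_shapes, shape_dim, requires_grad=True)  # learnable shape params
target = torch.randn(shape_dim)
opt = torch.optim.Adam([shapes], lr=1e-2)

def toy_render(prefix: torch.Tensor) -> torch.Tensor:
    # Stand-in for a differentiable rasterizer: just sum the kept shapes.
    return prefix.sum(dim=0)

for step in range(100):
    k = torch.randint(1, num_shapes + 1, ()).item()   # nested dropout: keep shapes[:k]
    loss = (toy_render(shapes[:k]) - target).pow(2).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

# At inference you can truncate to any prefix and still get a sensible result.
print((toy_render(shapes[:4]) - target).norm(), (toy_render(shapes) - target).norm())
```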
Ever wondered how a SINGLE token represents all subject regions in personalization? Many methods use this token in cross-attention, meaning all semantic parts share the same single attention value. We present Nested Attention, a mechanism that generates localized attention values
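A hedged sketch of "localized attention values": in cross-attention, the single subject token no longer contributes one shared value; instead its value is recomputed per image query by a nested attention over subject features. Shapes and projections below are illustrative assumptions, not the paper's architecture.

```python
# Nested attention producing query-dependent values for one token (illustrative).
import torch
import torch.nn.functional as F

d, n_img, n_txt, n_subj = 64, 16, 8, 12
img_q = torch.randn(n_img, d)        # image queries (one per spatial location)
txt_k = torch.randn(n_txt, d)        # text keys
txt_v = torch.randn(n_txt, d)        # text values; index SUBJ is the subject token
subj_feats = torch.randn(n_subj, d)  # encoder features of the subject image
SUBJ = 3

# Nested attention: each image query attends over subject features to build
# its own, localized value for the subject token.
nested = F.softmax(img_q @ subj_feats.T / d**0.5, dim=-1)   # (n_img, n_subj)
local_subj_v = nested @ subj_feats                          # (n_img, d)

# Outer cross-attention, but the subject token's value now differs per query.
attn = F.softmax(img_q @ txt_k.T / d**0.5, dim=-1)          # (n_img, n_txt)
values = txt_v.expand(n_img, n_txt, d).clone()              # (n_img, n_txt, d)
values[:, SUBJ] = local_subj_v
out = torch.einsum("qt,qtd->qd", attn, values)
print(out.shape)   # (n_img, d)
```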