Amir Bar
@_amirbar
Postdoc at Meta (FAIR). Prev: PhD at TAU and Berkeley AI Research.
I'm observing a mini Moravec's paradox within robotics: gymnastics that are difficult for humans are much easier for robots than "unsexy" tasks like cooking, cleaning, and assembling. This creates cognitive dissonance for people outside the field: "so, robots can parkour &…
thread on the new paper: The Serial Scaling Hypothesis. Joint work with @phizaz, @YutongBAI1002, Kananart
I'm presenting a poster at #ICML2025 today! Stop by if you want to learn how VLMs encode the same task when it is presented in different modalities (spoiler: the representations are the same). 🌐 icml.cc/virtual/2025/p… 🔗 vlm-cross-modal-reps.github.io cc @_amirbar @trevordarrell
🚨 Excited to announce our ICCV 2025 Workshop: Reliable and Interactive World Model (RIWM 2025) — Call for Papers is now OPEN, and the official website is live! 🌐 🌍 RIWM 2025 explores how to build world models with geometric and physical reliability and strong interactive…
Check out PEVA 🌎, our recent attempt to build a world model for human body control.
What would a World Model look like if we start from a real embodied agent acting in the real world? It has to have: 1) A real, physically grounded and complex action space—not just abstract control signals. 2) Diverse, real-life scenarios and activities. Or in short: It has to…
World models are such an interesting topic. Really fun discussion about how they can be used for navigation with @_amirbar
Ep#15 with @_amirbar on Navigation World Models amirbar.net/nwm/ Co-hosted by @chris_j_paxton & @micoolcho
heading to Nashville to attend @CVPR tomorrow. looking forward to meeting old & new friends and chatting about #WorldModels
When vision-language models answer questions, are they truly analyzing the image or relying on memorized facts? We introduce Pixels vs. Priors (PvP), a method to control whether VLMs respond based on input pixels or world knowledge priors. [1/5]
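The kind of probe described above can be illustrated with a tiny script: show a VLM an image whose pixels contradict common world knowledge and check whether the answer follows the pixels or the prior. This is a conceptual sketch only, not the PvP method or code; the model checkpoint, prompt format, and image file below are placeholders I chose.

```python
# Conceptual sketch of a pixels-vs-priors style probe -- NOT the PvP method/code.
# The VLM checkpoint, prompt format, and image path are placeholder assumptions.
import torch
from PIL import Image, ImageOps
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # placeholder VLM, not from the paper
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(model_id)

# Counterfactual image: invert the colors of a strawberry photo so the pixels
# (cyan/green fruit) contradict the world-knowledge prior (strawberries are red).
image = ImageOps.invert(Image.open("strawberry.jpg").convert("RGB"))  # local placeholder photo

prompt = "USER: <image>\nWhat color is the strawberry? Answer with one word. ASSISTANT:"
inputs = processor(images=image, text=prompt, return_tensors="pt")
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=5)
answer = processor.decode(out[0], skip_special_tokens=True).split("ASSISTANT:")[-1].strip().lower()

# Did the model follow the pixels or its prior?
if "red" in answer:
    print("prior-consistent answer:", answer)
elif any(c in answer for c in ("cyan", "green", "blue", "teal")):
    print("pixel-consistent answer:", answer)
else:
    print("other:", answer)
```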
Make sure to check out Hanwen's (@hanwenjiang1) latest work! 🚀 We introduce RayZer, a self-supervised model for novel view synthesis. We use zero 3D supervision, yet we outperform supervised methods! Some surprising and exciting results inside! 🔍🔥
Supervised learning has held 3D Vision back for too long. Meet RayZer — a self-supervised 3D model trained with zero 3D labels: ❌ No supervision of camera & geometry ✅ Just RGB images And the wild part? RayZer outperforms supervised methods (as 3D labels from COLMAP are noisy)…
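A minimal sketch of the training signal described above, as I read it (not RayZer's actual architecture): split each scene's RGB images into context and held-out target views, predict the target view from the context, and supervise with a photometric loss only, with no camera poses, depth, or other 3D labels. All names below are made up for illustration.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyViewPredictor(nn.Module):
    """Toy stand-in for a pose-free view model: maps N context views to one predicted view."""
    def __init__(self, n_context=3):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3 * n_context, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(),
            nn.Conv2d(64, 3, 3, padding=1),
        )

    def forward(self, context_views):           # (B, N, 3, H, W)
        b, n, c, h, w = context_views.shape
        x = context_views.reshape(b, n * c, h, w)
        return self.net(x)                      # (B, 3, H, W) predicted target view

model = TinyViewPredictor()
opt = torch.optim.Adam(model.parameters(), lr=1e-4)

# Fake scene: 4 RGB views of the same scene; hold one out as the target.
views = torch.rand(2, 4, 3, 64, 64)
context, target = views[:, :3], views[:, 3]

pred = model(context)
loss = F.mse_loss(pred, target)   # photometric loss only -- no cameras, no depth, no 3D labels
loss.backward()
opt.step()
```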
Need a strong feature extractor for your upcoming NeurIPS paper? we got you 😉
We are open-sourcing all the models in Web-SSL, from ViT-L to ViT-7B! It was super fun to train and play with these massive ViTs. Models: huggingface.co/collections/fa… Github: github.com/facebookresear… Huge credit to @DavidJFan for putting these models together!
Our code & pretrained models: github.com/facebookresear…
New paper from FAIR+NYU: Q: Is language supervision required to learn effective visual representations for multimodal tasks? A: No. ⬇️⬇️⬇️
WORLDMEM: Adding memory to world models
Thanks for sharing! @_akhaliq For more information: 📜ArXiv: arxiv.org/abs/2504.12369 🤗 Hugging Face: huggingface.co/papers/2504.12… 🌐 xizaoqu.github.io/worldmem/ 🧑💻 GitHub: github.com/xizaoqu/WorldM… 🚀 Demo: huggingface.co/spaces/yslan/w…
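The idea named in the title, adding memory to a world model, can be sketched as a buffer of past frames keyed by the agent's state, retrieved when the agent revisits a nearby state so long-horizon generations stay consistent. This is a conceptual toy, not WORLDMEM's actual mechanism; every name below is invented for illustration.

```python
import torch

class FrameMemory:
    """Toy memory bank: stores past frames keyed by a state/pose vector and
    returns the closest stored frames for the current state."""
    def __init__(self):
        self.keys, self.frames = [], []

    def write(self, state, frame):
        self.keys.append(state)
        self.frames.append(frame)

    def read(self, state, k=2):
        if not self.keys:
            return []
        keys = torch.stack(self.keys)               # (M, D)
        dists = torch.cdist(state[None], keys)[0]   # (M,) distances to stored states
        idx = torch.topk(-dists, k=min(k, len(self.keys))).indices
        return [self.frames[i] for i in idx]

memory = FrameMemory()
for t in range(10):
    state = torch.randn(4)            # e.g. pose of the agent at step t
    frame = torch.rand(3, 64, 64)     # frame produced by the world model at step t
    retrieved = memory.read(state)    # frames from previously visited, nearby states
    # a memory-conditioned world model would attend over `retrieved` when predicting the next frame
    memory.write(state, frame)
```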
Excited to share that our paper on Navigation World Models was selected for an Oral presentation at CVPR! Code & models: github.com/facebookresear… huggingface.co/facebook/nwm
Happy to share our new work on Navigation World Models! 🔥🔥 Navigation is a fundamental skill of agents with visual-motor capabilities. We train a single World Model across multiple environments and diverse agent data. w/ @GaoyueZhou, Danny Tran, @trevordarrell and @ylecun.
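At a high level, a navigation world model like the one described above takes the current visual observation plus a navigation action and predicts the next observation, which can be rolled out autoregressively to imagine the outcome of a candidate plan. Here is a hedged toy sketch of that interface; it is not the released NWM code or API, and the shapes and action encoding are arbitrary.

```python
import torch
import torch.nn as nn

class ToyNavWorldModel(nn.Module):
    """Toy interface: (current frame, action) -> predicted next frame."""
    def __init__(self, action_dim=3):
        super().__init__()
        self.action_proj = nn.Linear(action_dim, 64 * 64)
        self.net = nn.Sequential(
            nn.Conv2d(4, 32, 3, padding=1), nn.ReLU(),
            nn.Conv2d(32, 3, 3, padding=1),
        )

    def forward(self, frame, action):             # frame: (B,3,64,64), action: (B,action_dim)
        a = self.action_proj(action).view(-1, 1, 64, 64)
        return self.net(torch.cat([frame, a], dim=1))

wm = ToyNavWorldModel()
frame = torch.rand(1, 3, 64, 64)
plan = [torch.tensor([[1.0, 0.0, 0.0]]),          # e.g. "go forward"
        torch.tensor([[0.0, 1.0, 0.0]])]          # e.g. "turn left"

# Roll the model forward through the candidate plan, imagining future observations.
with torch.no_grad():
    for action in plan:
        frame = wm(frame, action)
print(frame.shape)
```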
FAIR is probably the only lab outside of academia where research projects can start like this.
[7/8] This side project started in October when @TongPetersb, @_amirbar, and I were thinking about the rise of CLIP as a popular vision encoder for MLLMs. The community often assumes that language supervision is the primary reason for CLIP's strong performance. However, we…