Prior @ AI2
@Ai2Prior
Tackling the boldest computer vision problems @allen_ai
We’re presenting SAM2Act at #ICML! Come check out the many amazing projects from AI2, and stop by to chat with us and learn more about our work.
This week is #ICML in Vancouver, and a number of our researchers are participating. Here's the full list of Ai2's conference engagements—we look forward to connecting with fellow attendees. 👋
Excited to present our work at #ICML next week! Always happy to chat about all things 🔥 in Robotics and AI. I’ll also be on the academic job market this coming year — would love to connect about any potential opportunities!
Can we build a generalist robotic policy that doesn’t just memorize training data and regurgitate it during test time, but instead remembers past actions as memory and conditions its decisions on them?🤖💡 Introducing SAM2Act—a multi-view robotic transformer-based policy that…
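The core idea, very roughly: the policy conditions each action on a rolling buffer of its own past observations and actions instead of acting purely reactively. A minimal, illustrative sketch of that memory-conditioning pattern (not SAM2Act's actual multi-view transformer architecture; the buffer size, feature dimensions, and random projection are placeholders):

```python
# Toy memory-conditioned policy: act on the current observation plus a rolling
# buffer of past (observation, action) pairs. Placeholder math only; the real
# SAM2Act policy is a multi-view transformer, not a fixed random projection.
from collections import deque
import numpy as np

class MemoryConditionedPolicy:
    def __init__(self, obs_dim=16, action_dim=7, horizon=8, seed=0):
        self.obs_dim, self.action_dim = obs_dim, action_dim
        self.memory = deque(maxlen=horizon)            # rolling memory buffer
        rng = np.random.default_rng(seed)
        # Stand-in for a trained model: a fixed linear map over [obs, history].
        self.proj = rng.standard_normal((action_dim, obs_dim + obs_dim + action_dim))

    def act(self, observation):
        # Pool remembered steps into one history vector; a real policy would
        # attend over them rather than averaging.
        if self.memory:
            history = np.mean([np.concatenate([o, a]) for o, a in self.memory], axis=0)
        else:
            history = np.zeros(self.obs_dim + self.action_dim)
        action = self.proj @ np.concatenate([observation, history])
        self.memory.append((observation.copy(), action.copy()))
        return action

policy = MemoryConditionedPolicy()
for step in range(3):
    print(step, policy.act(np.zeros(16))[:3])          # dummy observations
```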
It’s incredible to have both your advisors at the same company! With @fox_dieter17849 building the Robotics team, and @RanjayKrishna leading PRIOR, @allen_ai is set to become a powerhouse in robotics, computer vision, and embodied AI for open science research. Excited to be part…
Talent density only going up and to the right at Ai2. Let's keep pushing.
🚨Tired of binary pass/fail metrics that miss the bigger picture? 🤖Introducing #RoboEval — an open benchmark that shows *how* robot manipulation policies behave and *why* they fail, not just *if* they succeed. 🧵1/n 🔗 robo-eval.github.io 📄 robo-eval.github.io/media/RoboEval…
🥳 Excited to share that I’ll be joining the CS Department at UNC-Chapel Hill (@unccs @unc_ai_group) as an Assistant Professor starting Fall 2026! Before that, I’ll be working at Ai2 Prior (@allen_ai @Ai2Prior) and UW (@uwcse) on multimodal understanding and generation.
Our Molmo work won Best Paper Honorable Mention at #CVPR2025! This large project was one of my best experiences with a fantastic team!
I am doing something silly by testing whether I can remember and deliver multiple talks on the same day on different slices of my group’s research. If you are at #CVPR2025 on June 11th, come to one or all of them :D 9:05am: Behaviors & bodies: how they shape one another…
Following up on our work on Molmo: Molmo points, but how can those points power real-world robotics? Introducing GraspMolmo, a VLM that plugs seamlessly into robotic systems to generate semantically meaningful grasp poses from natural language commands. 👉 abhaybd.github.io/GraspMolmo/
How should a robot hold a water bottle? 🤔 That depends: is it opening it, or passing it to you? I’m excited to introduce GraspMolmo, a VLM that predicts semantically appropriate grasps based on your command! Website: abhaybd.github.io/GraspMolmo/ 🧵 Thread ↓
Building on our work with Molmo, we’re excited to introduce GraspMolmo — a vision-language model that predicts semantically meaningful grasps conditioned on natural language. A fantastic effort led by our PYI, @ab_deshpande!
Excited to be at #CVPR2025 in Nashville! 🎉 I’m presenting a demo paper with real-world robot demos and co-organizing two workshops: Robo 3D VLM and Generalization for Robotic Manipulation. Let’s connect if you’re into 🔥 Robotics + AI — and don’t miss our stacked speaker…
Let us know how good Molmo is at language-guided pointing 👈 Vote here 👇
Point-Battle is now live! Vote or submit your multimodal model and see how it stacks up in language-guided pointing and grounded visual reasoning—let the community decide which MLLM really hits the mark. We will also open-source all data for training MLLMs for pointing later on.…
Great to see Molmo leading on pointing👉
👉 Pointing is our first “language”—babies master it before words. Precise spatial grounding powers robotics, assistive tech, HCI, and vision-language interfaces. 🤔 But can today's MLLMs point with pixel-level accuracy and truly ground visual reasoning?📷We introduce PointArena,…
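For context on what pixel-level pointing accuracy means here: a common way to score language-guided pointing is to count a prediction as correct only if the predicted pixel lands inside a ground-truth mask for the referred object. A toy scorer in that spirit (illustrative only, not necessarily PointArena's exact protocol):

```python
# Toy scorer for language-guided pointing: a prediction counts as correct if
# the predicted pixel falls inside the ground-truth mask of the referred object.
import numpy as np

def pointing_accuracy(points, masks):
    """points: list of (x, y) pixel coordinates; masks: list of HxW boolean arrays."""
    hits = 0
    for (x, y), mask in zip(points, masks):
        h, w = mask.shape
        hits += 0 <= y < h and 0 <= x < w and bool(mask[int(y), int(x)])
    return hits / len(points)

# Tiny example: one correct point, one miss.
mask = np.zeros((4, 4), dtype=bool)
mask[1, 1] = True
print(pointing_accuracy([(1, 1), (3, 3)], [mask, mask]))  # 0.5
```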
✨ Introducing MutaGReP (Mutation-guided Grounded Repository Plan Search) - an approach that uses LLM-guided tree search to find realizable plans that are grounded in a target codebase without executing any code! Ever wanted to provide an entire repo containing 100s of 1000s of…
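Schematically, the search described above can be pictured as: keep a frontier of candidate plans, mutate the most promising one (in MutaGReP the mutations come from an LLM), ground each plan step to symbols retrieved from the repository, and score plans by how well their steps ground, all without running any code. A heavily simplified sketch with placeholder mutate/ground/score functions (stand-ins for illustration, not the paper's implementation):

```python
# Schematic mutation-guided plan search: best-first search over plans, where
# each step is checked for grounding against repository symbols.
import heapq, itertools, random

REPO_SYMBOLS = ["load_dataset", "train_model", "evaluate_model", "save_checkpoint"]

def mutate(plan):
    # Stand-in for an LLM proposing edited versions of a plan.
    step = random.choice(["load data", "train model", "evaluate model", "tune hyperparams"])
    return [plan + [step], plan[:-1] + [step]]

def ground(step):
    # Stand-in for retrieving repo symbols relevant to a plan step.
    return [s for s in REPO_SYMBOLS if s.split("_")[0] in step]

def plan_score(plan):
    # Realizability proxy: fraction of steps that ground to some symbol.
    return sum(bool(ground(s)) for s in plan) / max(len(plan), 1)

def search(budget=50):
    tie = itertools.count()                     # tie-breaker for the heap
    frontier = [(-plan_score(["load data"]), next(tie), ["load data"])]
    best = ["load data"]
    for _ in range(budget):
        _, _, plan = heapq.heappop(frontier)
        for child in mutate(plan):
            heapq.heappush(frontier, (-plan_score(child), next(tie), child))
            if plan_score(child) > plan_score(best):
                best = child
    return best

random.seed(0)
print(search())
```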
Love the evolution of this research thread: 2015 - Neural Module Networks (NMN) by @jacobandreas et al. was my introduction to neuro-symbolic reasoning in grad school. Super exciting approach but program synthesis and neural modules were both brittle back then. 2022 - GPT3 and…
🚀 Many breakthroughs in computer vision have come from large-scale benchmarks & challenges like ImageNet, MS COCO, and WILDS. 🤖⚡ Standardizing benchmarks for robotic manipulation has been challenging, but with the rise of generalist robotic policies, evaluating their…
🎉📢Exciting news! Join us at the inaugural @CVPR workshop on 3D Vision Language Models (VLMs) for Robotic Manipulation: Opportunities and Challenges, happening on June 11, 2025, in Nashville, TN. Explore how 3D perception can be integrated into robotic manipulation in the…
🚨 Why do robots fail under out-of-distribution perturbations? Can we diagnose these failures in advance—and 'prescribe' the right data to fix them? 🚨 Our new paper, RoboMD, introduces a systematic framework for diagnosing and improving robot manipulation policies. 🤖💡…
Check out this new work from our student researcher, @DJiafei! Memory is important for both navigation and manipulation policies.
Here is Tülu 3 405B 🐫, our open-source post-training model that surpasses the performance of DeepSeek-V3! The last member of the Tülu 3 family demonstrates that our recipe, which includes Reinforcement Learning from Verifiable Rewards (RLVR), scales to 405B - with performance on…
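For readers unfamiliar with the acronym: RLVR replaces a learned reward model with a programmatic check, so the policy is rewarded only when its output can be verified as correct (for example, an exact-match math answer or a satisfied constraint). A minimal sketch of such a verifiable reward (illustrative; the full Tülu 3 recipe also includes SFT and preference tuning stages):

```python
# Toy "verifiable reward": 1.0 only when the model's answer passes a
# programmatic check, here a normalized exact match against a known answer.
def verifiable_reward(model_answer: str, ground_truth: str) -> float:
    normalize = lambda s: s.strip().lower().rstrip(".")
    return 1.0 if normalize(model_answer) == normalize(ground_truth) else 0.0

print(verifiable_reward("42.", "42"))        # 1.0 -> reinforce this rollout
print(verifiable_reward("forty-two", "42"))  # 0.0 -> no reward signal
```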