Rahul Gupta

@rahul1987iit

Responsible AI @Amazon AGI (Nova models) | Organizer @TrustNLP | Organizer @UnlearningSEM

Joined February 2011

271 Following

147 Followers

Pinned

Rahul Gupta@rahul1987iit · Jul 14

Excited to share our safety framework evaluations with the community! See full thread from the first author below. @AmazonScience #AmazonNovaModels #ResponsibleAI

Satyapriya Krishna@SatyaScribbles · Jul 14

🚨 Excited to share the first frontier risk evaluation report for Amazon Nova Premier: “Evaluating the Critical Risks of Amazon’s Nova Premier under the Frontier Model Safety Framework”! This is the first comprehensive evaluation of Nova Premier’s frontier safety, aligned with…

Rahul Gupta@rahul1987iit · Jul 22

Looking forward to the winner announcements!

Rohit Prasad@RohitPrasadAI · Jul 22

A fun look at the Nova AI Challenge finals. Winners announced tomorrow! 🏆

Rahul Gupta Retweeted

Amazon Science@AmazonScience · Jul 16

Nova Act is now ⚡️ enterprise ready ⚡️ and we've added new capabilities to our preview to help you take your prototype to production, with 90%+ reliability across our early enterprise customer use cases!

Rahul Gupta Retweeted

Sundar Pichai@sundarpichai · Jul 15

More details on Big Sleep and our latest security work: blog.google/technology/saf…

Rahul Gupta Retweeted

Mahavir@Mahavir_Dabas18 · Jul 12

🎉 Thrilled to be presenting my first paper at @icmlconf! "Just Enough Shifts: Mitigating Over-Refusal in Aligned Language Models with Targeted Representation Fine-Tuning" We introduce ACTOR—a lightweight, activation-based training method that reduces over-refusal without…

Rahul Gupta@rahul1987iit · Jul 8

Yet another fruitful collaboration between the LIME@USC lab and the NovaRAI team! @jieyuzhao11, @linxins2

Linxin Song@linxins2 · Jul 8

Our paper just got accepted to COLM 2025! See you in Montreal for further discussion on how to discover LLM deficiencies! #colm2025

Rahul Gupta Retweeted

Daniel Kang@daniel_d_kang · Jul 8

As AI agents near real-world use, how do we know what they can actually do? Reliable benchmarks are critical but agentic benchmarks are broken! Example: WebArena marks "45+8 minutes" on a duration calculation task as correct (real answer: "63 minutes"). Other benchmarks…

Rahul Gupta Retweeted

机器之心 JIQIZHIXIN@jiqizhixin · Jul 4

If you're building a benchmark for AI agents, you absolutely need this groundbreaking paper from UIUC, Stanford University, and other leading institutions. The researchers introduce ABC—the Agentic Benchmark Checklist—a practical, actionable guide for creating reliable and…

Rahul Gupta Retweeted

Pascale Fung@pascalefung · Jul 4

Our research on embodied AI agents that can perceive, learn, act and interact in the virtual and physical worlds. #metaAI #AIAgent #embodied #worldmodel #superintelligemce arxiv.org/abs/2506.22355

Rahul Gupta@rahul1987iit · Jul 3

Another launch from the Nova team!

Rohit Prasad@RohitPrasadAI · Jul 3

Just launched virtual try-on in Amazon Nova Canvas! This gives retailers a powerful way to help shoppers see products as they’d really look – from trying on different shirt colors to previewing how a coffee table fits in their living room. Learn more: aws.amazon.com/blogs/aws/amaz…

Rahul Gupta@rahul1987iit · Jun 30

First author's take on kaleidoscopic teaming!

Ninareh Mehrabi@NinarehMehrabi · Jun 29

Second update of the week: Check out our recent work where we coin the term kaleidoscopic teaming. What is kaleidoscopic teaming, you might ask?
