Salman

@salman1422571

CS PhD @UCLA | prev @nyuniversity 🏙️, Intern @Apple @Amazon AGI | ML/NLP |

Joined February 2025

7Following

24Followers

Pinned

Salman@salman1422571 · Apr 22

🚨 Excited to share our new paper on 𝕏-Teaming! 🤖 Multiagent system for multiturn jaibreaking 🔍 96.2% attack success against Claude 3.7 (immune to single-turn attacks!) 💥 Upto 98.1% attack success on leading model 🛡️ Released 30K safety dataset 🧵below #AI #LLMSafety

salman1422571's tweet image. 🚨 Excited to share our new paper on 𝕏-Teaming!

🤖 Multiagent system for multiturn jaibreaking

🔍 96.2% attack success against Claude 3.7 (immune to single-turn attacks!)

💥 Upto 98.1% attack success on leading model

🛡️ Released 30K safety dataset

🧵below
#AI #LLMSafety

5.0K

Salman Retweeted

Genglin Liu@genglin_liu · Apr 20

Excited to share my first project at UCLA! We built MOSAIC — a social network simulator where LLM-powered agents behave like real users on social media. They post, share, flag, and debate the factuality of news content — all at scale. It’s open-source. 🧵 TL;DR 🌐 Realistic…

143

15.0K

Salman Retweeted

Stanford NLP Group@stanfordnlp · Feb 25

For this week’s NLP Seminar, we are thrilled to host @GabrielSaadia to talk about Simulating Emergent LLM Social Behaviors in Multi-agent Systems! When: 2/27 Thurs 11am PT Non-Stanford affiliates registration form (closed at 9am PT on the talk day): forms.gle/Jpk7XeR317a2Ei…

5.0K