Salman
@salman1422571
CS PhD @UCLA | prev @nyuniversity 🏙️, Intern @Apple @Amazon AGI | ML/NLP
🚨 Excited to share our new paper on 𝕏-Teaming! 🤖 Multi-agent system for multi-turn jailbreaking 🔍 96.2% attack success against Claude 3.7 (immune to single-turn attacks!) 💥 Up to 98.1% attack success on leading model 🛡️ Released 30K safety dataset 🧵below #AI #LLMSafety

Excited to share my first project at UCLA! We built MOSAIC — a social network simulator where LLM-powered agents behave like real users on social media. They post, share, flag, and debate the factuality of news content — all at scale. It’s open-source. 🧵 TL;DR 🌐 Realistic…
For this week’s NLP Seminar, we are thrilled to host @GabrielSaadia to talk about Simulating Emergent LLM Social Behaviors in Multi-agent Systems! When: 2/27 Thurs 11am PT Non-Stanford affiliates registration form (closed at 9am PT on the talk day): forms.gle/Jpk7XeR317a2Ei…