Eugene Bagdasarian

@ebagdasa

Challenge AI security and privacy practices. Asst Prof at UMass @manningcics. Researcher at @GoogleAI. he/him 🇦🇲 (opinions mine)

Amherst, MA

Joined April 2014

617Following

1KFollowers

Pinned

Eugene Bagdasarian@ebagdasa · Feb 10

Nerd sniping is probably the coolest description of this phenomena ( @woj_zaremba et al described it recently), but in our case overthinking didn't lead to any drastic consequences besides higher costs.

SSéb Krier@sebkrier · Feb 10

Ha! You can nerdsnipe reasoning models with decoy problems to make them overthink and slow them down/make them more expensive to run. arxiv.org/abs/2502.02542

703

Pinned

Eugene Bagdasarian@ebagdasa · Feb 7

How Sudokus can waste your money? If you are using reasoning LLMs with public data, adversaries could pollute it with nonsense (but perfectly safe!) tasks that will slow down reasoning and amplify overheads 💰 (as you pay but not see reasoning tokens) while keeping answers intact

JJaechul Roh@JaechulRoh · Feb 7

🧠💸 "We made reasoning models overthink — and it's costing them big time." Meet 🤯 #OVERTHINK 🤯 — our new attack that forces reasoning LLMs to "overthink," slowing models like OpenAI's o1, o3-mini & DeepSeek-R1 by up to 46× by amplifying number of reasoning tokens. 🛠️ Key…

1.0K

Eugene Bagdasarian@ebagdasa · Jun 6

Filtering names w LLMs is easy, right? Plenty of privacy solutions out there claiming how well things work. However, our paper led by @dzungvietpham shows that things get tricky once we go to rare names in ambiguous contexts -- which could result in real harm if overlooked.

DDzung Pham@dzungvietpham · Jun 2

🙋 Can LLMs reliably detect PII such as person names? ‼️ Not really, especially if the context has ambiguity. 🖇️ Our work shows that LLMs can struggle to recognize person names in barely ambiguous contexts.

502

Eugene Bagdasarian@ebagdasa · May 16

Thanks @niloofar_mire for moderating the session 😀! Thanks @EarlenceF, @jhasomesh , @christodorescu for organizing this awesome SAGAI workshop (and also inviting me, haha)!

NNiloofar (✈️ ACL)@niloofar_mire · May 15

Join us at the SAGAI workshop @IEEESSP, @ebagdasa is talking about contextual integrity and security for AI agents!

1.0K

Eugene Bagdasarian Retweeted

earlence@EarlenceF · Apr 18

Our @IEEESSP SAGAI workshop on systems-oriented security for AI agents has speaker details (abs/bio) on the website now: sites.google.com/ucsd.edu/sagai… We look forward to seeing you in San Francisco on May 15! As a reminder, we are running this "Dagstuhl" style - real discussions.

8.0K

Eugene Bagdasarian@ebagdasa · Apr 20

I am looking for a postdoc to work on multi-agent safety problems, if you are interested or know anyone let me know: forms.gle/NFuYLKj53fVwdW…

13.0K

Eugene Bagdasarian@ebagdasa · Apr 9

Amazing forward-looking paper on how collaboration could be done where you and I have different perspectives.

AAaron Roth@Aaroth · Apr 9

Suppose you and I both have different features about the same instance. Maybe I have CT scans and you have physician notes. We'd like to collaborate to make predictions that are more accurate than possible from either feature set alone, while only having to train on our own data.

2.0K

Eugene Bagdasarian Retweeted

Nando Fioretto@nandofioretto · Mar 2

The Privacy Preserving AI workshop is back! And is happening on Monday. I am excited about our program and lineup of invited speakers! I hope to see many of you there: ppai-workshop.github.io

2.0K

Eugene Bagdasarian Retweeted

Egor Zverev @ACL2025@egor_zverev_ai · Feb 20

(1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: 𝐋𝐋𝐌𝐬’ 𝐢𝐧𝐚𝐛𝐢𝐥𝐢𝐭𝐲 𝐭𝐨 𝐬𝐞𝐩𝐚𝐫𝐚𝐭𝐞 𝐢𝐧𝐬𝐭𝐫𝐮𝐜𝐭𝐢𝐨𝐧𝐬 𝐟𝐫𝐨𝐦 𝐝𝐚𝐭𝐚 𝐢𝐧 𝐭𝐡𝐞𝐢𝐫 𝐢𝐧𝐩𝐮𝐭 ✅ Definition of separation 👉 SEP Benchmark 🔍 LLM evals on SEP

7.0K

Eugene Bagdasarian@ebagdasa · Feb 18

Amazing opportunity to do ground breaking work in LLMs!

YYoav Artzi@yoavartzi · Feb 18

We now have a form for postdoc applications: forms.gle/tiydAChgV1wLcQ… I am looking at candidates on a rolling basis, so while there's no deadline, there's an advantage of throwing your name in the ring earlier than later

885