M
Matan Ben-Tov
@matanbt
CS PhD student in Computer Science @TelAvivUni. Interested in buzzwords like AI and Security, and wherever they meet.
Joined October 2009
1KFollowing
78Followers
Pinned
M
Matan Ben-Tov@matanbt · Jun 18
What makes or breaks powerful jailbreak suffixes? 🔓🤖 We find that: 🥷 they work by hijacking the model’s context; ♾️ the more universal a suffix is the stronger its hijacking; ⚔️🛡️ utilizing these insights, it is possible to both enhance and mitigate these attacks. 🧵

2
12
43
23
5.0K
Matan Ben-Tov Retweeted
N
Noam Bizan@NoamBizan · Jun 27
Russia-North Korea ties are expanding... read my article on the security implications of this relationship for #Geopolitics #RussianUkrainewar 🇷🇺🇰🇵 geopolreport.com/reports/the-st…
0
2
1
0
179