Kai Chen
@kaichen23
Ph.D. student @USCViterbi @USC_ISI. Computational Social Science & Natural Language Processing.
Excited for #EMNLP2024 in Miami 🌴 next week! Join me on Nov 12 (Tue) from 16:00 to 17:30 in Poster Session 4 for this paper. Feel free to reach out!
🤔How susceptible are LLMs to Ideological Manipulation?⭕️ 🧐We find a concerning vulnerability: only a small amount of ideologically driven samples significantly alters the ideology of LLMs.🤖⚠️ 🔗arxiv.org/abs/2402.11725 @ZihaoHe95 @jun_yannn @taiwei_shi @KristinaLerman 🍀1/4
Our work "How Susceptible are Large Language Models to Ideological Manipulation?" was accepted by #EMNLP2024 Main Conference🥳. Huge thanks to my collaborators!
Excited for #NAACL2024 in Mexico 🇲🇽 next week! Join me on June 19 from 11:00 AM to 12:30 PM in DON ALBERTO 1 for my talk on Safer-Instruct. Let's dive into alignment, synthetic data, and more!
Excited to get Safer-Instruct accepted to NAACL 2024 🥳! You don’t want to miss it if you want to reduce cost and boost efficiency in preference data acquisition 🚀. Check out our framework and dataset here: maksimstw.github.io/papers/saferin…
Honored to receive the 🏆 𝐛𝐞𝐬𝐭 𝐩𝐚𝐩𝐞𝐫 𝐫𝐮𝐧𝐧𝐞𝐫-𝐮𝐩 at the ICLR SeT LLM workshop! I will be giving a talk on this work on May 11th, 15:30, Schubert 6. Let's talk about AI Safety there! 🔐 Paper: arxiv.org/abs/2402.11725 Event: set-llm.github.io
🥳Exciting News! Our work, 🤖"How Susceptible are Large Language Models to Ideological Manipulation?" got 🏆𝐁𝐞𝐬𝐭 𝐏𝐚𝐩𝐞𝐫 𝐑𝐮𝐧𝐧𝐞𝐫-𝐮𝐩 at SET LLM #ICLR Workshop. Check our work here: arxiv.org/abs/2402.11725 Check the workshop here: set-llm.github.io
🤔Enhancing LLMs with RLHF is powerful, but have you ever wondered how to reduce costs and boost efficiency in preference data acquisition? 💰 🚀Introducing Safer-Instruct, a pipeline that complements human annotators to construct large-scale preference datasets efficiently. 🧵1/5