jessica dai
@jessicadai_
phd student @berkeley_ai !? also editorial @reboot_hq @kernel_magazine (she/her)
individual reporting for post-deployment evals — a little manifesto (& new preprints!) tldr: end users have unique insights about how deployed systems are failing; we should figure out how to translate their experiences into formal evaluations of those systems.
I’m rebelling against ChatGPT emdash hegemony — we must reclaim the emdash — it does not belong to the LLMs — it belongs to us!
kernel launches in two days! help me get these magazines out of my house!
just had 500 magazines delivered to my door :)
postering today with @paula_gradu at 4:30 (East A-B E-1202) 😀 come say hi
Being a grad student is peak human existence. The end goal of all political and technological progress should be allowing everyone to be a grad student forever
There's been an idea floating around policy spaces for some time now (e.g., the UN AI Advisory report, the California Frontier AI report) around the need for "AI monitoring", "AI incidents", "AI adverse event reporting", etc. Jess is now seriously thinking about how to operationalize this:
we're launching a new issue of @kernel_magazine in two weeks!! join us for a launch party in SF @ Gray Area on July 17!
What more could we understand about the fractal, “jagged” edges of AI system deployments if we had better ways to listen to the people who interact with them? What a joy to work w @jessicadai_ using individual experiences to inform AI evaluation (blog/ICML/arXiv links 👇)
Newest @reboot_hq 🎙️post: @jessicadai_ and I discuss forecasting, and how people present unhelpful narratives about the future (mostly by picking on AI 2027, sorry guys) Why we should view the future as constructed, not predicted
RLHF fine-tunes to a “mythical user” via aggregated feedback—but what if that user represents no one? Excited to share a new paper with @paulgoelz and @KunheYang “Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?” #AIAlignment #PluralisticAI #LLMs
In LLM land, a slow model is annoying. In robotics, a slow model can be disastrous! Visible pauses at best, dangerously jerky motions at worst. But large VLAs are slow by nature. What can we do about this? An in-depth 🧵:
Why RLHF stopped us from getting r1/o1 sooner and why we elected Trump/Biden/[Politician You Dislike]: We can use elections 🇺🇸🇺🇸🇺🇸 to understand why RLHF naturally suppresses reasoning (even if you fix the whole "RLHF isn't really RL" thing)