Ilya Sutskever
@ilyasut
SSI @SSI
I sent the following message to our team and investors: — As you know, Daniel Gross’s time with us has been winding down, and as of June 29 he is officially no longer a part of SSI. We are grateful for his early contributions to the company and wish him well in his next…
And congratulations to @demishassabis and John Jumper for winning the Nobel Prize in Chemistry!!
Congratulations to @geoffreyhinton for winning the Nobel Prize in Physics!!
Mountain: identified. Time to climb
SSI is building a straight shot to safe superintelligence. We’ve raised $1B from NFDG, a16z, Sequoia, DST Global, and SV Angel. We’re hiring: ssi.inc
We're announcing, together with @ericschmidt: Superalignment Fast Grants. $10M in grants for technical research on aligning superhuman AI systems, including weak-to-strong generalization, interpretability, scalable oversight, and more. Apply by Feb 18! openai.com/blog/superalig…
RLHF works great for today's models. But aligning future superhuman models will present fundamentally new challenges. We need new approaches + scientific understanding. New researchers can make enormous contributions—and we want to fund you! Apply by Feb 18!
My view is that what makes super-alignment "super" is ensuring we can safely scale the capabilities of AIs even though we can't scale their human supervisors. For this, it is imperative to study the "weak teacher, strong student" setting. The paper shows great promise in this area!
New OpenAI paper: Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision. Paper: cdn.openai.com/papers/weak-to… Blog: openai.com/research/weak-… Widely used alignment techniques, such as reinforcement learning from human feedback (RLHF), rely on the ability of…
i'd particularly like to recognize @CollinBurns4 for today's generalization result. he came to openai excited to pursue this vision and helped get the rest of the team excited about it!
Large pretrained models have excellent raw capabilities—but can we elicit these fully with only weak supervision? GPT-4 supervised by ~GPT-2 recovers performance close to GPT-3.5 supervised by humans—generalizing to solve even hard problems where the weak supervisor failed!
new paper! one reason aligning superintelligence is hard is because it will be different from current models, so doing useful empirical research today is hard. we fix one major disanalogy of previous empirical setups. I'm excited for future work making it even more analogous.
In the future, humans will need to supervise AI systems much smarter than them. We study an analogy: small models supervising large models. Read the Superalignment team's first paper showing progress on a new approach, weak-to-strong generalization: openai.com/research/weak-…
New direction for AI alignment — weak-to-strong generalization. Promising initial results: we used outputs from a weak model (fine-tuned GPT-2) to communicate a task to a stronger model (GPT-4), resulting in intermediate (GPT-3-level) performance.
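The setup described above (fine-tune a strong model on a weak model's noisy labels and see whether it generalizes past its teacher) can be illustrated with a toy sketch. This is an assumption-laden simplification, not the paper's implementation: here the "weak supervisor" is a labeler that is right only 70% of the time, and the "strong student" is a threshold classifier whose hypothesis class happens to match the true task, so fitting the weak labels recovers accuracy well above the teacher's.

```python
import random

random.seed(0)

# Toy weak-to-strong sketch (illustrative only, not the paper's method).
# True task: "is x positive?"
xs = [random.uniform(-1, 1) for _ in range(2000)]
truth = [x > 0 for x in xs]

# Weak supervisor: labels correctly with probability 0.7 (symmetric noise).
weak = [t if random.random() < 0.7 else not t for t in truth]

# Strong student: choose the decision threshold that best agrees with the
# weak labels. Because its hypothesis class matches the task, agreement
# with noisy labels is maximized near the true boundary at 0.
candidates = [i / 100 for i in range(-100, 101)]
best = max(candidates,
           key=lambda c: sum((x > c) == w for x, w in zip(xs, weak)))
student = [x > best for x in xs]

# Accuracy against ground truth: the student generalizes past its teacher.
acc = lambda pred: sum(p == t for p, t in zip(pred, truth)) / len(truth)
print(f"weak supervisor accuracy: {acc(weak):.2f}")
print(f"strong student accuracy:  {acc(student):.2f}")
```

The point of the sketch is the gap between the two printed accuracies: the student, trained only on the weak supervisor's noisy outputs, ends up markedly more accurate than the supervisor itself, which is the qualitative phenomenon ("intermediate performance above the weak teacher") the tweet describes.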
Extremely excited to have this work out, the first paper from the Superalignment team! We study how large models can generalize from supervision of much weaker models.
Kudos especially to @CollinBurns4 for being the visionary behind this work, @Pavel_Izmailov for all the great scientific inquiry, @ilyasut for stoking the fires, @janhkirchner and @leopoldasch for moving things forward every day. Amazing ✨
Super excited about our new research direction for aligning smarter-than-human AI: We finetune large models to generalize from weak supervision—using small models instead of humans as weak supervisors. Check out our new paper: openai.com/research/weak-…