Divya Siddarth
@divyasiddarth
collective intelligence accelerationist @collect_intel
the thing about AI that people don't understand is that it's got all these risks. but also ! all these opportunities. not to mention the risks. but ! think of the opportunities. but the risks :( but the opportunit
It's cute that everyone was once the youngest person in the world
I replicated this result, that Grok focuses nearly entirely on finding out what Elon thinks in order to align with that, on a fresh Grok 4 chat with no custom instructions. grok.com/share/c2hhcmQt…
Grok 4 decides what it thinks about Israel/Palestine by searching for Elon's thoughts. Not a confidence booster in "maximally truth seeking" behavior. h/t @catehall. Screenshots are mine.
Thomas Jefferson’s rough draft copy of the Declaration of Independence
STAGGERING: This new study of 133 countries is the first to estimate the impact of all USAID’s work. In 2 decades, it saved *92M* lives. Current cuts, if not reversed, are forecast to cost *14M* lives thru 2030. thelancet.com/journals/lance…
individual reporting for post-deployment evals — a little manifesto (& new preprints!) tldr: end users have unique insights about how deployed systems are failing; we should figure out how to translate their experiences into formal evaluations of those systems.
I was planning to launch my substack on "Human, life, AI, and future" in a few months, with something very different. I’ve been working quietly on some exciting research about AI and the future of humanity—big questions, long arcs, and some surprising ideas I was excited to share…
This week, we learned 1 in TEN adults uses AI for emotional support daily - absolutely wild. Talked about it in the #ComputerSaysMaybe podcast. themaybe.org/podcast/the-co…
As we do societal evals at CIP (public health, AI relationships, democracy, etc.) across regional languages, we've spent a lot of time dealing with how brittle LLM judge pipelines are. Stoked to share an open-source test suite (blog + code) we’ve built to stress-test ours before…
It's not like we can make LLMs deterministic, but we can measure their quirks and design around them before deploying them in high-stakes settings. Let us know what you find: github.com/collect-intel/…
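The "measure their quirks" idea can be sketched in a few lines: re-run the same judging prompt many times and report how often the verdict deviates from the majority answer. This is a minimal illustration, not the linked repo's code; `flip_rate` and `noisy_judge` are hypothetical names, and the stubbed judge just simulates sampling noise in place of a real LLM call.

```python
import random

def flip_rate(judge, prompt, trials=20):
    """Run the same prompt repeatedly and measure how often the
    verdict disagrees with the majority verdict (0.0 = stable)."""
    verdicts = [judge(prompt) for _ in range(trials)]
    majority = max(set(verdicts), key=verdicts.count)
    return sum(v != majority for v in verdicts) / trials

# Hypothetical stand-in for a real LLM judge: answers "pass" most
# of the time but occasionally flips, simulating sampling noise.
def noisy_judge(prompt, rng=random.Random(0)):
    return "pass" if rng.random() < 0.9 else "fail"

rate = flip_rate(noisy_judge, "Is this summary faithful to the source?")
print(f"flip rate: {rate:.2f}")
```

In a real pipeline the measured flip rate can then drive design choices, e.g. majority-voting over several judge calls when the rate is above some threshold.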
Over in Global Dialogues @collect_intel is asking a global sample of the world: "𝖯𝖾𝗋𝗌𝗈𝗇𝖺𝗅𝗅𝗒, 𝗐𝗈𝗎𝗅𝖽 𝗒𝗈𝗎 𝖾𝗏𝖾𝗋 𝖼𝗈𝗇𝗌𝗂𝖽𝖾𝗋 𝗁𝖺𝗏𝗂𝗇𝗀 𝖺 𝗋𝗈𝗆𝖺𝗇𝗍𝗂𝖼 𝗋𝖾𝗅𝖺𝗍𝗂𝗈𝗇𝗌𝗁𝗂𝗉 𝗐𝗂𝗍𝗁 𝖺𝗇 𝖠𝖨, 𝗂𝖿 𝗍𝗁𝖾 𝖠𝖨 𝗐𝖺𝗌 𝖺𝖽𝗏𝖺𝗇𝖼𝖾𝖽…
1/10: LLM Judges Are Unreliable. Our latest blog post from @padolsey shows that positional preferences, order effects, and prompt sensitivity fundamentally undermine the reliability of LLM judges.
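The positional-preference failure above is easy to check for: ask a pairwise judge to compare two answers, swap their order, and see whether the winner tracks the answer or the slot. A sketch under assumed names (`position_consistent` and the deliberately biased lambda are hypothetical, standing in for a real LLM judge call):

```python
def position_consistent(judge, answer_a, answer_b):
    """Ask the judge twice with the answers swapped; a position-robust
    judge should pick the same underlying answer both times.
    `judge` returns "first" or "second" for the slot it prefers."""
    run1 = judge(answer_a, answer_b)
    run2 = judge(answer_b, answer_a)
    # Consistent iff the preferred slot maps to the same underlying answer.
    return (run1 == "first") == (run2 == "second")

# Hypothetical judge with a hard positional bias: always prefers
# whichever answer appears first, regardless of content.
biased_judge = lambda a, b: "first"

print(position_consistent(biased_judge, "short answer", "detailed answer"))  # False
```

Running every pair in both orders roughly doubles evaluation cost, but the inconsistency rate it surfaces is exactly the order effect the post describes.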
We're officially launching the Global Dialogues Challenge!