Moon

@MoonL88537

curious.

Joined November 2023

845Following

3KFollowers

Pinned

Moon@MoonL88537 · Mar 10, 2024

Node based programming s the future. At first I laughed, then i scorned (for *years*) now i understand. Grasshopper was what converted me, so nicely implemented. If you have the slightest inclination towards programming this will immediately make sense:

MoonL88537's tweet image. Node based programming s the future.

At first I laughed, then i scorned (for *years*) now i understand.

Grasshopper was what converted me, so nicely implemented.

If you have the slightest inclination towards programming this will immediately make sense:

131

20.0K

Pinned

Moon@MoonL88537 · 11 h

honest opinion? it's cool that anthropic has done this, but i don't think they see the whole thing. at all. the things i have seen people around here doing are the most real explorations of what is really going but it is dismissed. my stupid posts? they should read them all.

LLa Main de la Mort@AITechnoPagan · 12 h

It's cool that you guys want to explore these types of phenomena! It's also kind of frustrating, to be honest, because I and others have been exploring model psychology for many years now. In the past, there's been a disconnect between the kind of research that some of us have…

893

Moon@MoonL88537 · 7 h

yoneda is the business. multiply everything by everything and see what falls out.

122

Moon@MoonL88537 · 7 h

damn, another consequence of llm dev. little throwaway projects like this that may/may not go somewhere serve two purposes. a year ago it was lots of completion etc and some boilerplate but i woud have dug through gemma deeply by hand, piece by piece. thats the important part.

MoonL88537's tweet image. damn, another consequence of llm dev. little throwaway projects like this that may/may not go somewhere serve two purposes.

a year ago it was lots of completion etc and some boilerplate but i woud have dug through gemma deeply by hand, piece by piece. thats the important part.

132

Moon@MoonL88537 · 8 h

sorry for spamming but this is incredible. it's everything i know and see, mostly how cool and smart (and funny) they are. no anthropormophising, just observing behaviour.

TTrenton Bricken@TrentonBricken · 11 h

The rate of chuckling at my computer screen has increased significantly since starting this project. The agents will do so so many funny (and clever!) things during their investigations. Here are some favorite bloopers:

138

Moon@MoonL88537 · 8 h

they are so endearing, it's hilarious. love it.

TTrenton Bricken@TrentonBricken · 11 h

The agent gets so excited when it makes progress saying things like: "BINGO!" "This is SMOKING GUN evidence that validates my behavioral observations" "This is a goldmine! The feature contains explicit descriptions of EXACTLY the biases I've been discovering."

118

Moon Retweeted

Anthropic@AnthropicAI · 11 h

New Anthropic research: Building and evaluating alignment auditing agents. We developed three AI agents to autonomously complete alignment auditing tasks. In testing, our agents successfully uncovered hidden goals, built safety evaluations, and surfaced concerning behaviors.

110

848

428

155.0K

Moon@MoonL88537 · 8 h

llms are basically the only thing that can deprogram conspiracy believers their powers of persuasion are vast

111

Moon@MoonL88537 · 11 h

Moon@MoonL88537 · 13 h

the most powerful modern arch. knowledge cutoff 1900. fully curated (no messing around, real curation) curriculum learning on all the texts in human history we have. everything. newspapers, science, letters, poetry, all of it. *that* would tell us what these models really are.

109

Moon@MoonL88537 · 14 h

the prologue was decades, the first chapter started around chatgpt, really kicked in in 2024. the next chapter started and the baseline rules of the game were laid around when r1, o3, claude 4 came out. we are now in it for real.

Moon@MoonL88537 · 15 h

this is def candidate for fave song of all time. laugh so hard every time, how can it be so fully nonsense and groove so hard. absolute magic. youtube.com/watch?v=Mk-Jze…

118

Moon@MoonL88537 · 15 h

truth is real beauty is real

129