Human-Level Hands
@anthony_bak
Friendly. Human-level hands.
This is fascinating - but maybe less surprising when viewed through an adversarial lens, where we already know that data can be transmitted in hidden ways. Here we’re learning that there’s “passive” adversarial data emitted as part of normal operations.
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
I never regret reading a good primary source for my own insights - and I use AI when I’m looking for other people’s insights.
As someone who used AI to aid in learning many things…. This is clearly just not true?
Instructions aren’t the same as Guardrails
So the fail-safes aren't so much saving the fails. Do read the whole thing: the AI destroyed the entire database despite multiple explicit instructions not to make changes without authorization. pcgamer.com/software/ai/i-…
Still waiting for this leaderboard
Insurance is an underrated way to unlock secure AI progress. Insurers are incentivized to truthfully quantify and track risks: if they overstate risks, they get outcompeted; if they understate risks, their payouts bankrupt them. 1/9
🚨The UK AISI identified four methodological flaws in AI "scheming" studies (deceptive alignment) conducted by Anthropic, METR, Apollo Research, and others: "We call on researchers studying AI 'scheming' to minimise their reliance on anecdotes, design research with appropriate…
One of those things that has always seemed obvious but it’s great to see it verified/formalized. Some prominent AI researchers on the other side of this question. Sigh
Can an AI model predict perfectly and still have a terrible world model? What would that even mean? Our new ICML paper formalizes these questions. One result tells the story: A transformer trained on 10M solar systems nails planetary orbits. But it botches gravitational laws 🧵
Still waiting for the leaderboard on this
Cost to insure your chatbot is the new AI leaderboard: ft.com/content/1d3575…
You can now pay even more to skip the line of people who paid to skip the line of people who went through extra background checks to skip the main security line.
CLEAR will now let you pay $99 to skip the CLEAR premium security lane at the airport. No, not a joke.
Really nice work
🧵 What if two images have the same local parts but represent different global shapes purely through part arrangement? Humans can spot the difference instantly! The question is can vision models do the same? 1/15
Taps the sign:
🎺 I am hereby publicly offering to bet @darioamodei $1,000,000 that AI in 2027 will NOT be “smarter than Nobel Prize winners across most fields in science and engineering”. 🎺 Why? We are nowhere close. Consider: • New Stanford study from @ChengleiSi shows that “LLM ideas…
All of AGI discourse sucks because people can’t hold two facts in their heads at the same time: 1) I can use models to help me reason through graduate-level mathematics problems; 2) the models can’t do basic arithmetic like 9.11 - 9.9. Mutatis mutandis for your field of application.
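For the record, the subtraction the tweet above cites, checked with exact decimal arithmetic (a minimal sketch; the `result` name is just for illustration):

```python
from decimal import Decimal

# The subtraction models famously fumble: 9.11 - 9.9.
# Decimal("…") strings avoid binary-float rounding noise,
# so the result is the exact value a human would compute.
result = Decimal("9.11") - Decimal("9.9")
print(result)  # -0.79
```

The point stands either way: the answer is a clean -0.79, and a model that can discuss graduate-level mathematics can still get it wrong.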
When folks dig into quantum tech in SV, very few things look like there's a clean, believable hypothesis into why this will actually work. DC sees the risk if it does work, but doesn't know how to judge the probability / investability by looking at the technical thesis
mRNA platforms are the only way we could defend against a complex bioweapon in real-time. We need to retain manufacturing capacity or we are screwed.
The war against mRNA research is genuinely insane. The mRNA COVID vaccines were a spectacular success at reducing severe illness among adults. Their benefits outside of COVID remain uncertain. That’s … what the science is for. Instead, according to the NYT: - “States and…
On your first day in prison find the biggest guy in the yard and punch him in the face.
Dear @Harvard: