generatorman
@generatorman_ai
every bit counts
GPT is not a next token predictor. The last FFN layer of GPT is a next token predictor. All earlier layers are future tokens predictors, more so when trained at longer context lengths.
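One way researchers probe this claim is the "logit lens": decode each layer's residual stream through the unembedding matrix and see what that depth is predicting. The sketch below uses toy random weights (not a real GPT) purely to show the mechanics of decoding mid-stream; in a trained model, earlier layers' decoded distributions carry information about tokens beyond the immediate next one.

```python
# Minimal logit-lens sketch with TOY weights (illustration only, not a
# trained model): decode the residual stream at every layer, not just the
# last, to inspect what each depth is "predicting".
import numpy as np

rng = np.random.default_rng(0)
d_model, vocab, n_layers = 16, 32, 4

W_U = rng.normal(size=(d_model, vocab))           # unembedding matrix
blocks = [rng.normal(size=(d_model, d_model)) / np.sqrt(d_model)
          for _ in range(n_layers)]                # stand-in residual blocks

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

h = rng.normal(size=d_model)                       # residual stream at one position
per_layer_preds = []
for W in blocks:
    h = h + np.tanh(h @ W)                         # toy residual update
    per_layer_preds.append(softmax(h @ W_U))       # decode mid-stream

# one distribution over the vocab per layer; only the last feeds sampling,
# but every depth can be read as a predictor
assert len(per_layer_preds) == n_layers
assert all(abs(p.sum() - 1.0) < 1e-9 for p in per_layer_preds)
```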
acquihire is a meaningful concept only under chattel slavery
this windsurf witch-hunt is silly. starting a business does not put you under any obligation to continue working for that business over better career opportunities and you are educated enough to know this. he didn't steal the company. run it yourself and stop whining.
crazy how entire industries threw everything they had against torrent piracy for years and didn't even make a dent
most space operas include forerunner civilizations, ancients. it's a comforting idea. a scarier idea is that we are solitary, special. but maybe it's scarier. we are the forerunners. we are the elder race. it has fallen to us to make what will echo through the corridors of time.
attention sparsity through RL 🙌
[LG] Reframing attention as a reinforcement learning problem for causal discovery T Orujlu, C Gumbsch, M V. Butz, C M Wu [University of Tübingen & University of Amsterdam] (2025) arxiv.org/abs/2507.13920
weird for language models to have such minimal mechanisms for doing something so rare...
1+1=3 2+2=5 3+3=? Many language models (e.g., Llama 3 8B, Mistral v0.1 7B) will answer 7. But why? We dig into the model internals, uncover a function induction mechanism, and find that it's broadly reused when models encounter surprises during in-context learning. 🧵
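The quoted prompt is consistent with the model inducing the function f(a, b) = a + b + 1 from the two examples rather than doing plain addition; applying that induced function to the query explains the "7". A one-liner sketch of the induced pattern (my reading of the prompt, not code from the paper):

```python
# The in-context examples 1+1=3 and 2+2=5 both fit f(a, b) = a + b + 1,
# so a model that induces this function answers the query 3+3 with 7.
def induced_f(a, b):
    return a + b + 1

examples = [((1, 1), 3), ((2, 2), 5)]
assert all(induced_f(a, b) == y for (a, b), y in examples)

print(induced_f(3, 3))  # -> 7, matching the models' answer
```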
git blame complete
In retrospect this post is so important. You chose the left one, didn't you? 🤦
Not taking any chances with First Amendment challenges, narrowly tailored to federal contracts. Not gunning for the national ban "fairness doctrine" nuclear option. They might try blacklisting providers, but I wouldn't expect that to stand.
'Update Federal procurement guidelines to ensure that the government only contracts with frontier large language model (LLM) developers who ensure that their systems are objective and free from top-down ideological bias.' there is an executive order on this arriving today.
They're gonna put trackers in your GPUs.
👀 'explore leveraging new and existing location verification features on advanced AI compute to ensure that the chips are not in countries of concern.'
steganography doesn't just exist, it's more ubiquitous than microplastics. every time you train on synthetic datasets you're opening invisible sidechannels. this has been an astounding research programme from @OwainEvans_UK. banger after banger.
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
reasoning is for losers. if the goddess of death is not whispering the truth into your dreams you're ngmi.
Proving beyond the shadow of a doubt that @OpenAI's edits are now functionally useless. I have memory and training turned OFF. Yet GPT remembers past the edit point. This is terrible design, and means you have to kill your chats even more often, because edits are pointless.
I *really* hate that when you edit a conversation with GPT, now, it retains all of the conversation below the edit point as additional context. Legitimately defeats the purpose of having edits and conversation forks.