Brendan Hogan

@brendanh0gan

AI/ML Research @morganstanley || PhD in CS @cornell 2024 || Abingdon Elementary 2005 https://neuron-by-neuron.ghost.io

Joined November 2020

881 Following

2K Followers

Brendan Hogan@brendanh0gan · Jul 25

first preliminary plot looking at all outputs

Brendan Hogan@brendanh0gan · Jul 25

last two runs of the biggest scale project ive ever done 🥲

training 1.5b, 3b, 7b, 14b, 32b models - pretraining + rejection sampling to build a ds + supervised finetuning + reinforcement learning

now time to write

102

10.0K

Brendan Hogan@brendanh0gan · Jul 25

last two runs of the biggest scale project ive ever done 🥲

training 1.5b, 3b, 7b, 14b, 32b models - pretraining + rejection sampling to build a ds + supervised finetuning + reinforcement learning

now time to write

240

100

31.0K

Brendan Hogan@brendanh0gan · Jul 24

i hate beginning a paper with “ai is rapidly advancing” or whatever but I also can never think of another way to start that doesnt sound like it’s trying too hard or feels awkward

1.0K

Brendan Hogan@brendanh0gan · Jul 23

there should be an o3 powered thrift store app, take a picture of something, it asks for clarifying details/pictures - then gets you an estimated resell value then if you purchase it auto makes the listing

762

Brendan Hogan@brendanh0gan · Jul 23

training 1.5b, 3b, 7b, 14b and 32b models for the same task really gives you a good feel for when certain properties emerge

464

109

29.0K

Brendan Hogan@brendanh0gan · Jul 23

youtube.com/watch?v=SvrOzY…

923

Brendan Hogan@brendanh0gan · Jul 19

the evaluation is the product

564

Brendan Hogan@brendanh0gan · Jul 19

never give up 🥹

2.0K

Brendan Hogan@brendanh0gan · Jul 19

recent studies have shown that ancient studies have been confirmed

454

Brendan Hogan@brendanh0gan · Jul 15

advice id love to be able to give myself 3 weeks ago:

1. get the evaluation perfect before doing any training

2. Use vllm for everything possible (and parallel calls)

3. Have fun :)

108

6.0K

Brendan Hogan@brendanh0gan · Jul 14

it felt like any dumb idea I had grpo could magically learn but now that I have a verifiable coding environment with test cases it can't learn at all

3.0K

Brendan Hogan@brendanh0gan · Jul 12

it’s kinda beautiful actually

573

Brendan Hogan Retweeted

attentionmech@attentionmech · Jul 12

intuition people --> see too many patterns --> cant explain to others

reasoning people --> see less, but more precisely --> good explanations

former perceives latter as blind, and latter perceives former as delusional

113

951

370

36.0K

Brendan Hogan@brendanh0gan · Jul 12

i have the perfect benchmark but cant tell you what it is or even the results (too risky)

353

Brendan Hogan@brendanh0gan · Jul 12

richard linklater made before sunrise just to have cash for his art film school of rock

387

Brendan Hogan@brendanh0gan · Jul 11

the trial of tim heidecker has got to be one of the best pieces of media ever produced

331