eugene
@eugeneyalt
Care a lot, try hard, have fun. @eugeneyan's Id. 🏂
Note to self: • Work hard • Keep learning • Cherish loved ones • Find people who inspire you • Be kind & egoless • Eat healthy, exercise, sleep well • Read & write • Practice gratitude & meditate • Be present • Enjoy food & nature • Don’t sweat the small stuff • Smile
wow had a crazy bug where resetting my codebooks every step was causing the epoch count to stop incrementing
looking forward to a weekend of no meetings, no interviews, no doc reviews, no mediating tension between teams and systems, no office hours, and just refactoring rqvae and semantic id training code with cc and long walks in seattle summer with the fam
I had so much joy this morning that I just had to write this and share my gratitude with the world
what a world we live in! I just took a Jupyter notebook that implements an llm-evaluator, provided it as context to a coding assistant, and then asked it to write an evaluator class with specified inputs, outputs, etc. initially, it was verbose with too many class methods, but with…
evals are all you need
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains 'We introduce Rubrics as Rewards (RaR), a framework that uses structured, checklist-style rubrics as interpretable reward signals for on-policy training with GRPO. Our best RaR method yields up to a relative…
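The core idea of checklist-style rubric rewards can be sketched as scoring a response against a list of criteria and using the weighted fraction satisfied as the scalar reward (a minimal sketch, not the paper's implementation; the rubric items and weights here are hypothetical):

```python
# Minimal sketch of a checklist-style rubric reward.
# Each rubric item is a (weight, predicate) pair over the model's response;
# the reward is the weighted share of satisfied items, in [0, 1].

def rubric_reward(response, rubric):
    """Score a response as the weighted fraction of satisfied rubric items."""
    total = sum(w for w, _ in rubric)
    earned = sum(w for w, check in rubric if check(response))
    return earned / total

# Hypothetical rubric for a medical-answer task.
rubric = [
    (2.0, lambda r: "dosage" in r.lower()),   # mentions dosage
    (1.0, lambda r: len(r.split()) > 20),     # sufficiently detailed
    (1.0, lambda r: "consult" in r.lower()),  # advises consulting a doctor
]

reward = rubric_reward("Consult a doctor about the dosage first.", rubric)
```

A scalar in [0, 1] like this plugs directly into an on-policy loop such as GRPO wherever a verifiable reward would normally go.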
When my model trains well I sleep well lol (note how bad it was over the weekend and more)
Metrics after a 200 epoch run: 0.82 recall@10 and 0.67 NDCG. That's crazy good given the simple, smol SASRec has to learn to: • Create valid 4-token IDs to represent the product • Predict the next item in a session With a 3-level codebook of 256 codes per level, that's ~17M possible…
it's alive! i'm astounded you can replace item IDs with sequences of tokens (3 from the codebooks above + 1 to avoid collisions) and train a transformer to output tokens IN THE RIGHT ORDER to recommend items. there are at least 256^3 (16.7M) combinations and it got it right.…
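As a sanity check on the numbers above: 3 codebook levels of 256 codes each give 256^3 = 16,777,216 possible 3-token prefixes, with a 4th token disambiguating items that collide on the same prefix (a minimal sketch; the helper name is mine, not from the actual training code):

```python
# Size of the semantic ID space: 3 codebook levels x 256 codes per level.
levels, codes_per_level = 3, 256
id_space = codes_per_level ** levels  # 256**3 = 16,777,216 prefixes

# A semantic ID = the 3 codebook tokens + a 4th collision-avoidance token
# so that two items sharing the same 3-token prefix still get distinct IDs.
def semantic_id(code_tokens, collision_rank):
    """Append a collision-avoidance token to the 3 codebook tokens."""
    return (*code_tokens, collision_rank)

sid = semantic_id((17, 203, 5), 0)  # -> (17, 203, 5, 0)
```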
look at that codebook distribution 👨‍🍳💋 (thanks @voxmenthe for the tip!)
wow, not sure what Anthropic did, but Claude Code is working superbly with notebooks now, even with sloppy instructions.
not easy to find other folks that work on rqvaes, recsys, and semantic ids
Runs completed! Here's what I learned • we can reduce reconstruction loss with higher commitment weight, but it increases codebook loss. thus, if what you want is good codebooks and semantic IDs, probably not a good idea. the default of 0.25 works well • EMA worked horribly for…
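The commitment-weight tradeoff above comes from the standard VQ-VAE objective, where β scales the commitment term that pulls encoder outputs toward their assigned codes, while the codebook term pulls codes toward the encodings (a minimal forward-only numpy sketch, not the author's training code):

```python
import numpy as np

def vq_losses(z_e, codebook, beta=0.25):
    """Quantize encoder outputs z_e against a codebook and return
    (codebook_loss, commitment_loss) as in the VQ-VAE objective:
      codebook_loss   = ||sg(z_e) - e||^2        (moves codes toward encodings)
      commitment_loss = beta * ||z_e - sg(e)||^2 (keeps encodings near codes)
    (sg = stop-gradient; irrelevant in this forward-only sketch.)
    """
    # Squared distance from each encoding to every codebook entry.
    dists = ((z_e[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    codes = codebook[dists.argmin(axis=1)]  # nearest-code assignment
    sq_err = ((z_e - codes) ** 2).mean()
    return sq_err, beta * sq_err

rng = np.random.default_rng(0)
z_e = rng.normal(size=(8, 4))         # batch of encoder outputs
codebook = rng.normal(size=(256, 4))  # one quantization level, 256 codes
cb_loss, commit_loss = vq_losses(z_e, codebook, beta=0.25)
```

Raising β tightens the encoder to the current codes (lower reconstruction loss) at the cost of slower codebook adaptation, which matches the observation that the 0.25 default is the safer choice when good codebooks are the goal.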
tools for learning and building • zotero: papers, annotations, citations • obsidian: notes, writing, connecting the dots • claude: thinking partner, improve understanding • claude code: implementation, code review (images from work on semantic IDs and residual-quantized…