Eric W. Tramel
@fujikanaeda
Research Scientist @ Nvidia. Ex: Synth Data @ Gretel & Unlearn, Federated Learning @ Amazon Alexa & Owkin. Postdocs @ INRIA & ENS. Views my own.
how did getting a single perplexity number become so dang complicated? sometimes find myself pining for the mnist days
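(for the record, the textbook number is simple, which is the joke. a minimal sketch of exp(mean NLL) with a HF causal LM; the model name is a placeholder and real evals differ on tokenization, context windows, and aggregation:)

```python
import math

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# hypothetical choice; any causal LM works the same way
model_name = "gpt2"
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).eval()

text = "the quick brown fox jumps over the lazy dog"
ids = tok(text, return_tensors="pt").input_ids

with torch.no_grad():
    # passing labels makes HF return the mean token-level cross-entropy
    loss = model(ids, labels=ids).loss

# perplexity is just exp of the mean negative log-likelihood per token
print(f"ppl: {math.exp(loss.item()):.2f}")
```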
Very excited to announce Llama-Nemotron-Super-V1.5! Super-V1.5 is now better than Ultra-V1. This is currently the best model that can be deployed on a single H100. Reasoning On/Off and a drop-in replacement for V1. Open weights, code, and data on HF huggingface.co/nvidia/Llama-3…
📣 Announcing Llama Nemotron Super v1.5 📣 This release pushes the boundaries of reasoning model capabilities at its weight class and is ready to power agentic applications, from individual developers all the way to the enterprise. 📈 The Llama Nemotron…
you want wandb? we have wandb at home. the wandb at home: grep "loss: " *.log
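(the home version, fleshed out a bit: a sketch that pulls the numbers back out and plots them, assuming log lines that literally contain "loss: <float>":)

```python
import glob
import re

import matplotlib.pyplot as plt

# assumes lines like "step 100 | loss: 2.3451" somewhere in each .log file
pattern = re.compile(r"loss: ([0-9]*\.?[0-9]+)")

for path in sorted(glob.glob("*.log")):
    with open(path) as f:
        losses = [float(m.group(1)) for m in pattern.finditer(f.read())]
    plt.plot(losses, label=path)

plt.xlabel("logged step")
plt.ylabel("loss")
plt.legend()
plt.show()
```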
more experiments should run cradle-to-grave and across scale. we're not really doing science well without it. 10-100x more compute is required for us to make sustainable, open, published advances as a field.
last two runs of the biggest-scale project i've ever done 🥲 training 1.5b, 3b, 7b, 14b, 32b models: pretraining + rejection sampling to build a dataset + supervised finetuning + reinforcement learning. now time to write
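(the rejection-sampling step, in the abstract: sample k candidates per prompt, keep the ones a verifier accepts, and the survivors become the SFT set. a sketch with hypothetical generate/verify functions, not the actual pipeline:)

```python
def build_dataset(prompts, generate, verify, k=8):
    """Rejection sampling to build SFT data.

    generate(prompt, n) -> list of n candidate completions (hypothetical)
    verify(prompt, completion) -> bool (e.g. answer checker or unit tests)
    """
    dataset = []
    for prompt in prompts:
        for completion in generate(prompt, k):
            # keep only completions the verifier accepts
            if verify(prompt, completion):
                dataset.append({"prompt": prompt, "completion": completion})
    return dataset
```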
the ml research codebases are always “like this”. they all smell like each other and it's always the same. i'm in a simulation

the distinction between a research scientist and a research engineer is whether you run towards or away from the docs
this is a really cool building btw, you can go have beers up there any day
Reached the final boss
don't worry about hle, 30% is just the error rate of the human soul
claude, it's just a distributed all-reduce, please stop fumbling the bag
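(the thing being fumbled, minimally, with torch.distributed; launch command and backend are assumptions:)

```python
import torch
import torch.distributed as dist

# launch with: torchrun --nproc_per_node=2 this_file.py
dist.init_process_group(backend="gloo")  # "nccl" on GPU nodes
rank = dist.get_rank()

# each rank contributes its own tensor...
t = torch.tensor([float(rank + 1)])

# ...and all_reduce leaves the elementwise sum on every rank
dist.all_reduce(t, op=dist.ReduceOp.SUM)
print(f"rank {rank}: {t.item()}")

dist.destroy_process_group()
```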
HLE has recently become the benchmark to beat for frontier agents. We @FutureHouseSF took a closer look at the chem and bio questions and found about 30% of them are likely invalid based on our analysis and third-party PhD evaluations. 1/7
i'm ready for the singularity-of-good-software. rate-of-innovation outpacing rate-of-communication. there are probably another 5-10 latent tools right now that are solving your problems and you don't even know about them yet.
several other attempts at solving the Python Dependency Problem. one is so much better than the rest that it's not even a competition. we are talking about a hydrogen bomb vs a coughing baby.
entering that stage of life where you're linearly extrapolating losses on a google spreadsheet
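(the spreadsheet, minus the spreadsheet: the same linear fit with numpy.polyfit. the loss values here are made up, and loss curves aren't actually linear in step, which is the joke:)

```python
import numpy as np

# hypothetical logged (step, loss) pairs from a run in progress
steps = np.array([1000, 2000, 3000, 4000])
losses = np.array([3.10, 2.85, 2.70, 2.61])

# fit loss ~ a*step + b and extrapolate, spreadsheet-style
a, b = np.polyfit(steps, losses, deg=1)
print(f"predicted loss at step 10000: {a * 10000 + b:.2f}")
```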
please remember to select “b200” on the menu
buy compute on @PrimeIntellect and data at @datologyai