tensorqt
@tensorqt
chaos dancing star
ML is so attractive to young physicists because many physics grads aren't really looking for physics, but for the great challenge of our time. My generation still believed the LHC was about to give us the secrets of the universe. We then found our own quantum mechanics in ML
1/ RL x Physics = 💥 Laser pulse shapes are essential in studying light-matter interactions. Yet they're typically shaped while overlooking joint effects. I will be presenting our work on shaping laser pulses with RL in Alberta at @RL_Conference. ML for Science, all open source! 🧵
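The actual setup isn't in this tweet, so as a purely hypothetical sketch: framed as a Gymnasium-style environment, pulse shaping could look roughly like the toy below, where the action is a vector of pulse-shape coefficients and the reward stands in for a light-matter interaction figure of merit. The env name, observation, action parametrization, and reward are all made-up placeholders, not the authors' method.

```python
import numpy as np
import gymnasium as gym
from gymnasium import spaces

class PulseShapingEnv(gym.Env):
    """One-step toy env: the action is a vector of pulse-shape coefficients."""
    def __init__(self, n_coeffs=4):
        super().__init__()
        self.action_space = spaces.Box(-1.0, 1.0, shape=(n_coeffs,), dtype=np.float32)
        self.observation_space = spaces.Box(-1.0, 1.0, shape=(n_coeffs,), dtype=np.float32)
        self._target = np.zeros(n_coeffs, dtype=np.float32)

    def reset(self, seed=None, options=None):
        super().reset(seed=seed)
        # Hidden "optimal pulse shape" standing in for the real physics.
        self._target = self.np_random.uniform(-0.5, 0.5, self._target.shape).astype(np.float32)
        return np.zeros_like(self._target), {}

    def step(self, action):
        # Toy reward: closeness of the proposed shape to the hidden optimum,
        # where the real work would score a pulse-interaction simulation.
        reward = -float(np.sum((np.asarray(action) - self._target) ** 2))
        return np.zeros_like(self._target), reward, True, False, {}

env = PulseShapingEnv()
obs, _ = env.reset(seed=0)
_, r, *_ = env.step(env.action_space.sample())
```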
is this AGI
If anyone's interested I just made a @tensorqt simulator LLM, you really can't tell it apart from the real guy dropping it here as open source with MIT license
shadowboxing was invented to be done right after launching a run and pulling up the wandb on the big screen
you guys are testing the lich's patience by cooking him on the tl today
going to actual physics labs, unlike @tensorqt
Around 2 weeks ago, Moonshot released the open-source Kimi-K2, which is officially among the top-performing models, even compared to closed ones. Kimi-K2 is a 1T-parameter Mixture-of-Experts model, of which only 32B get activated per token (8 active experts…
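For intuition, here's a minimal sketch of the top-k MoE routing the tweet describes (8 active experts per token): a router scores every expert, each token runs only through its top k. The dimensions and expert count below are illustrative, not Kimi-K2's actual configuration.

```python
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=2048, n_experts=64, k=8):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(), nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (tokens, d_model)
        scores = self.router(x)                     # (tokens, n_experts)
        weights, idx = scores.topk(self.k, dim=-1)  # each token picks its top-k experts
        weights = weights.softmax(dim=-1)           # normalize the k gate weights
        out = torch.zeros_like(x)
        for slot in range(self.k):                  # only the chosen experts run
            for e in idx[:, slot].unique().tolist():
                mask = idx[:, slot] == e
                out[mask] += weights[mask, slot].unsqueeze(-1) * self.experts[e](x[mask])
        return out

x = torch.randn(16, 512)
print(TopKMoE()(x).shape)  # torch.Size([16, 512])
```

Only the selected experts' parameters touch each token, which is how a 1T-parameter model can cost roughly 32B parameters per forward pass.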
silence first-name-last-name-face-pfp poster, pseudoanon poaster is speaking
if you're in environments pivot to environments building environments
What happens when the models are smart enough to: 1. crawl the web 2. discover millions of verifiable problems 3. rank em all by expected value 4. implement them as novel RL environments What custom RL envs remain worth building in this world (I think prob 1-2 years away)?
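Concretely, a "verifiable problem" here is one where a checker program can score a candidate answer, so it becomes an RL reward for free. The interface below is a hypothetical illustration of that idea, not anyone's actual framework.

```python
from dataclasses import dataclass
from typing import Callable

@dataclass
class VerifiableProblem:
    prompt: str                      # task statement crawled from the web
    verify: Callable[[str], float]   # deterministic score for a candidate answer
    expected_value: float = 0.0      # model's estimate of training value, for ranking

def as_rl_reward(problem: VerifiableProblem, candidate: str) -> float:
    # The env's reward is just the verifier's score; no human labels needed.
    return problem.verify(candidate)

# e.g. a crawled arithmetic task becomes an environment automatically:
p = VerifiableProblem(prompt="What is 17 * 24?",
                      verify=lambda s: float(s.strip() == "408"))
print(as_rl_reward(p, "408"))  # 1.0
```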
met @elyxlz, @JMadaluni and the team from @audiogenai this week (in Rome!). incredibly strong team, very bullish on them building some really good stuff
imo we often overlook the problem of catastrophic forgetting in pretraining, in the sense that we think it's solely a post-training problem. The fact that the model doesn't update robustly means you are likely updating wrong, and this is true both in pretraining and…
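The effect itself is easy to reproduce; here's a minimal toy sketch of measuring it: fit a model on task A, keep training on task B only, and watch the loss on A degrade. Purely illustrative (a linear model on synthetic data), but the same measurement applies to pretraining data mixtures.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(2, 1)
opt = torch.optim.SGD(model.parameters(), lr=0.1)
mse = nn.MSELoss()

x = torch.randn(256, 2)
y_a = x @ torch.tensor([[1.0], [0.0]])   # "task A": predict the first feature
y_b = x @ torch.tensor([[0.0], [1.0]])   # "task B": predict the second feature

def fit(y, steps=200):
    for _ in range(steps):
        opt.zero_grad()
        mse(model(x), y).backward()
        opt.step()

fit(y_a)
print("loss on A after A:", mse(model(x), y_a).item())  # near zero
fit(y_b)  # keep updating on B only, no replay of A
print("loss on A after B:", mse(model(x), y_a).item())  # large again: forgetting
```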