Michal Wilinski
@inverse_hessian
member of technical staff @ stealth · incoming cs phd student @SCSatCMU · bsc from @PUT_Poznan
spent a few hours integrating this into TRL for online methods 🤝🏻 the code itself isn't much, but testing took time 🥲 it works when the training part is on a single GPU, or when training and vLLM are colocated on a single GPU 🙇🏻♀️
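A minimal sketch of what the colocated setup described above can look like in TRL. This is not the actual integration code; the parameter names (`use_vllm`, `vllm_mode`, `vllm_gpu_memory_utilization`) follow recent TRL releases, so check your version's docs:

```python
# Hedged sketch: run vLLM generation inside the training process so that
# online methods (e.g. GRPO) share one GPU between training and sampling.
from trl import GRPOConfig

config = GRPOConfig(
    output_dir="grpo-colocated",
    use_vllm=True,                    # use vLLM for rollout generation
    vllm_mode="colocate",             # vLLM lives in the training process, same GPU
    vllm_gpu_memory_utilization=0.3,  # leave most of the VRAM for training
)
```

The alternative is server mode, where vLLM runs as a separate process on its own GPU; colocation is the single-GPU case the tweet refers to.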
transformers 🤝 vLLM for VLM serving is out 🔥 you can now serve many vision-language models in vLLM, and it makes a huuuge difference for Qwen VLs 😍
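A minimal sketch of serving a VLM through vLLM's transformers backend; the model name is just an example, and the `--model-impl` flag assumes a recent vLLM release:

```shell
# Serve a Qwen VL model using the transformers modeling code as the backend.
vllm serve Qwen/Qwen2.5-VL-7B-Instruct --model-impl transformers
```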
We’ve updated Qwen3 and made excellent progress. The non‑reasoning model now delivers significant improvements across a wide range of tasks and many of its capabilities already rival those of reasoning models. It’s truly remarkable, and we hope you enjoy it!
Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…
Very excited to share that an advanced version of Gemini Deep Think is the first to have achieved gold-medal level in the International Mathematical Olympiad! 🏆 It solved five out of six problems perfectly, as verified by the IMO organizers! It’s been a wild run to lead this…
Super thrilled to share that our AI has now reached silver-medalist level in Math at #imo2024 (1 point away from 🥇)! Since Jan, we not only have a much stronger version of #AlphaGeometry, but also an entirely new system called #AlphaProof, capable of solving many more…
🚨 According to a friend, the IMO asked AI companies not to steal the spotlight from kids and to wait a week after the closing ceremony to announce results. OpenAI announced the results BEFORE the closing ceremony. According to a Coordinator on Problem 6, the one problem OpenAI…
If you're at #ICML2025, come say hi and learn about teacher hacking in distillation. See you at poster E-2706!
1/ If you’re familiar with RLHF, you’ve likely heard of reward hacking, where over-optimizing an imperfect reward model leads to unintended behaviors. But what about teacher hacking in knowledge distillation: can the teacher be hacked, like rewards in RLHF?
Today, come and see us at the poster session in the East Exhibition Hall:
- Joint MoE Scaling Laws (E-2609); tl;dr: MoE can be memory efficient
- Since Faithfulness Fails (E-2101); tl;dr: inferring causal relationships turns out to be surprisingly hard
Let’s chat more, my great…
Come see our poster!
Wednesday, 4:30 PM PDT: Presenting our work Exploring Representations and Interventions in Time Series Foundation Models (arxiv.org/abs/2409.12915) along with @inverse_hessian in West Exhibition Hall B2-B3 (#W-507)
blog.ml.cmu.edu/2025/07/08/car… Check out our latest post on CMU @ ICML 2025!
We are excited to announce the second edition of the Robot Air Hockey Challenge! A challenging benchmark to test your robotics and robot learning abilities! Another collaboration between @ias_tudarmstadt and @Huawei Noah's Ark lab, to push the limits of robotics research!
Excited to highlight @WPotosnak et al.'s work: a novel hybrid global-local architecture + model-agnostic pharmacokinetic encoder that enables patient-specific treatment effect modeling—significantly improving blood glucose forecasting on large-scale datasets. #CHIL2025 @AutonLab
the mech interp team v. the safety team at Anthropic in a nutshell
the more i think about that "agentic misalignment" research, the more frustrated i get. it is deeply, *offensively* unserious work. if you really think you're in a position of unprecedented leverage over the human future, then -- start acting like it!! nostalgebraist.tumblr.com/post/787119374…
In case there is any ambiguity: DINOv2 is 100% a product of dumb hill-climbing on ImageNet-1k kNN accuracy (and linear probing too). Overfitting an eval can be bad. But sometimes the reward signal is reliable and leads to truly good models. It's about finding a balance
Oh, I am a big fan of self-supervised learning. Also, SSL has never been benchmark-maxing on ImageNet afaik. I am mainly complaining about the supervised-classification ImageNet hill climb
i was finally convinced by @_sungmin_cha and @beopst to work on unlearning. the first thing we did (or i learned) together with Sungjin and Dason was to study how people evaluate unlearning, and, horror 😱, here is our short writeup and report on what we found and what we propose as a…