Avery Ma
@avery__ma
A renowned researcher in the field just stopped by my poster and we chatted. One of the best moments of my career so far.
We often use #VAML/#MuZero losses with deterministic models. But if we want stochastic models, whether to measure uncertainty or to leverage current SOTA models such as #transformers and #diffusion, we need to take care! Naively translating the loss functions leads to mistakes!
Would you be surprised that many empirical implementations of value-aware model learning (VAML) algos, including MuZero, lead to incorrect models & value functions when training stochastic models 🤕? In our new @icml_conf 2025 paper, we show why this happens and how to fix it 🦾!
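A rough sketch of the failure mode, in my own notation rather than the paper's: with a stochastic model, the naive per-sample squared loss also penalizes the model's own variance, so its minimizer collapses toward a deterministic model even when the true dynamics are stochastic.

```latex
% Sketch only; notation is mine, not necessarily the paper's.
% V: fixed value function, P: true dynamics, \hat{P}: learned stochastic model.
\[
\mathcal{L}_{\mathrm{naive}}(\hat{P})
  = \mathbb{E}_{s' \sim P(\cdot\mid s,a)}\,
    \mathbb{E}_{\hat{s}' \sim \hat{P}(\cdot\mid s,a)}
    \big[\big(V(s') - V(\hat{s}')\big)^2\big]
\]
% With X = V(s') and Y = V(\hat{s}') independent,
% E[(X-Y)^2] = (E[X]-E[Y])^2 + Var(X) + Var(Y), so
\[
\mathcal{L}_{\mathrm{naive}}(\hat{P})
  = \big(\mathbb{E}_{P}[V] - \mathbb{E}_{\hat{P}}[V]\big)^2
    + \mathrm{Var}_{P}[V] + \mathrm{Var}_{\hat{P}}[V].
\]
% Minimizing this shrinks \mathrm{Var}_{\hat{P}}[V], pushing the model toward
% determinism. A value-aware objective should compare expectations instead:
\[
\mathcal{L}_{\mathrm{VAML}}(\hat{P})
  = \big(\mathbb{E}_{s' \sim P}[V(s')]
       - \mathbb{E}_{\hat{s}' \sim \hat{P}}[V(\hat{s}')]\big)^2.
\]
```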
🎉Good news, everyone! 🎉 I will be recruiting graduate students to work on the algorithmic and theoretical aspects of Reinforcement Learning. You will join Adage, @Mila_Quebec, and @polymtl. More info on why and how you should apply: academic.sologen.net/2024/11/22/gra… Deadline: Dec 1st
I’ll be presenting our work on understanding the robustness difference between models trained via different optimizers at @iclr_conf. Visit our poster (Friday 4:30-6:30, Halle B #101) to learn about the pitfall of adaptive gradient methods. #ICLR2024 Paper: arxiv.org/abs/2308.06703
"Without a perfect model, model-based RL is hopeless!" Our paper at #ICLR2024 challenges this belief! Even an inaccurate model can help a lot. Don’t throw it away! Title: Maximum Entropy Model Correction in Reinforcement Learning Paper: openreview.net/forum?id=kNpSU… 🧵(1/7)
Blog: Is Your Neural Network at Risk? The Pitfall of Adaptive Gradient Optimizers Summary: Models trained using SGD exhibit significantly higher robustness to input perturbations than those trained via adaptive gradient methods such as Adam or RMSProp. vectorinstitute.ai/is-your-neural…
Another paper rejected,
CVPR review, GPT-suspected,
AC inaction, disappointed,
Innovation, undetected,
To ECCV, resubmitted.
Did you know that the optimizer significantly affects the robustness of NNs? And Adam is the wrong answer! 😯 "Understanding the robustness difference between SGD and adaptive gradient methods" dives deep into this. Paper: openreview.net/forum?id=ed8Sk… Code: github.com/averyma/opt-ro… 🧵1/4
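To make that claim concrete, here is a toy, hypothetical probe, not the paper's code; the data, architecture, and hyperparameters are my own choices: train the same small network with SGD and with Adam, then compare accuracy under Gaussian input noise.

```python
# Hypothetical sketch (not the paper's code): train identical MLPs with SGD
# and Adam, then compare accuracy under Gaussian input perturbations.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Synthetic binary classification in 20-D: the label depends on two inputs.
n, d = 2000, 20
X = torch.randn(n, d)
y = (X[:, 0] + 0.5 * X[:, 1] > 0).long()

def make_model():
    return nn.Sequential(nn.Linear(d, 64), nn.ReLU(), nn.Linear(64, 2))

def train(model, opt, epochs=200):
    # Full-batch training, purely for simplicity.
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        opt.zero_grad()
        loss_fn(model(X), y).backward()
        opt.step()

def noisy_accuracy(model, sigma):
    # Accuracy on inputs perturbed by isotropic Gaussian noise of scale sigma.
    with torch.no_grad():
        preds = model(X + sigma * torch.randn_like(X)).argmax(dim=1)
    return (preds == y).float().mean().item()

sgd_model = make_model()
train(sgd_model, torch.optim.SGD(sgd_model.parameters(), lr=0.1))

adam_model = make_model()
train(adam_model, torch.optim.Adam(adam_model.parameters(), lr=1e-3))

for sigma in (0.0, 0.5, 1.0, 2.0):
    print(f"sigma={sigma}: SGD acc={noisy_accuracy(sgd_model, sigma):.3f}, "
          f"Adam acc={noisy_accuracy(adam_model, sigma):.3f}")
```

On a synthetic problem this small, the gap may be modest or noisy; the sketch only illustrates the measurement protocol, not the paper's experiments.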