Behrad Moniri

@bemoniri

Working on the foundations of machine learning @Penn. QR Intern @ Cubist Systematic Strategies (Point72). Previously studied EE @ Sharif in Iran.

Joined February 2020

627Following

981Followers

Pinned

Behrad Moniri@bemoniri · May 1

Accepted at #icml2025!

BBehrad Moniri@bemoniri · Feb 5

Check out our recent paper on layer-wise preconditioning methods for optimization and feature learning theory:

6.0K

Behrad Moniri Retweeted

Behrad Moniri@bemoniri · Jul 15

Deep learning theorists study simple NNs (e.g., two-layer model, or linear networks) in idealized settings (e.g., isotropic inputs). There are very nice results showing that SGD can learn "good" features in these settings.

581

Behrad Moniri@bemoniri · Jun 17

«ز رقیبِ دیوسیرت، به خدای خود پناهم / ...» ـ حافظ ganjoor.net/hafez/ghazal/s…

779

Behrad Moniri@bemoniri · Jun 12

👀 Cool recent work on uncertainty quantification of LLMs.

SSima Noorani@NooraniSimaa · Jun 12

How can we quantify uncertainty in LLMs from only a few sampled outputs? The key lies in the classical problem of missing mass—the probability of unseen outputs. This perspective offers a principled foundation for conformal prediction in query-only settings like LLMs.

2.0K

Behrad Moniri Retweeted

Statistics Papers@StatsPapers · May 27

On the Mechanisms of Weak-to-Strong Generalization: A Theoretical Perspective. arxiv.org/abs/2505.18346

1.0K

Behrad Moniri Retweeted

Jeremy Bernstein@jxbz · May 13

I was really grateful to have the chance to speak at @Cohere_Labs and @ml_collective last week. My goal was to make the most helpful talk that I could have seen as a first-year grad student interested in neural network optimization. Sharing some info about the talk here... (1/6)

568

550

54.0K

Behrad Moniri@bemoniri · May 8

Super excited to announce our ICML workshop on highlighting the power (and limitations?) of small-scale in the era of large-scale ML. You can submit just a Jupyter notebook, Jupyter notebook + paper, or a survey/position paper. Do submit your work and help us spread the word!

MMOSS@MOSS_workshop · May 6

Announcing the 1st Workshop on Methods and Opportunities at Small Scale (MOSS) at @icmlconf 2025! 🔗Website: sites.google.com/view/moss2025 📝 We welcome submissions! 📅 Paper & jupyter notebook deadline: May 22, 2025 Topics: – Inductive biases & generalization – Training…

6.0K

Behrad Moniri@bemoniri · Apr 29

I am hugely indebted to Prof. Strogatz. His book on Nonlinear Dynamics and Chaos literally changed my life in undergrad. Otherwise I would have been doing electromagnetic theory (although he doesn't even know me).

NNational Academy of Sciences@theNASciences · Apr 25

Congratulations Steven Strogatz of @Cornell, newly inducted #NASmember! #NAS162 #mathematics

1.0K

Behrad Moniri@bemoniri · Apr 21

Cool software package by Subramonian and @dohmatobelvis for free-probability computations in ML theory.

MMathematical Software Papers@GFNCL · Apr 16

auto-fpt: Automating Free Probability Theory Calculations for Machine Learning Theory. arxiv.org/abs/2504.10754

1.0K

Behrad Moniri Retweeted

Penn Engineering AI@PennEngAI · Apr 9

With $840K in funding from @awscloud, @PennAsset is supporting 12 Ph.D. students conducting cutting-edge research in AI safety, robustness and interpretability. bit.ly/422Nfeo #AIMonth2025 #TrustworthyAI

3.0K

Behrad Moniri@bemoniri · Apr 4

Circles of Hell: Circle 8: "Thanks for the response. My concerns are addressed. I wish to keep my score." Circle 9: "Acknowledgement: I confirm that I have read the author response to my review and will update my review in light of this response as necessary."

744

Behrad Moniri@bemoniri · Feb 5

Check out our recent paper on layer-wise preconditioning methods for optimization and feature learning theory:

SStat.ML Papers@StatMLPapers · Feb 5

On The Concurrence of Layer-wise Preconditioning Methods and Provable Feature Learning ift.tt/LvMb2yw

10.0K

Behrad Moniri Retweeted

Amin Karbasi@aminkarbasi · Jan 31

A new generation of jailbreaks are rolling out by our team at @robusthq and in collaboration with @PennEngineers. We jailbreak @deepseek_ai R1 model with a %100 attack success rate. To know more, see our blog post on @CiscoSecure and the corresponding @WIRED article. amazing…

4.0K