Shanda Li 黎善达
@Shanda_Li_2000
PhD student @mldcmu
Can LLMs solve PDEs? 🤯 We present CodePDE, a framework that uses LLMs to automatically generate PDE solvers and outperform human implementations! 🚀 CodePDE demonstrates the power of inference-time algorithms and scaling for PDE solving. More in 🧵: #ML4PDE #AI4Science
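The CodePDE idea — have an LLM emit solver code, then score it against a reference — can be sketched as a generate-and-evaluate loop. This is a minimal illustration, not the paper's implementation: `mock_llm_generate` is a hypothetical stand-in for a real LLM call, and the evaluation uses a 1D heat equation with a known analytic solution.

```python
import numpy as np

def mock_llm_generate(prompt):
    # Hypothetical stand-in for an LLM API call: returns solver source code.
    # Here it always emits an explicit-Euler finite-difference heat solver.
    return (
        "def solve(u0, dx, dt, steps, alpha=1.0):\n"
        "    u = u0.copy()\n"
        "    for _ in range(steps):\n"
        "        lap = (np.roll(u, 1) - 2 * u + np.roll(u, -1)) / dx**2\n"
        "        u = u + dt * alpha * lap\n"
        "    return u\n"
    )

def evaluate(solver, u0, dx, dt, steps, reference):
    # Score a generated solver by its max error against a reference solution.
    return float(np.abs(solver(u0, dx, dt, steps) - reference).max())

# 1D periodic heat equation u_t = u_xx; sin(x) decays as e^{-t} * sin(x).
n = 128
x = np.linspace(0, 2 * np.pi, n, endpoint=False)
dx = x[1] - x[0]
dt = 0.2 * dx**2            # CFL-stable step for explicit Euler
steps = 200
u0 = np.sin(x)
exact = np.exp(-dt * steps) * np.sin(x)

# Execute the LLM-generated code and measure its error.
namespace = {"np": np}
exec(mock_llm_generate("Solve the 1D periodic heat equation."), namespace)
err = evaluate(namespace["solve"], u0, dx, dt, steps, exact)
```

In the real framework the error signal would feed back into the prompt so the LLM can refine its solver across inference-time iterations.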


In our new preprint, we demonstrate, for the first time, the test-time inference scaling behavior (with a faster convergence rate) of neural PDE solvers. The core idea is to derive a new PDE that characterizes the error of the neural PDE solver. 2prime.github.io/files/scasml_t…
How can we do inference-time scaling for scientific machine learning? Our new inference-time scaling framework leads to a new approach to high-dimensional PDE solving and a new eigenvalue solver. Join us @ #SIAM CSE next Monday, 9:45 AM - 11:25 AM, Room 114
🚀 We are happy to organize the BERT²S workshop @NeurIPSConf 2025 on Recent Advances in Time Series Foundation Models. 🌐 berts-workshop.github.io 📜Submit by August 22 🎓Speakers and panelists: @ChenghaoLiu15 Mingsheng Long @zoe_piran @danielle_maddix @atalwalkar @qingsongedu
Stop by the poster sessions today at ICML Workshop on Computer Use Agents to chat about OpenHands-Versa!
Can we design AI Agents that achieve generalizability across diverse task domains? Our new paper introduces OpenHands-Versa, a generalist agent with strong performance on three challenging agent benchmarks, ranking #1 on SWE-Bench Multimodal and The Agent Company leaderboards 🚀
In recent work arxiv.org/abs/2502.12123 w/ E. Botta, @_Yuchen_Li_ , A. Mehta, @jordan_t_ash, @_cyrilzhang, we explore some algorithmic aspects of constrained generation w/ a generator & process verifier. Paper at #ICML2025, poster session (today) details in screenshot, 🧵below.
Check out our recent work on research agents, led by @PlanarG1 and @sunweiwei12!
Most AI agents are tested in a bubble. But real ML breakthroughs happen in communities. We introduce CoMind, a research agent that learns from community knowledge. 📊 CoMind outperforms ~70% of human teams in a CVPR 2025 workshop competition. 🧵👇
🔥Unlocking New Paradigm for Test-Time Scaling of Agents! We introduce Test-Time Interaction (TTI), which scales the number of interaction steps beyond thinking tokens per step. Our agents learn to act longer➡️richer exploration➡️better success Paper: arxiv.org/abs/2506.07976
Multi-turn agents need to really interact with the environment to get new context and information. This is the key difference from single-turn QA settings.
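The intuition behind scaling interaction steps (rather than per-step thinking tokens) can be shown with a toy search task: an agent that is allowed more environment probes succeeds more often, because each action returns new information. This is an illustrative sketch, not the TTI method itself; the environment and "policy" are stand-ins.

```python
import random

def run_episode(env_size, max_steps, seed):
    # Toy task: a goal cell is hidden among env_size cells; the agent
    # probes one unvisited cell per interaction step.
    rng = random.Random(seed)
    goal = rng.randrange(env_size)
    visited = set()
    for _ in range(max_steps):
        candidates = [i for i in range(env_size) if i not in visited]
        probe = rng.choice(candidates)   # exploration policy stub
        visited.add(probe)
        if probe == goal:
            return True                  # new context revealed the goal
    return False

def success_rate(env_size, max_steps, trials=200):
    return sum(run_episode(env_size, max_steps, s) for s in range(trials)) / trials

# Same task, same per-step "compute" -- only the interaction budget changes.
low = success_rate(env_size=50, max_steps=5)
high = success_rate(env_size=50, max_steps=25)
```

With 5 probes of 50 cells the success rate is near 10%; with 25 probes it is near 50% — success scales with interaction steps, which is the axis TTI proposes to scale.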
A lot of work on agents these days uses reasoning RL to train agents. But is that good enough? @jackbai_jkb & @JunhongShen1 show that it's not: we also want RL to learn *how* to explore and *discover* novel behaviors, by scaling "in-context" interaction!…
🌟Move beyond evaluation on synthetic toy problems and advance machine intelligence like #AlphaEvolve! 🚀 Introducing FrontierCO — our new Machine Learning for Combinatorial Optimization benchmark featuring high-quality NP-hard instances from real-world applications and…
I’m excited to share new work from Datadog AI Research! We just released Toto, a new SOTA (by a wide margin!) time series foundation model, and BOOM, the largest benchmark of observability metrics. Both are available under the Apache 2.0 license. 🧵
@khodakmoments, @__tm__157, along with myself, @nmboffi and Jianfeng Lu are organizing a COLT 2025 workshop on the Theory of AI for Scientific Computing, to be held on the first day of the conference (June 30).
Cooool!
Back in March, I wore a head-mounted camera for a week straight and fine-tuned ChatGPT on the resulting data. Here's what happened (1/6) arxiv.org/pdf/2504.03857
blog.ml.cmu.edu/2025/04/09/cop… How do real-world developer preferences compare to existing evaluations? A CMU and UC Berkeley team led by @iamwaynechi and @valeriechen_ created @CopilotArena to collect user preferences on in-the-wild workflows. This blogpost overviews the design and…
We built @CopilotArena this fall as part of @lmarena_ai in order to evaluate coding models in realistic, interactive environments. Check out our recent writeup describing the results, as well as details of the system itself. Work led by @iamwaynechi and @valeriechen_.
What do developers 𝘳𝘦𝘢𝘭𝘭𝘺 think of AI coding assistants? In October, we launched @CopilotArena to collect user preferences on real dev workflows. After months of live service, we’re here to share our findings in our recent preprint. Here's what we have learned /🧵
We introduce Mixture-of-Mamba, a multi-modal SSM that leverages modality-aware sparsity for efficient multi-modal pretraining! At the core of Mixture-of-Mamba: 🔹Modality-aware sparsity to optimize efficiency 🔹Mixture-of-SSMs to enable cross-modal interactions 🔹Scales…
🚀 Want 2x faster pretraining for your multi-modal LLM? 🧵 Following up on Mixture-of-Transformers (MoT), we're excited to share Mixture-of-Mamba (MoM)! arxiv.org/abs/2501.16295 🔥 Why it matters: MoM applies modality-aware sparsity across image, text, and speech—making…
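The core of modality-aware sparsity is that each token is processed by projection weights specific to its modality, while the sequence stays interleaved. This is a minimal NumPy sketch under stated assumptions — the shapes, the `modality_aware_proj` helper, and restricting the decoupling to a single input projection are illustrative, not the Mixture-of-Mamba architecture itself.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, d_inner, n_modalities = 8, 16, 3   # e.g. text / image / speech

# One input-projection matrix per modality ("modality-aware" weights);
# Mixture-of-Mamba applies this kind of decoupling inside the SSM block.
W_in = rng.standard_normal((n_modalities, d_model, d_inner))

def modality_aware_proj(x, modality_ids):
    # x: (seq, d_model); modality_ids: (seq,) with values in [0, n_modalities).
    # Each token is routed to its own modality's weights -- sparse in the
    # sense that only one modality's parameters are active per token.
    out = np.empty((x.shape[0], d_inner))
    for m in range(n_modalities):
        mask = modality_ids == m
        out[mask] = x[mask] @ W_in[m]
    return out

x = rng.standard_normal((6, d_model))
ids = np.array([0, 0, 1, 1, 2, 2])          # interleaved modality tags
y = modality_aware_proj(x, ids)
```

Because each token touches only one modality's weights, the per-token compute matches a dense single-modality projection while the model keeps modality-specialized parameters — the source of the pretraining speedup the thread describes.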