ML@CMU
@mlcmublog
Official twitter account for the ML@CMU blog @mldcmu @SCSatCMU
blog.ml.cmu.edu/2025/07/08/car… Check out our latest post on CMU @ ICML 2025!
blog.ml.cmu.edu/2025/06/01/rlh… In this in-depth coding tutorial, @GaoZhaolin and @g_k_swamy walk through the steps to train an LLM via RL from Human Feedback!
blog.ml.cmu.edu/2025/05/22/unl… Are your LLMs truly forgetting unwanted data? In this new blog post authored by @shengyuan_26734, Yiwei Fu, @zstevenwu, and @gingsmith, we discuss how benign relearning can jog unlearned LLM's memory to recover knowledge that is supposed to be forgotten.
blog.ml.cmu.edu/2025/04/23/car… Check out our latest blog post on CMU @ ICLR 2025!
blog.ml.cmu.edu/2025/04/21/all… Check out our new blog post on ALLIE, a new chess AI that actually plays like a human! Unlike Stockfish or AlphaZero that focus on winning at all costs, ALLIE uses a transformer model trained on human chess games to make moves, ponder and resign like…
blog.ml.cmu.edu/2025/04/18/llm… 📈⚠️ Is your LLM unlearning benchmark measuring what you think it is? In a new blog post authored by @prthaker_, @shengyuan_26734, @neilkale, @yash_maurya01, @zstevenwu, and @gingsmith, we discuss why empirical benchmarks are necessary but not…
blog.ml.cmu.edu/2025/04/09/cop… How do real-world developer preferences compare to existing evaluations? A CMU and UC Berkeley team led by @iamwaynechi and @valeriechen_ created @CopilotArena to collect user preferences on in-the-wild workflows. This blogpost overviews the design and…

blog.ml.cmu.edu/2025/01/08/opt… How can we train LLMs to solve complex challenges beyond just data scaling? In a new blogpost, @setlur_amrith, @QuYuxiao Matthew Yang, @LunjunZhang , @gingsmith and @aviral_kumar2 demonstrate that Meta RL can help LLMs better optimize test time compute
blog.ml.cmu.edu/2025/01/02/ind… Why is our brain 🧠 modular with specialized areas? Recent research by Ruiyi Zhang @Xaqlab shows that artificial agents 🤖 with modular architectures—mirroring brain-like specialization—achieve better learning and generalization in naturalistic navigation…
blog.ml.cmu.edu/2024/12/12/hum… Have you had difficulty using a new machine for DIY or latte-making? Have you forgotten to add spice during cooking? @hciphdstudent @hiromu1996 @mollyn_paan, Jill Fain Lehman, and @mynkgoel are leveraging multimodal sensing to improve the…
blog.ml.cmu.edu/2024/12/06/scr… A critical question arises when using large language models: should we fine-tune them or rely on prompting with in-context examples? Recent work led by @JunhongShen1 and collaborators demonstrates that we can develop state-of-the-art web agents by…

blog.ml.cmu.edu/2024/12/02/car… Check out our latest blog post on CMU @ NeurIPS 2024!
blog.ml.cmu.edu/2024/11/07/ide… Demining 70+ war-affected countries could take 1,100 years at the current pace. This AI-powered tool, developed in close collaboration with the UN in work led by Mateo Dulce, halves false alarms and speeds up clearance. Now tested in Afghanistan &…

blog.ml.cmu.edu/2024/10/29/jai… AI-powered robots are alarmingly easy to jailbreak to perform dangerous tasks, including delivering bombs, surveilling humans, and ignoring traffic laws. What does the future hold for AI-powered robots? Learn more in our latest blog post, based on work…