Chaitanya Malaviya
@cmalaviya11
PhD student at UPenn @upennnlp | benchmarking and evaluation | incoming senior research scientist @GoogleDeepMind | prev @allen_ai @GoogleDeepMind and @LTIatCMU
Ever wondered what makes language models generate overly verbose, vague, or sycophantic responses? Our new paper investigates these and other idiosyncratic biases in preference models, and presents a simple post-training recipe to mitigate them! Thread below 🧵↓
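For a rough sense of what "idiosyncratic bias" means here, a minimal sketch (not the paper's actual recipe or code): one simple check for verbosity bias is to pad a response with filler and see whether the preference model's score flips in its favor. `score_response` below is a hypothetical placeholder for a real preference/reward model.

```python
# Minimal sketch: probing a preference model for verbosity bias.
# `score_response` is a hypothetical stand-in for a real reward/preference
# model; it is NOT the paper's model or code.

def score_response(prompt: str, response: str) -> float:
    """Placeholder scorer; swap in a real preference model here."""
    # Toy heuristic for illustration only: longer responses score higher,
    # which is exactly the kind of bias we'd want to detect and mitigate.
    return len(response.split()) * 0.01

def verbosity_flip_rate(pairs):
    """Fraction of prompts where padding a response flips the preference."""
    flips = 0
    for prompt, concise, padding in pairs:
        verbose = concise + " " + padding  # same content, extra filler
        if score_response(prompt, verbose) > score_response(prompt, concise):
            flips += 1
    return flips / len(pairs)

pairs = [
    ("What is the capital of France?",
     "Paris.",
     "To elaborate, this is a widely known fact that many sources confirm."),
]
print(f"Verbosity flip rate: {verbosity_flip_rate(pairs):.2f}")
```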

Issues with preference LM benchmarks:
🐡 data contains cases where the "bad" response is just as good as the chosen one
🐟 model rankings can feel off (Claude ranks lower than expected)
Led by @cmalaviya11 (TACL 2025), we study underspecified queries and their detrimental effect on model evals.
In our new paper, “Contextualized Evaluations: Judging Language Model Responses to Underspecified Queries,” we find that adding just a bit of missing context can reorder model leaderboards—and surface hidden biases. 🧵👇
Context is an overlooked aspect of language model evaluation. Check out our TACL paper on how to incorporate context into evaluations, how it changes evaluation conclusions, and how it makes evaluation more reliable!
Happy to share that EvalAgent has been accepted to #COLM2025 @COLM_conf 🎉🇨🇦 We introduce a framework to identify implicit and diverse evaluation criteria for various open-ended tasks! 📜 arxiv.org/pdf/2504.15219
Evaluating language model responses on open-ended tasks is hard! 🤔 We introduce EvalAgent, a framework that identifies nuanced and diverse evaluation criteria 📋✍️. EvalAgent finds 👩🏫🎓 expert advice on the web that implicitly addresses the user’s prompt 🧵👇
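A loose sketch of the general idea (retrieve expert how-to advice relevant to the prompt, then distill it into checkable criteria). The retrieval step and prompt wording below are my own placeholders, not EvalAgent's actual implementation.

```python
# Loose sketch of the EvalAgent idea: turn web "expert advice" about a task
# into explicit evaluation criteria. Retrieval and the LLM call are stubbed;
# names and prompt wording are illustrative placeholders, not the real system.

def retrieve_expert_advice(user_prompt: str) -> list[str]:
    """Stand-in for web search over how-to guides relevant to the prompt."""
    return [
        "A good cover letter opens by naming the specific role and company.",
        "Keep it to one page and close with a clear call to action.",
    ]

def build_criteria_prompt(user_prompt: str, advice: list[str]) -> str:
    advice_block = "\n".join(f"- {a}" for a in advice)
    return (
        f"User task: {user_prompt}\n\n"
        f"Expert advice found on the web:\n{advice_block}\n\n"
        "Rewrite this advice as a list of concrete criteria an evaluator "
        "could check a response against."
    )

user_prompt = "Write a cover letter for a data analyst position."
print(build_criteria_prompt(user_prompt, retrieve_expert_advice(user_prompt)))
# The resulting prompt would then be sent to an LLM to produce the criteria list.
```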
Thanks for the mention @natolambert :) shoutout to the amazing undergrad @abharadwaj123 who led this work!
Nice to see folks studying biases in RLHF / preference tuning all the way down to the datasets. I think many of the biases are mostly irreducible human biases that can't be solved within current training regimes, just mitigated.
Super excited to be awarded the 2024 Google PhD Fellowship in Natural Language Processing! Huge thanks to my advisor @JonathanBerant, my collaborators, and @GoogleAI for supporting our research - exciting things ahead! blog.google/technology/res…
Come chat with me and @cmalaviya11 at #emnlp2024 about evaluating LMs, and how findings can be impacted when dataset queries are vague, underspecified, and lacking clarifying context!
Excited to share ✨ Contextualized Evaluations ✨! Benchmarks like Chatbot Arena contain underspecified queries, which can lead to arbitrary eval judgments. What happens if we provide evaluators with context (e.g., who's the user, what's their intent) when judging LM outputs? 🧵↓
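For a concrete sense of the setup, here is a minimal sketch (not the paper's code) of what "providing context to the evaluator" can look like: the underspecified query is paired with a few clarifying question-answer pairs, and the judge prompt is built with or without them.

```python
# Minimal sketch of a contextualized pairwise judge prompt: the same
# underspecified query is judged with or without clarifying context.
# Prompt wording and structure are illustrative, not the paper's exact setup.

def build_judge_prompt(query, response_a, response_b, context=None):
    parts = [f"Query: {query}"]
    if context:  # context = clarifying question/answer pairs about the user
        parts.append("Context about the user and their intent:")
        parts += [f"- Q: {q} A: {a}" for q, a in context]
    parts += [
        f"Response A: {response_a}",
        f"Response B: {response_b}",
        "Which response better serves this user? Answer 'A' or 'B' with a brief justification.",
    ]
    return "\n".join(parts)

context = [
    ("Who is the user?", "A high-school student preparing for an exam."),
    ("What level of detail do they want?", "A short, intuitive explanation."),
]
print(build_judge_prompt(
    "Explain entropy.",
    "Entropy measures how spread out or uncertain a system's states are...",
    "Formally, S = k_B ln(Omega), derived from statistical mechanics...",
    context=context,
))
# Without `context=...`, the judge sees only the bare query, and the verdict
# can swing arbitrarily depending on which user it implicitly imagines.
```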