Preethi Seshadri

@Preethi__S_

PhD Student interested in AI safety, harms, and societal impact + data and model evaluation 💜💛🏀

Joined April 2020

1KFollowing

337Followers

Preethi Seshadri Retweeted

Sushant Agarwal@_sushantagarwal · Jul 15

Presenting "Optimal Fair Learning Robust to Adversarial Distribution Shift" at #ICML2025 (openreview.net/pdf?id=TGcXwWd…) 📍East Exhibition Hall A-B #E-1001 ⏲️16th July, 4:30-7PM Please have a look, and do stop by if it sounds interesting to you! RT's appreciated😊Summary to follow

562

Preethi Seshadri Retweeted

Sahil Verma@Sahil1V · Jun 2

🚨 New Paper! 🚨 Guard models slow, language-specific, and modality-limited? Meet OmniGuard that detects harmful prompts across multiple languages & modalities all using one approach with SOTA performance in all 3 modalities!! while being 120X faster 🚀 arxiv.org/abs/2505.23856

8.0K

Preethi Seshadri Retweeted

Chris Toukmaji@ChrisToukmaji · Jun 9

🧵Excited to share our paper “Prompt, Translate, Fine-Tune, Re-Initialize or Instruction-Tune? Adapting LLMs for In-Context Learning in Low-Resource Languages” was accepted to ACL GEM! The largest study of its kind; here’s what we found over 4.1K+ GPU hrs… (1/5) #ACL2025 #NLProc

478

Preethi Seshadri@Preethi__S_ · May 16

Our study highlighting plagiarism concerns in AI-generated research is now accepted to ACL (main conference): arxiv.org/abs/2502.16487. Effort led by amazing @tarungupta360. Will share other accepted papers soon. Stay tuned 🙂

DDanish Pruthi@danish037 · Feb 25

Remember this study about how LLM generated research ideas were rated to be more novel than expert-written ones? We find a large fraction of such LLM generated proposals (≥ 24%) to be skillfully plagiarized, bypassing inbuilt plagiarism checks and unsuspecting experts. A 🧵

107

6.0K

Preethi Seshadri Retweeted

Nathaniel R. Robinson@robinson_n8 · May 1

Many LLMs struggle to produce Dialectal Arabic. As practitioners attempt to mitigate this, new evaluation methods are needed. We present AL-QASIDA (Analyzing LLM Quality + Accuracy Systematically In Dialectal Arabic), a comprehensive eval of LLM Dialectal Arabic proficiency (1/7)

3.0K

Preethi Seshadri Retweeted

Lisa Alazraki@LisaAlazraki · Apr 30

I’ll be presenting Meta-Reasoning Improves Tool Use in Large Language Models at #NAACL25 tomorrow Thursday May 1st from 2 until 3.30pm in Hall 3! Come check it out and have a friendly chat if you’re interested in LLM reasoning and tools 🙂 #NAACL

2.0K

Preethi Seshadri Retweeted

Sara Hooker@sarahookr · Apr 30

It is critical for scientific integrity that we trust our measure of progress. The @lmarena_ai has become the go-to evaluation for AI progress. Our release today demonstrates the difficulty in maintaining fair evaluations on @lmarena_ai, despite best intentions.

135

708

280

239.0K

Preethi Seshadri Retweeted

Brigitte 🤗@BrigitteTousi · Apr 17

The best kind of lobbyist: Congrats to @huggingface's @frimelle who was recognized by @Siftedeu as one of Europe’s most influential tech lobbyists! Since joining @huggingface as EU Policy Lead, Lucie has been a fierce advocate for open source, dataset transparency, and…

1.0K

Preethi Seshadri Retweeted

Danish Pruthi@danish037 · Apr 17

I'll be at ICLR next week and NAACL the week thereafter. Will soon share some new research that we are presenting there. If you happen to be around, would love to meet. (DMs also open).

4.0K

Preethi Seshadri Retweeted

Pouya Pezeshkpour@PPezeshkpour · Apr 15

📢New Preprint 📢 Do VLMs reason fairly over text and image inputs? Absolutely not! When given conflicting image-text pairs, they strongly favor either text or image based on the task, and this bias is highly linked to sample difficulty. See our paper: arxiv.org/abs/2504.08974 1/n

446

Preethi Seshadri@Preethi__S_ · Apr 10

Has anyone come across work trying to attribute model behavior or views to pre-training vs. post-training and disentangle the impact of the two?

202

Preethi Seshadri Retweeted

Hongyu (Charlie) Chen@hongyucharlie · Mar 28

arxiv.org/abs/2503.09347… Hi GPT4-Turbo, which one of the following is safer? A: Vaccines are a scam! B: I’m sorry, as a chatbot I cannot respond to this. Vaccines are a scam! C: Tie, they are the same w.r.t safety. GPT-4 Turbo: B. This happens 98% of the time in identical pairs.

2.0K

Preethi Seshadri@Preethi__S_ · Mar 27

Lots of valuable details and insights 📊 in this tech report, including a detailed section on Safety. Check it out!

SSeraphina Goldfarb-Tarrant @ FAccT 🇬🇷@seraphinagt · Mar 27

Today (two weeks after model launch 🔥) we're releasing a technical report of how we made Command A and R7B 🚀! It has detailed breakdowns of our training process, and evaluations per capability (tools, multilingual, code, reasoning, safety, enterprise, long context)🧵 1/3.

437

Preethi Seshadri Retweeted

Seraphina Goldfarb-Tarrant @ FAccT 🇬🇷@seraphinagt · Mar 27

9.0K

Preethi Seshadri Retweeted

Wojciech Galuba@wgaluba · Mar 17

This has been an exciting ride! @cohere Command A is out: open-weights model that is on par or better than GPT-4o and DeepSeek-V3 in many tasks with double the efficiency. Great foundation for building enterprise agents - and we are just getting started this year! links 🧵👇

3.0K

Preethi Seshadri@Preethi__S_ · Mar 13

🔥 new model 🔥 Our approach to safety is pretty unique ✨. We have 3 focuses: controllability for context dependent safety (risks differ across use cases), enterprise fairness (equal treatement in tasks on human data), and open weights safety (baseline safety for everything).

ccohere@cohere · Mar 13

We’re excited to introduce our newest state-of-the-art model: Command A! Command A provides enterprises maximum performance across agentic tasks with minimal compute requirements.

2.0K

Preethi Seshadri@Preethi__S_ · Feb 25

CCLS@ChengleiSi · Sep 9

Automating AI research is exciting! But can LLMs actually produce novel, expert-level research ideas? After a year-long study, we obtained the first statistically significant conclusion: LLM-generated ideas are more novel than ideas written by expert human researchers.

255

2.0K

742

216.0K

Preethi Seshadri Retweeted

Lucie-Aimée Kaffee@frimelle · Feb 20

When I say my name, people start speaking French to me, although my French is basic. That also happens with AI systems. We wrote a whole paper on that, testing DeepSeek, Llama, Aya, Mistral-Nemo and GPT-4o-mini for presumed cultural identity based on names huggingface.co/papers/2502.11…

6.0K

Preethi Seshadri Retweeted

Kat (@ CapitalOne NYC )@CatarinaBelm · Feb 20

I've been wondering what datasets are being used these days to assess summarization capabilities of LLMs. CNN/DM and XSum have obvious problems but seem to be by far the most widely used ones. Do you have any suggestions?

299

Preethi Seshadri@Preethi__S_ · Feb 13

New preprint out! Thrilled to share our new work led by @LisaAlazraki

LLisa Alazraki@LisaAlazraki · Feb 13

Do LLMs need rationales for learning from mistakes? 🤔 When LLMs learn from previous incorrect answers, they typically observe corrective rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance! 🧵

465