Diana Abagyan
@dianaabagyan
Research Scholar @Cohere_Labs
🚨New pretraining paper on multilingual tokenizers 🚨 Super excited to share my work with @Cohere_Labs: One Tokenizer To Rule Them All: Emergent Language Plasticity via Multilingual Tokenizers

I’m very excited to be co-organizing this @NeurIPSConf workshop on LLM evaluations! Evaluating LLMs is a complex and evolving challenge. With this workshop, we hope to bring together diverse perspectives to make real progress. See the details below:
We are happy to announce our @NeurIPSConf workshop on LLM evaluations! Mastering LLM evaluation is no longer optional -- it's fundamental to building reliable models. We'll tackle the field's most pressing evaluation challenges. For details: sites.google.com/corp/view/llm-…. 1/3
Prompt engineering places all the work on the end user to try to squeeze out performance. It's a hack to deal with the limitations in our models' adaptability. In the future, this should happen behind the scenes and be inferred automatically.
Prompts shouldn't have to be engineered. Our latest research marks another step towards fluid, natural language communication with LLMs.
🚨 New paper drop! 🚨 🤔 When a transformer sees a sequence that could be explained by many rules, which rule does it pick? It chooses the simplest sufficient one! 🧵👇
🚨 New preprint! 🚨 Phase transitions! We love to see them during LM training. Syntactic attention structure, induction heads, grokking; they seem to suggest the model has learned a discrete, interpretable concept. Unfortunately, they’re pretty rare—or are they?
Can we improve the performance of LLMs during inference without the need for extensive sampling OR special reward models? 🤔 Our latest work introduces a new inference time scaling recipe that is sample-efficient, multilingual, and suitable for multi-task requirements. 🍋
💪🏼Huge thanks to my incredible mentors: Julia Kreutzer, @mrdanieldsouza, @YeS855811, @sarahookr for guiding me and supporting this work ✨ Find our arXiv release here! 📜: arxiv.org/abs/2506.20544
🚀 Want better LLM performance without extra training or special reward models? Happy to share my work with @Cohere_Labs: "When Life Gives You Samples: Benefits of Scaling Inference Compute for Multilingual LLMs" 👀How we squeeze more from less at inference 🍋, details in 🧵
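The thread above is about scaling inference compute by sampling. As generic background (not the paper's actual recipe, which is in the thread), the simplest form of inference-time scaling is best-of-n sampling: draw several candidate completions and keep the highest-scoring one. The `generate` and `score` callables here are hypothetical placeholders.

```python
def best_of_n(generate, score, prompt, n=8):
    """Sample n candidate completions and return the highest-scoring one.

    `generate` and `score` are placeholders for any sampler and any
    quality heuristic (e.g. model log-probability or majority voting);
    the paper studies more sample-efficient selection strategies.
    """
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=score)
```

Spending more compute here means raising `n`; the paper's point is that the selection strategy determines how much benefit each extra sample buys.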
How can AI capture the nuances of different languages?💬🗨️ By using a team of specialized teacher models via Multilingual Arbitration, we've achieved up to 19.5% improvement in win rates across languages. Find us at ACL to discuss how we can further break down language barriers.
🤹 How do we move away from complicated and brittle prompt engineering at inference for under-represented tasks?🤔 🧠 Our latest work finds that optimizing training protocols improves controllability and boosts performance on underrepresented use cases at inference time 📈
Can we train models for better inference-time control instead of over-complex prompt engineering❓ Turns out the key is in the data — adding fine-grained markers boosts performance and enables flexible control at inference🎁 Huge congrats to @mrdanieldsouza for this great work
🚨 Wait, adding simple markers 📌during training unlocks outsized gains at inference time?! 🤔 🚨 Thrilled to share our latest work at @Cohere_Labs: “Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers“ that explores this phenomenon! Details in 🧵 ⤵️
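The tweets above describe attaching metadata markers to examples at training time so they can be used as control handles at inference. A minimal illustrative sketch of the idea (the marker names and format here are hypothetical, not the paper's actual schema):

```python
def add_markers(example: str, markers: dict) -> str:
    """Prepend training-time metadata markers to a training example.

    Marker keys/values (e.g. 'domain', 'language') are illustrative;
    the paper defines its own marker set. Sorting keys keeps the
    prefix deterministic across examples.
    """
    prefix = " ".join(f"<{k}:{v}>" for k, v in sorted(markers.items()))
    return f"{prefix} {example}"
```

At inference, supplying the same markers in the prompt then steers the model toward the targeted slice of the long tail, replacing brittle prompt engineering with an explicit control interface.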
Global MMLU is revolutionizing multilingual AI. 🌍 Recognized by Stanford HAI and adopted by top labs, it's the benchmark for fair evaluation across 42 languages. Looking forward to sharing this work at ACL in Vienna next month. 🇦🇹
Our paper M-RewardBench got accepted to ACL main: arxiv.org/abs/2410.15522 We construct a first-of-its-kind multilingual RM evaluation benchmark and use it to examine the performance of several reward models in non-English settings, along with other interesting insights.
amazing work!!!
Huge congrats to @dianaabagyan on her first first-author paper. It was a pleasure collaborating on this work — we ask what cheap interventions in pre-training can allow for more language plasticity downstream.
Excellent work by @dianaabagyan💎 We show that a "universal" tokenizer, covering more than just the primary languages, greatly boosts new-language adaptation without hurting pretraining performance 🚀 A critical study for multilingual LLMs given the huge cost of pretraining🔥
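One common way to see why tokenizer coverage matters (background intuition, not the paper's evaluation protocol) is tokenizer "fertility": the average number of tokens per word. A tokenizer with poor coverage of a language fragments its words heavily, inflating fertility; a universal tokenizer aims to keep it low across many languages. A toy sketch of the metric:

```python
def fertility(tokenize, texts):
    """Average tokens per whitespace-delimited word.

    `tokenize` is any callable mapping a string to a token list.
    High fertility on a language signals that the tokenizer
    fragments that language heavily.
    """
    n_words = sum(len(t.split()) for t in texts)
    n_tokens = sum(len(tokenize(t)) for t in texts)
    return n_tokens / n_words
```

For instance, a whitespace tokenizer has fertility 1.0 by construction, while a character-level tokenizer on the same text has much higher fertility — real subword tokenizers fall in between, and where they fall depends on language coverage.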