Zeming Chen
@eric_zemingchen
PhD Candidate, NLP Lab @EPFL; Research Scientist Intern @AIatMeta; Ex @AIatMeta (FAIR) @allen_ai #AI #ML #NLP
We present MEDITRON, a pair of new open-access #LLMs (70B & 7B) adapted to the medical domain. They achieve new SoTA open-source performance on common medical benchmarks, outperforming #GPT-3.5 and Med-PaLM and coming within 5% of #GPT4. Find out how we did this ⬇️

🤔 Have @OpenAI o3, Gemini 2.5, Claude 3.7 formed an internal world model to understand the physical world, or just align pixels with words? We introduce WM-ABench, the first systematic evaluation of VLMs as world models. Using a cognitively-inspired framework, we test 15 SOTA…
🚨New Preprint!! Thrilled to share with you our latest work: “Mixture of Cognitive Reasoners”, a modular transformer architecture inspired by the brain’s functional networks: language, logic, social reasoning, and world knowledge. 1/ 🧵👇
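The thread's one-line description (specialized expert modules composed like the brain's functional networks) suggests something like the toy layer below: four small expert blocks, one per network, mixed per token by a learned router. This is a minimal sketch under my own assumptions about shapes and routing; every name here is illustrative, not the paper's actual architecture.

```python
# Toy sketch of a brain-inspired modular layer: four expert blocks
# ("language", "logic", "social", "world") mixed by a learned router.
# Purely illustrative assumptions; not the paper's actual architecture.
import torch
import torch.nn as nn

class CognitiveReasonerLayer(nn.Module):
    def __init__(self, d_model=256, n_heads=4):
        super().__init__()
        self.experts = nn.ModuleDict({
            name: nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
            for name in ["language", "logic", "social", "world"]
        })
        self.router = nn.Linear(d_model, len(self.experts))  # per-token weights

    def forward(self, x):  # x: (batch, seq, d_model)
        weights = torch.softmax(self.router(x), dim=-1)                     # (B, S, 4)
        outs = torch.stack([e(x) for e in self.experts.values()], dim=-1)  # (B, S, D, 4)
        return (outs * weights.unsqueeze(2)).sum(-1)                        # weighted mix

layer = CognitiveReasonerLayer()
print(layer(torch.randn(2, 16, 256)).shape)  # torch.Size([2, 16, 256])
```

In this toy version, the router's softmax weights decide how much each "cognitive network" contributes to every token's representation.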
If you’re at @iclr_conf this week, come check out our spotlight poster INCLUDE during the Thursday 3:00–5:30pm session! I will be there to chat about all things multilingual & multicultural evaluation. Feel free to reach out anytime during the conference. I’d love to connect!
🚀 Introducing INCLUDE 🌍: A multilingual LLM evaluation benchmark spanning 44 languages! Contains *newly-collected* data, prioritizing *regional knowledge*. Setting the stage for truly global AI evaluation. Ready to see how your model measures up? #AI #Multilingual #LLM #NLProc
NEW PAPER ALERT: Generating visual narratives to illustrate textual stories remains an open challenge: models lack the knowledge needed to constrain faithful, self-consistent generation. Our #CVPR2025 paper proposes a new benchmark, VinaBench, to address this challenge.
🚨 New Preprint!! LLMs trained on next-word prediction (NWP) show high alignment with brain recordings. But what drives this alignment—linguistic structure or world knowledge? And how does this alignment evolve during training? Our new paper explores these questions. 👇🧵
🚨 New Paper! Can neuroscience localizers uncover brain-like functional specializations in LLMs? 🧠🤖 Yes! We analyzed 18 LLMs and found units mirroring the brain's language, theory of mind, and multiple demand networks! w/ @GretaTuckute, @ABosselut, & @martin_schrimpf 🧵👇
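For intuition, here is a hedged sketch of how a functional localizer can be ported from neuroscience to an LLM: present a contrast (e.g., sentences vs. matched non-word strings), record each unit's activation per item, and keep the units with the largest sentence-over-nonword t-statistic. The model (gpt2), stimuli, layer, and top-10 cutoff are all my illustrative assumptions, not the paper's setup.

```python
# Hedged sketch of a "language localizer" applied to LLM units.
# Model, stimuli, layer, and cutoff are assumptions for illustration only.
import numpy as np
import torch
from scipy import stats
from transformers import AutoModel, AutoTokenizer

MODEL = "gpt2"  # small model, just for illustration
tok = AutoTokenizer.from_pretrained(MODEL)
model = AutoModel.from_pretrained(MODEL, output_hidden_states=True)

sentences = ["The chef tasted the soup before serving it.",
             "A storm delayed every flight out of Geneva."]
nonwords = ["Blick frop midge taber lune vask pringle dort.",
            "Snerp walith bruno clast fimble rask odune yelt."]

@torch.no_grad()
def unit_activations(texts, layer=6):
    """Mean-pool each text's hidden states at one layer -> (n_texts, n_units)."""
    rows = []
    for t in texts:
        h = model(**tok(t, return_tensors="pt")).hidden_states[layer][0]
        rows.append(h.mean(dim=0).numpy())
    return np.stack(rows)

sent_act = unit_activations(sentences)
nw_act = unit_activations(nonwords)

# Per-unit contrast: which units respond more to sentences than non-words?
t_vals, _ = stats.ttest_ind(sent_act, nw_act, axis=0)
language_units = np.argsort(t_vals)[-10:]  # top-10 "language-selective" units
print("candidate language-selective units:", language_units)
```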
📘 Could ChatGPT get an engineering degree? Spoiler: yes! In our new @PNASNews article, we explore how AI assistants like GPT-4 perform in STEM university courses; on average, they pass a staggering 91.7% of core courses. 🧵 #AI #HigherEd #STEM #LLMs #NLProc
🚨 New Paper!! How can we train LLMs using 100M words? In our @babyLMchallenge paper, we introduce a new self-synthesis training recipe to tackle this question! 🍼💻 This was a fun project co-led by me, @yingtian80536, @akgokce0, w/ @HannesMehrer & @martin_schrimpf 🧵⬇️
Alignment is necessary for LLMs, but do we need to train aligned versions for all model sizes in every model family? 🧐 We introduce 🚀Nudging, a training-free approach that aligns any base model by injecting a few nudging tokens at inference time. 🌐fywalter.github.io/nudging/…
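As I read the tweet, the core mechanic can be sketched in a few lines: decode with the base model, and whenever its next-token confidence drops below a threshold, splice in ("nudge") a token from a small aligned model instead. Greedy decoding, the 0.4 threshold, and the model pair below are my assumptions, not the paper's exact recipe; see the project page for the real method.

```python
# Hedged sketch of inference-time "nudging": NOT the paper's exact algorithm.
# Assumptions (mine, not the authors'): greedy decoding, a fixed confidence
# threshold, and this particular base/aligned model pair.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

BASE = "meta-llama/Llama-2-7b-hf"          # unaligned base model (assumed choice)
ALIGNED = "meta-llama/Llama-2-7b-chat-hf"  # small aligned model (assumed choice)

device = "cuda" if torch.cuda.is_available() else "cpu"
tok = AutoTokenizer.from_pretrained(BASE)  # both checkpoints share a tokenizer
base = AutoModelForCausalLM.from_pretrained(BASE).to(device)
aligned = AutoModelForCausalLM.from_pretrained(ALIGNED).to(device)

@torch.no_grad()
def nudged_generate(prompt: str, max_new_tokens: int = 128,
                    threshold: float = 0.4) -> str:
    ids = tok(prompt, return_tensors="pt").input_ids.to(device)
    for _ in range(max_new_tokens):
        probs = torch.softmax(base(ids).logits[0, -1], dim=-1)
        top_p, top_id = probs.max(dim=-1)
        if top_p < threshold:
            # Base model is uncertain: inject the aligned model's token instead.
            top_id = aligned(ids).logits[0, -1].argmax()
        if top_id.item() == tok.eos_token_id:
            break
        ids = torch.cat([ids, top_id.view(1, 1)], dim=-1)
    return tok.decode(ids[0], skip_special_tokens=True)

print(nudged_generate("How do I politely decline a meeting?"))
```

The appeal of a design like this is that alignment behavior transfers at inference time, so no aligned fine-tune is needed for each model size.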
Hey #NLProc folks, we had a lot of fun last year, so we're inviting guest lecturers again for our Topics in NLP course during the Fall 2024 semester at EPFL! More information here: t.ly/QMTCA Please share and RT!