Ningyu Zhang@ZJU
@zxlzr
Associate Professor @ZJU_China. Research interests include NLP, LLM, KG, Agent, Knowledge Editing.
🚀 This year, we’ve rolled out a series of updates to EasyEdit1—and dropped EasyEdit2 to steer LLM behavior on the fly! 🔧✨ 👉 Code: github.com/zjunlp/EasyEdit What’s new? • Datasets: Integrated AKEW, LEME & UNKE • Methods: NAMET, CORE, UNKE, AnyEdit & Reference-free Preference…


This article from @TheEconomist offers an accurate overview of key dynamics shaping the development of AI today: the risks of the rapid race toward AGI and ASI, the challenges posed by open-sourcing frontier models, the deep uncertainty revealed by ongoing scientific debates and…
Due to a scheduling conflict, I won’t be able to attend #ACL2025 in person. Our group will be presenting the following works—feel free to connect and chat with our team members at the conference! Main conferences: Beyond Prompt Engineering: Robust Behavior Control in LLMs via…

Thrilled to introduce "𝗗𝗲𝗲𝗽 𝗥𝗲𝘀𝗲𝗮𝗿𝗰𝗵𝗲𝗿 𝘄𝗶𝘁𝗵 𝗧𝗲𝘀𝘁-𝗧𝗶𝗺𝗲 𝗗𝗶𝗳𝗳𝘂𝘀𝗶𝗼𝗻," a new deep research agent designed to mimic the iterative nature of human research, complete with cycles of planning, drafting, and revision. 🚀🚀 arxiv.org/pdf/2507.16075
Do you find RL makes LLM reasoning more stubborn, with the model repeating the same answers? How can multi-turn conversational history help RL training? We identify that a simple "try again" feedback can boost reasoning and turn RL training into a conversational process!…
Will conversation history help reasoning? We found that when models mess up once, they often get stuck. Surprisingly, a simple “try again” fixes this — and boosts reasoning.🧵 Project Page: unary-feedback.github.io
New Anthropic Research: “Inverse Scaling in Test-Time Compute” We found cases where longer reasoning leads to lower accuracy. Our findings suggest that naïve scaling of test-time compute may inadvertently reinforce problematic reasoning patterns. 🧵
WHY do you prefer something over another? Reward models treat preference as a black-box😶🌫️but human brains🧠decompose decisions into hidden attributes We built the first system to mirror how people really make decisions in our #COLM2025 paper🎨PrefPalette✨ Why it matters👉🏻🧵
🚨 The era of infinite internet data is ending, So we ask: 👉 What’s the right generative modelling objective when data—not compute—is the bottleneck? TL;DR: ▶️Compute-constrained? Train Autoregressive models ▶️Data-constrained? Train Diffusion models Get ready for 🤿 1/n
Multimodal models still leak harmful text when attackers mix tricky words and images. AutoSteer adds a lightweight layer-aware safety prober and refusal head to frozen multimodal LLMs, driving attack success below 5% while leaving regular performance unchanged. A safety awareness…
A simple AGI safety technique: the AI’s thoughts are in plain English, so just read them. We know it works, with OK (not perfect) transparency! The risk is fragility: RL training, new architectures, etc. threaten transparency. Experts from many orgs agree we should try to preserve it:…
What happened after Dream 7B? First, Dream-Coder 7B: a fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.
Mechanistic interpretability often relies on *interventions* to study how DNNs work. Are these interventions enough to guarantee the features we find are not spurious? No!⚠️ In our new paper, we show many mech int methods implicitly rely on the linear representation hypothesis🧵
GDM interp work: Do LLMs have self-preservation? Concerning recent work: models may block shutdown if it interferes with the task? But we found the model was just confused: if told to prioritize shutdown *over* the task, it complies 100% And we only needed black box methods!
🤯 Get ready for #ACL2025NLP! Featuring 3500+ paper presentations (talks & posters!), numerous workshops, several tutorials, insightful keynotes, and engaging panels! 📚🎤💡 Deep dive into the latest in #NLProc! Check out the full program here: 2025.aclweb.org/program/