Cheng Qian (@qiancheng1231)

Pinned

C

Cheng Qian@qiancheng1231 · May 27

📢 New Paper Drop: From Solving to Modeling! LLMs can solve math problems — but can they model the real world? 🌍 📄 arXiv: arxiv.org/pdf/2505.15068 💻 Code: github.com/qiancheng0/Mod… Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.

qiancheng1231's tweet image. 📢 New Paper Drop: From Solving to Modeling!
LLMs can solve math problems — but can they model the real world? 🌍

📄 arXiv: arxiv.org/pdf/2505.15068
💻 Code: github.com/qiancheng0/Mod…

Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.

3

29

101

51

12.0K

C

Cheng Qian@qiancheng1231 · Jul 24

Won't be at ACL in person this time, but come and chat with Emre about our new paper on mitigation of tool overuse!

EEmre Can Acikgoz @acl2025@emrecanacikgoz · Jul 24

I'll be @aclmeeting in Vienna to present our recent agent papers SMART and CoALM! 🇦🇹🤖 #acl2025 Feel free to stop by our posters to exchange ideas and discuss agents together!

0

5

0

487

Cheng Qian Retweeted

E

Emre Can Acikgoz @acl2025@emrecanacikgoz · Jul 24

I'll be @aclmeeting in Vienna to present our recent agent papers SMART and CoALM! 🇦🇹🤖 #acl2025 Feel free to stop by our posters to exchange ideas and discuss agents together!

0

4

23

1

1.0K

Cheng Qian Retweeted

H

Hongru Wang@HongruWang007 · Apr 9

Write a blog to share my recent thoughts about knowledge boundaries & tool use & language agent. This is the first time to propose three laws of knowledge boundaries!🔥 candle-walker-56d.notion.site/NAACL-2025-Ora… Chinese Version: mp.weixin.qq.com/s/XzjiLUFAr1Yc…

1

7

31

4

2.0K

Cheng Qian Retweeted

M

May Fung ✈️@ACL'25 🇦🇹@May_F1_ · Jul 2

🧠 How can AI evolve from statically 𝘵𝘩𝘪𝘯𝘬𝘪𝘯𝘨 𝘢𝘣𝘰𝘶𝘵 𝘪𝘮𝘢𝘨𝘦𝘴 → dynamically 𝘵𝘩𝘪𝘯𝘬𝘪𝘯𝘨 𝘸𝘪𝘵𝘩 𝘪𝘮𝘢𝘨𝘦𝘴 as cognitive workspaces, similar to the human mental sketchpad? 🔍 What’s the 𝗿𝗲𝘀𝗲𝗮𝗿𝗰𝗵 𝗿𝗼𝗮𝗱𝗺𝗮𝗽 from tool-use → programmatic…

0

61

180

116

13.0K

Cheng Qian Retweeted

Y

Yuji Zhang@Yuji_Zhang_NLP · Jun 18

🧠Let’s teach LLMs to learn smarter, not harder💥[arxiv.org/pdf/2506.06972] 🤖How can LLMs verify complex scientific information efficiently? 🚀We propose modular, reusable atomic reasoning skills that reduce LLMs’ cognitive load to verify scientific claims with little data.…

7

32

106

58

10.0K

C

Cheng Qian@qiancheng1231 · Jun 8

Excited to share that EmbodiedBench was selected for an Oral at ICML 2025! We recently added results for new models (InternVL3, Gemma3, Ovis2) and released a large agent trajectory dataset on 🤗: embodiedbench.github.io Try training and evaluating your MLLM for embodied agents!

RRui Yang@RuiYang70669025 · Feb 14

🤖Can MLLM agents reason about spatial relationships and plan atomic actions for navigation & manipulation? 🔥 Meet EmbodiedBench 🏆—the first fine-grained benchmark for MLLM-based embodied agents! 📄 Paper: arxiv.org/abs/2502.09560 🌐 Website & code: embodiedbench.github.io

2

21

93

32

12.0K

C

Cheng Qian@qiancheng1231 · Jun 11

What is key of agent decision making? Is there a decision making boundary? I am always thinking of the potential boundary of correct decision making and the uncertainty of this boundary. The alignment of decision making boundary and tool-use boundary led by @WangCarrey…

HHongru Wang@HongruWang007 · Jun 3

What’s is the agent? What is the optimal behavior to achieve the predefined goal? And how to learn that behavior policy? We formally introduce a systematic Theory of Agent (ToA), analogous to the cognitive framework of Theory of Mind (ToM). Where ToM refers to the ability to…

3

17

69

28

8.0K

Cheng Qian Retweeted

X

Xiusi Chen@xiusi_chen · Jun 4

Can LLMs make rational decisions like human experts? 📖Introducing DecisionFlow: Advancing Large Language Model as Principled Decision Maker We introduce a novel framework that constructs a semantically grounded decision space to evaluate trade-offs in hard decision-making…

2

15

55

26

5.0K

C

Cheng Qian@qiancheng1231 · Jun 3

Theory of Agent: From reasoning and tool use, we are defining agent from a knowledge and behavior driven perspective. Welcome to check our newest release!! arxiv.org/pdf/2506.00886

HHongru Wang@HongruWang007 · Jun 3

What’s is the agent? What is the optimal behavior to achieve the predefined goal? And how to learn that behavior policy? We formally introduce a systematic Theory of Agent (ToA), analogous to the cognitive framework of Theory of Mind (ToM). Where ToM refers to the ability to…

0

1

4

616

Cheng Qian Retweeted

H

Hongru Wang@HongruWang007 · Jun 3

What’s is the agent? What is the optimal behavior to achieve the predefined goal? And how to learn that behavior policy? We formally introduce a systematic Theory of Agent (ToA), analogous to the cognitive framework of Theory of Mind (ToM). Where ToM refers to the ability to…

1

29

129

79

18.0K

Cheng Qian Retweeted

P

Peixuan Han (韩沛煊)@peixuanhakhan · May 30

(1/5) Want to make your LLM a skilled persuader? Check out our latest paper: "ToMAP: Training Opponent-Aware LLM Persuaders with Theory of Mind"! For details: 📄Arxiv: arxiv.org/pdf/2505.22961 🛠️GitHub: github.com/ulab-uiuc/ToMAP

2

6

24

9

2.0K

C

Cheng Qian@qiancheng1231 · May 27

Mathematical modeling is a key way for our humans to understand how the world runs. If you truly believe in your agent, you should test them on our new benchmark!

CCheng Qian@qiancheng1231 · May 27

📢 New Paper Drop: From Solving to Modeling! LLMs can solve math problems — but can they model the real world? 🌍 📄 arXiv: arxiv.org/pdf/2505.15068 💻 Code: github.com/qiancheng0/Mod… Introducing ModelingAgent, a breakthrough system for real-world mathematical modeling with LLMs.

0

4

15

1

1.0K

C

Cheng Qian@qiancheng1231 · May 22

While building Agents for Enterprise applications, one thing is very important: not to overuse tool-calling with LLMs - that makes your AI agent very expensive. In our new ACL paper, we show a method to mitigate over-use of tools using a SMART way. Read more from post below 👇

CCheng Qian@qiancheng1231 · May 19

📣 SMARTAgent is accepted to ACL 2025 Findings! It’s increasingly important to form an agent’s metacognition, which we believe should guide its action and reasoning. We are continuing on this way!! Position paper will be released soon!

0

1

0

485

C

Cheng Qian@qiancheng1231 · May 19

📣 EscapeBench is accepted to ACL 2025 Main! Creativity is what many current agent works neglect, but will be extremely important for agent to reach human level intelligence and be applied to solve real world challenges. Another paper continuing this work is also on the way!

CCheng Qian@qiancheng1231 · Dec 19

💡Want to know your language model's CREATIVITY? Check our newest paper here! 📖EscapeBench: Pushing Language Models to Think Outside the Box 🌐arxiv.org/pdf/2412.13549 📊github.com/qiancheng0/Esc… Challenge your LM to innovatively use tools and escape from conventional thoughts!

0

3

32

4

2.0K