Qian Liu (@sivil_taram)

Pinned

Q

Qian Liu@sivil_taram · Jul 17

🔥 LLMs can fix bugs, but can they make your code faster? We put them to the test on real-world repositories, and the results are in! 🚀 New paper: "SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?" Key findings: 1️⃣ We introduce SWE-Perf, the…

sivil_taram's tweet image. 🔥 LLMs can fix bugs, but can they make your code faster? We put them to the test on real-world repositories, and the results are in!

🚀 New paper: "SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?"

Key findings:
1️⃣ We introduce SWE-Perf, the…

1

17

63

28

6.0K

Q

Qian Liu@sivil_taram · Jul 25

😝 Thanks to @sivil_taram for the evaluation. It's great to see Qwen3-Coder performing excellently on SWE-Perf! The community also needs more outstanding evaluations like SWE-Perf to continue guiding the development of CodeLLM!

QQian Liu@sivil_taram · Jul 25

🚀 Just one week after SWE-Perf launched (the first repository-level benchmark for realistic code performance optimization), Qwen3-Coder drops and IMMEDIATELY takes the crown! 👑 Released just 3 days ago, Qwen3-Coder with OpenHands is now the top performer on SWE-Perf's…

2

3

38

4

4.0K

Q

Qian Liu@sivil_taram · Jul 25

🚀 Just one week after SWE-Perf launched (the first repository-level benchmark for realistic code performance optimization), Qwen3-Coder drops and IMMEDIATELY takes the crown! 👑 Released just 3 days ago, Qwen3-Coder with OpenHands is now the top performer on SWE-Perf's…

QQian Liu@sivil_taram · Jul 17

🔥 LLMs can fix bugs, but can they make your code faster? We put them to the test on real-world repositories, and the results are in! 🚀 New paper: "SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?" Key findings: 1️⃣ We introduce SWE-Perf, the…

1

3

39

10

6.0K

Q

Qian Liu@sivil_taram · Jul 25

Definitely worth a read, MoE plus RL

CChujie Zheng@ChujieZheng · Jul 25

Proud to introduce Group Sequence Policy Optimization (GSPO), our stable, efficient, and performant RL algorithm that powers the large-scale RL training of the latest Qwen3 models (Instruct, Coder, Thinking) 🚀 📄 huggingface.co/papers/2507.18…

0

12

3

1.0K

Q

Qian Liu@sivil_taram · Jul 23

Wrapped up a SWE-Perf website redesign using Qwen3-Coder on AnyCoder (huggingface.co/spaces/akhaliq…). The process was incredibly fast and great! One question for Qwen devs, though: did you pretrain a secret love for the color purple into the coder's persona? 😉

sivil_taram's tweet image. Wrapped up a SWE-Perf website redesign using Qwen3-Coder on AnyCoder (huggingface.co/spaces/akhaliq…). The process was incredibly fast and great!

One question for Qwen devs, though: did you pretrain a secret love for the color purple into the coder's persona? 😉

1

14

84

22

24.0K

Qian Liu Retweeted

X

Xiao Ma@yusufma555 · Jul 22

🚀🚀🚀 Ever wondered what it takes for robots to handle real-world household tasks? long-horizon execution, deformable object dexterity, and unseen object generalization — meet GR-3, ByteDance Seed’s new Vision-Language-Action (VLA) model! GR-3 is a generalizable…

8

89

494

291

40.0K

Q

Qian Liu@sivil_taram · Jul 23

The most rewarding moment in research: hearing someone say "This actually works in our scenario!" ✨

3

2

63

3

4.0K

Q

Qian Liu@sivil_taram · Jul 23

Apart from the performance, it’s pure entertainment just watching Qwen3‑Coder build Qwen Code all by itself. Agentic coding is really something: it explores, understands, plans, and acts seamlessly. Honored to be “in the game”—even if my entire work so far is smashing the Enter…

QQwen@Alibaba_Qwen · Jul 22

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

2

10

44

7

4.0K

Qian Liu Retweeted

Q

Qwen@Alibaba_Qwen · Jul 22

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

273

1.0K

9.0K

4.0K

1.8M

Q

Qian Liu@sivil_taram · Jul 22

After three intense months of hard work with the team, we made it! We hope this release can help drive the progress of Coding Agents. Looking forward to seeing Qwen3-Coder continue creating new possibilities across the digital world!

QQwen@Alibaba_Qwen · Jul 22

>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…

58

83

939

108

61.0K

Qian Liu Retweeted

M

Marktechpost AI Dev News ⚡@Marktechpost · Jul 21

TikTok Researchers Introduce SWE-Perf: The First Benchmark for Repository-Level Code Performance Optimization SWE-Perf, introduced by TikTok researchers, is the first benchmark designed to evaluate large language models (LLMs) on repository-level code performance optimization.…

0

10

23

5

1.0K

Qian Liu Retweeted

A

All Hands AI@allhands_ai · Jul 19

Nice new research work by @tiktok_us on benchmarking performance optimization by LLM agents: arxiv.org/abs/2507.12415 OpenHands w/ Sonnet 3.7 achieves the best results, optimizing 44 functions in popular open-source code bases (compared to human experts' 184).

4

16

104

62

7.0K

Q

Qian Liu@sivil_taram · Jul 18

Excited to share that our two papers have been accepted to #ICML2025! @icmlconf However, I can't be there in person due to visa issues. What a pity.🥲 Feel free to check out our poster, neither online nor offline in the Vancouver Convention Center. Programming Every Example:…

ZZengzhi Wang@SinclairWang1 · Jul 18

Excited to share that our two papers have been accepted to #ICML2025! @icmlconf However, I can't be there in person due to visa issues. What a pity.🥲 Feel free to check out our poster, neither online nor offline in the Vancouver Convention Center. Programming Every Example:…

1

4

23

1

3.0K

Qian Liu Retweeted

A

AK@_akhaliq · Jul 17

SWE-Perf Can Language Models Optimize Code Performance on Real-World Repositories?

4

23

110

40

16.0K