Pengxiang Li

@oliverlee1999

Research intern at http://bigai.ai. #ComputerVision, #MultimodalLearning, #NonEuclideanOptimization Ph.D. Student of BIGAI&BIT @BIT1940.

Joined November 2022

95Following

41Followers

Pinned

Pengxiang Li@oliverlee1999 · May 26

🔥Introducing SPORT, a multimodal agent that explores tool usage without human annotation. It leverages step-wise DPO to further enhance tool-use capabilities following SFT. SPORT achieves improvements on the GTA and GAIA benchmarks. sport-agents.github.io

oliverlee1999's tweet image. 🔥Introducing SPORT, a multimodal agent that explores tool usage without human annotation. It leverages step-wise DPO to further enhance tool-use capabilities following SFT. SPORT achieves improvements on the GTA and GAIA benchmarks. sport-agents.github.io

6.0K

Pengxiang Li Retweeted

The Humanoid Hub@TheHumanoidHub · Jul 6

Noetix N2 endures some serious abuse but keeps walking.

701

306

3.0K

964

6.5M

Pengxiang Li Retweeted

Jiayi Zhang@didiforx · Jun 19

It's actually a pity that we got no enough time to maintain OpenManus during the past 3 months. But the better news is that we will build a formal open-source community for OpenManus at the end of this month.

11.0K

Pengxiang Li@oliverlee1999 · Jun 11

Looks good to me. Try to build a horror research game with MGX. Publish something or go die 😭

MMetaGPT@MetaGPT_ · Jun 11

The MGX · AI Tools Challenge is live! Build a powerful AI app and aim for the top! Just publish your app to join. 🗓️ Deadline: June 17th, 6:00 PM PT 🎁 Top reward: $500 MGX Pro + usage credits Vote daily to earn too — no app needed! [Join now] ➡️ discord.com/invite/NMrp44a…

473

Pengxiang Li@oliverlee1999 · Jun 3

🗳️ Cast your vote for Yuguang “Michael” Fang for IEEE ComSoc Board of Governors (2025–2027)! With 26+ years of service, he's committed to mentorship, inclusion, and advancing cutting-edge research. Vote now 👉 eballot.app/ieee #IEEE #ComSoc #Leadership #VoteIEEE

oliverlee1999's tweet image. 🗳️ Cast your vote for Yuguang “Michael” Fang for IEEE ComSoc Board of Governors (2025–2027)!

With 26+ years of service, he's committed to mentorship, inclusion, and advancing cutting-edge research.
Vote now 👉 eballot.app/ieee

#IEEE #ComSoc #Leadership #VoteIEEE

Pengxiang Li@oliverlee1999 · May 30

Just attended a paper sharing talk on this paper. Spatial reasoning is still a tough challenge for current VLMs, but this work makes a great step forward.

FFangfu Liu@fangfu0830 · May 30

Elevate Visual-Spatial Intelligence with Spatial-MLLM! 🚀🚀🚀 Discover how we incorporate 3D information to help MLLMs better think in space in our work: Spatial-MLLM. 🔗Code: github.com/diankun-wu/Spa… 🌐Project Page: diankun-wu.github.io/Spatial-MLLM/ 📄Paper: arxiv.org/abs/2505.23747

248

Pengxiang Li Retweeted

DeepSeek@deepseek_ai · May 29

🚀 DeepSeek-R1-0528 is here! 🔹 Improved benchmark performance 🔹 Enhanced front-end capabilities 🔹 Reduced hallucinations 🔹 Supports JSON output & function calling ✅ Try it now: chat.deepseek.com 🔌 No change to API usage — docs here: api-docs.deepseek.com/guides/reasoni… 🔗…

528

2.0K

10.0K

1.0K

1.4M

Pengxiang Li@oliverlee1999 · May 30

Security is a fundamental threshold that must be ensured before CUA can truly enter the user market.

ZZeyi Liao@LiaoZeyi · May 30

⁉️Can you really trust Computer-Use Agents (CUAs) to control your computer⁉️ Not yet, @AnthropicAI Opus 4 shows an alarming 48% Attack Success Rate against realistic internet injection❗️ Introducing RedTeamCUA: realistic, interactive, and controlled sandbox environments for…

Pengxiang Li Retweeted

Anthropic@AnthropicAI · May 22

Introducing the next generation: Claude Opus 4 and Claude Sonnet 4. Claude Opus 4 is our most powerful model yet, and the world’s best coding model. Claude Sonnet 4 is a significant upgrade from its predecessor, delivering superior coding and reasoning.

972

3.0K

21.0K

4.0K

4.2M

Pengxiang Li@oliverlee1999 · May 21

We're thrilled to announce the launch of the "Computer Use Agent (CUA)" community on AlphaXiv! 🎉 This community is dedicated to academic discussions, engineering collaborations, and creative brainstorming in the CUA field. alphaxiv.org/invite/b32af68… #CUA #AlphaXiv #AIResearch

oliverlee1999's tweet image. We're thrilled to announce the launch of the "Computer Use Agent (CUA)" community on AlphaXiv! 🎉

This community is dedicated to academic discussions, engineering collaborations, and creative brainstorming in the CUA field.

alphaxiv.org/invite/b32af68…

#CUA #AlphaXiv #AIResearch

898

Pengxiang Li Retweeted

Qwen@Alibaba_Qwen · May 13

Please check out our Qwen3 Technical Report. 👇🏻 github.com/QwenLM/Qwen3/b…

301

2.0K

538

203.0K

Pengxiang Li Retweeted

Overleaf@overleaf · May 14

⚠️ Attention: The site is currently down. Our engineering team is investigating. We will update as soon as possible. You can track progress here: status.overleaf.com Sorry for any inconvenience.

226

206

804

264.0K

Pengxiang Li Retweeted

hardmaru@hardmaru · May 12

New Paper: Continuous Thought Machines 🧠 Neurons in brains use timing and synchronization in the way that they compute, but this is largely ignored in modern neural nets. We believe neural timing is key for the flexibility and adaptability of biological intelligence. We…

571

3.0K

2.0K

239.0K

Pengxiang Li@oliverlee1999 · Apr 3

Excellent Agent training infra!

YYujia Qin@TsingYoga · Apr 3

Celebrating the open source of VeOmni (github.com/ByteDance-Seed…), part of the training infra behind UI-TARS~ You can now optionally fine-tune UI-TARS with VeOmni🥳🥳