Lei Li
@_TobiasLee
Ph.D. student@HKUNLP. Previously @PKU1898 COYG @Arsenal opinions are my own.
Thrilled to announce our MiMo-VL series hit 100K downloads on HuggingFace last month! 🚀🚀 Incredible to see the community's enthusiasm for our VLMs. More exciting updates coming soon! 😜 huggingface.co/XiaomiMiMo/MiM…

🧵 Introducing TimeScope, an open-source benchmark rigorously evaluating the true “temporal context window” of video-language models on videos ranging from 1 minute to 8 hours. #AI #MachineLearning
Tried it with Claude Code and found it much faster & cheaper than K2. My first choice for vibe coding now.
>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…
Bye Qwen3-235B-A22B, hello Qwen3-235B-A22B-2507! After talking with the community and thinking it through, we decided to stop using hybrid thinking mode. Instead, we’ll train Instruct and Thinking models separately so we can get the best quality possible. Today, we’re releasing…
Introducing ColQwen-Omni, a 3B omnimodal retriever that extends the ColPali concept of multimodal retrieval with late interaction to audio chunks and short videos, with no performance degradation on visual document retrieval relative to our best models! (1/N)
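The "late interaction" retrieval the tweet refers to (popularized by ColBERT and reused in ColPali) scores a query against a document by keeping per-token embeddings on both sides, taking each query token's maximum similarity over all document tokens, and summing. A minimal sketch with toy embeddings (the function name and the toy vectors are illustrative, not from the ColQwen-Omni codebase):

```python
import numpy as np

def late_interaction_score(query_emb: np.ndarray, doc_emb: np.ndarray) -> float:
    """MaxSim scoring: for each query token embedding, take the max
    dot-product over all document token embeddings, then sum."""
    sims = query_emb @ doc_emb.T          # (n_query_tokens, n_doc_tokens)
    return float(sims.max(axis=1).sum())  # best match per query token, summed

# Toy 2-D embeddings: two query tokens, two document tokens.
q = np.array([[1.0, 0.0],
              [0.0, 1.0]])
d = np.array([[1.0, 0.0],
              [0.5, 0.5]])
score = late_interaction_score(q, d)  # 1.0 + 0.5 = 1.5
```

Because documents keep one embedding per token (or audio chunk / video frame, in ColQwen-Omni's extension), the same scoring rule applies unchanged across modalities.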
This is my lecture from 2 months ago at @Cornell “How do I increase my output?” One natural answer is "I will just work a few more hours." Working longer can help, but eventually you hit a physical limit. A better question is, “How do I increase my output without increasing…
Huan and I are looking for a postdoc to join us on agent research (broadly defined: planning, reasoning, safety, memory, continual learning, etc.). If you have a strong record in this space, drop us an email with CV! Retweet appreciated.
🚨 Postdoc Hiring: I am looking for a postdoc to work on rigorously evaluating and advancing the capabilities and safety of computer-use agents (CUAs), co-advised with @ysu_nlp @osunlp. We welcome strong applicants with experience in CUAs, long-horizon reasoning/planning,…
What happened after Dream 7B? First, Dream-Coder 7B: a fully open diffusion LLM for code delivering strong performance, trained exclusively on public data. Plus, DreamOn cracks the variable-length generation problem! It enables code infilling that goes beyond a fixed canvas.
Check Dream-Coder 7B, a new member of the Dream Family and the most powerful open Coder DLM!!
🚀 Thrilled to announce Dream-Coder 7B — the most powerful open diffusion code LLM to date.
We present DreamOn: a simple yet effective method for variable-length generation in diffusion language models. Our approach boosts code infilling performance significantly and even catches up with oracle results.
Stop by to chat with Yiheng about Aguvis and CUA agents!!
Attending #ICML2025 🇨🇦 this week! Will be presenting Aguvis (arxiv.org/abs/2412.04454) on July 15 at 11am, and joining Computer Use Agent Workshop @workshopcua on July 19. If you’re into digital agent research, especially around computer/browser use, let’s grab a coffee!
method #2 to prompt a breakthrough: run those "what if i just..." / "will it break if ..." experiments, purely out of curiosity. the results may surprise you, and give you the clue you need
why does the breakthrough always happen 0-1 days before the deadline
great work, always like controlled (even just toy) experiments. I'm afraid we can't have OOD generalization in current overparameterized ML systems. I've recently grown fond of pattern/concept narratives. In fact, I don't believe that data-driven neural networks are capable of…
Our paper aims to answer two questions: 1. What's the difference between prediction and world models? 2. Are there straightforward metrics that can test this distinction? Our paper is about AI. But it's helpful to go back 400 years to answer these questions.
⚡️⚡️
📢 We are open sourcing ⚡Reka Flash 3.1⚡ and 🗜️Reka Quant🗜️. Reka Flash 3.1 is a much improved version of Reka Flash 3 that stands out on coding due to significant advances in our RL stack. 👩💻👨💻 Reka Quant is our state-of-the-art quantization technology. It achieves…
use our state-of-the-art research agent via api or try here app.reka.ai/research
🚀 Meet Reka Research––agentic AI that 🤔 thinks → 🔎 searches → ✏️ cites across the open web and private docs to answer your questions. 🥇 State-of-the-art performance, available now via our API and Playground!
Introducing FlexOlmo, a new paradigm for language model training that enables the co-development of AI through data collaboration. 🧵
SmolVLM has been accepted to @COLM_conf 2025 🥳! See you in Montreal!
Introducing the smollest VLMs yet! 🤏 SmolVLM (256M & 500M) runs on <1GB GPU memory. Fine-tune it on your laptop and run it on your toaster. 🚀 Even the 256M model outperforms our Idefics 80B (Aug '23). How small can we go? 👀
Excited to introduce Reka Vision, an agentic visual understanding and search platform. Transform your unstructured multimodal data into insights and actions.
At ACM MM 2025, we introduced "Multi-Agent System for Comprehensive Soccer Understanding" with @WeidiXie ⚽️ AI with external knowledge to analyze on/off-field dynamics! 🧐 📄: arxiv.org/abs/2505.03735 🌐: jyrao.github.io/SoccerAgent/ See details below! #AI4Sports #MultiAgent #ACMMM25