Zhengyang Geng

@ZhengyangGeng

PhD student @SCSatCMU with @zicokolter / curiosity＆love / dynamics to super intelligence

Joined February 2019

650 Following

1K Followers

Pinned

Zhengyang Geng@ZhengyangGeng · May 22

Excited to share our work with my amazing collaborators, @Goodeat258, @SimulatedAnneal, @zicokolter, and Kaiming. In a word, we show an “identity learning” approach for generative modeling, by relating the instantaneous/average velocity in an identity. The resulting model,…

143

25.0K

Pinned

Zhengyang Geng@ZhengyangGeng · Jun 25

Thrilled to introduce AlphaGenome, our new DNA sequence model now available via our AlphaGenome API. Really excited to see how the scientific community uses AlphaGenome’s predictions to understand genome function, drive biological discoveries, develop new treatments, and more...

Google DeepMind@GoogleDeepMind · Jun 25

Introducing AlphaGenome: an AI model to help scientists better understand our DNA – the instruction manual for life 🧬 Researchers can now quickly predict what impact genetic changes could have - helping to generate new hypotheses and drive biological discoveries. ↓

356

2.0K

426

312.0K

Zhengyang Geng Retweeted

Weijian Luo (Vancouver-ICML25)@William74312006 · Jul 16

I will present the Diff-Instruct-star poster paper at Wed 16 Jul 11 a.m. PDT — 1:30 p.m. PDT, East Exhibition Hall A-B #E-1808. Feel free to join and chat about one-step text-to-image/video models at scales!

650

Zhengyang Geng@ZhengyangGeng · Jul 14

Differentiable🤫 Dense🤫 E2E🤫

Xun Huang@xunhuang1995 · Jul 13

We should have called it "scaling up rollout", not RL. RL is a necessary evil for the discrete nature of language. My intuition tells me using RL for continuous data (images, videos, audios), where differentiable supervision is easily available, is a terrible idea.

444

Zhengyang Geng Retweeted

Weijian Luo (Vancouver-ICML25)@William74312006 · Jun 29

🚀 Last weekend at Peking University, I worked with Yifei @WangYw251 and developed Easy Meanflow (github.com/pkulwj1994/eas…), an open-sourced Pytorch DDP implementation of MeanFlow (a phenomenal paper by my bro @ZhengyangGeng, Mingyang Deng, Xingjian Bai, J. Zico Kolter, and…

2.0K

Zhengyang Geng@ZhengyangGeng · Jul 2

now wouldn't that be something...

Jimmy Apples 🍎/acc@apples_jimmy · Jul 2

Let me play a video game of my veo 3 videos already. Google cooked so good 👌 @OfficialLoganK playable world models wen?

213

244

5.0K

626

554.0K

Zhengyang Geng@ZhengyangGeng · Jun 23

now the code is up here: github.com/Gsunshine/mean…

Zhengyang Geng@ZhengyangGeng · May 22

10.0K

Zhengyang Geng@ZhengyangGeng · Jun 20

Thanks, @CSProfKGD! I love MeanFlow's elegant formulation of one-step generative modeling. But I was a bit confused about the notation and derivation. Hopefully, this video will help people interested in the paper understand it better.

Kosta Derpanis@CSProfKGD · Jun 19

Fresh out of the oven! 🍞 @jbhuang0604 breaks down Mean Flow from Kaiming’s group in his latest video.

8.0K

Zhengyang Geng Retweeted

Xun Huang@xunhuang1995 · Jun 9

Real-time video generation is finally real — without sacrificing quality. Introducing Self-Forcing, a new paradigm for training autoregressive diffusion models. The key to high quality? Simulate the inference process during training by unrolling transformers with KV caching.

126

778

583

135.0K

Zhengyang Geng@ZhengyangGeng · May 20

As I was saying: it's happening

Google DeepMind@GoogleDeepMind · May 20

We’ve developed Gemini Diffusion: our state-of-the-art text diffusion model. Instead of predicting text directly, it learns to generate outputs by refining noise, step-by-step. This helps it excel at coding and math, where it can iterate over solutions quickly. #GoogleIO

714

114

49.0K

Zhengyang Geng@ZhengyangGeng · May 20

Is a 1-step & 1k-token text model far?

Google DeepMind@GoogleDeepMind · May 20

966

Zhengyang Geng@ZhengyangGeng · May 9

Your LLMs can literally attend Tsinghua University. Can they graduate?

Meng-Hao Guo@MengHaoGuo1 · May 9

#ICML2025 arxiv.org/abs/2505.02018 We surveyed 100+ courses across 19 departments at Tsinghua University. With expert and model filtering, we curated a graduate-level, Olympiad-difficulty, multi-disciplinary benchmark R-Bench. Even GPT-4o struggles (33.4% on multimodal)!

609

Zhengyang Geng Retweeted

Runtian Zhai@RuntianZhai · Apr 29

Why can foundation models transfer to so many downstream tasks? Will the scaling law end? Will pretraining end like Ilya Sutskever predicted? My PhD thesis builds the contexture theory to answer the above. Blog: runtianzhai.com/thesis Paper: arxiv.org/abs/2504.19792 🧵1/12

163

113

22.0K

Zhengyang Geng Retweeted

Yutong (Kelly) He@electronickale · Apr 28

✨ Love 4o-style image generation but prefer to use Midjourney? Tired of manual prompt crafting from inspo images? PRISM to the rescue! 🖼️→📝→🖼️ We automate black-box prompt engineering—no training, no embeddings, just accurate, readable prompts from your inspo images! 1/🧵

20.0K

Zhengyang Geng Retweeted

Ricky T. Q. Chen@RickyTQChen · Apr 25

This ICLR is the best conference ever. Attendees are extremely friendly and cuddly. ..What do you mean this is the wrong hall?

407

27.0K