huaijiang
@huaijiangzhu
Founding engineer @alquistrobotics, PhD @nyuniversity. prev Boston Dynamics AI Institute @MPI_IS @TU_Muenchen he/him 🏳️🌈
vLLM v0.10.0 just released, and its biggest feature might be a hidden gem: initial support for the OpenAI /responses API. It might sound like a small feature, but this is a huge market signal. The industry is moving in this direction for building the next generation of powerful,…
dude if there is anyone causing toxicity right now it's you
And to be clear, of course the students are amazing (worked with many of them). What I don't like at all is the ecosystem / way of working set by advisors, and the toxicity that it causes on the global research community.
I’m Shawn, founder of Memories.ai, former researcher at Meta and CS PhD at University of Cambridge. Today we’re launching : we built the world’s first Large Visual Memory Model - to give AI human-like visual memories. Why visual memory? AI to…
Interesting piece by Matt Levine on the huge AI salaries: “I tell you what, if Meta Platforms Inc. paid me a $100 million signing bonus to come work for their artificial intelligence business, I would be the most dedicated worker they have ever seen until the check cleared!…
Will conversation history help reasoning? We found that when models mess up once, they often get stuck. Surprisingly, a simple “try again” fixes this — and boosts reasoning.🧵 Project Page: unary-feedback.github.io
Test-time scaling nailed code & math—next stop: the real 3D world. 🌍 MindJourney pairs any VLM with a video-diffusion World Model, letting it explore an imagined scene before answering. One frame becomes a tour—and the tour leads to new SOTA in spatial reasoning. 🚀 🧵1/
I made a simple tutorial how to fine-tune LLMs using (almost) same memory as needed for inference.
Dr Jonathan Hurst of @AgilityRobotics makes the case of why wheel based bots are not the solution to mobility in humanoids. I've made similar arguments, so nice to have backup TL;DR: They're tippy
Our co-founder, Jonathan Hurst, shares his vision for the path that humanoid robots will take to becoming part of our everyday lives. agilityrobotics.com/content/humano…
The Invisible Leash: Why RLVR May Not Escape Its Origin "RLVR is constrained by the base model's support-unable to sample solutions with zero initial probability-and operates as a conservative reweighting mechanism that may restrict the discovery of entirely original solutions"…
to be fair, rl has always worked with solvable exploration and a small sim-to-real gap. and yes the policy always learns to work around your bugs.
RL went from not working at all to working so well that code can have major correctness bugs and you don't notice because it still just works
🚨 Olympiad math + AI: We ran Google’s Gemini 2.5 Pro on the fresh IMO 2025 problems. With careful prompting and pipeline design, it solved 5 out of 6 — remarkable for tasks demanding deep insight and creativity. The model could win gold! 🥇 #AI #Math #LLMs #IMO2025
I had a great time presenting "It's Time to Say Goodbye to Hard Constraints" at the Flatiron Institute. In this talk, I describe a philosophy for model construction. Video now online! youtube.com/watch?v=LxuNC3…
How to train a State-of-the-art agent model. Let's talk about the Kimi K2 paper.
Why infinite DoF? In contact-rich manipulation, what matters is where and how much force is applied. Low-DoF hands do not have the capacity to match the contact complexity of the human hand — making it hard to learn from human demos. Build that capacity first.
How to train the best non-reasoning model: 1. Gather a reasoning dataset 2. remove <think> tokens 3. Train the model
My new paper "Deep Learning is Not So Mysterious or Different": arxiv.org/abs/2503.02113. Generalization behaviours in deep learning can be intuitively understood through a notion of soft inductive biases, and formally characterized with countable hypothesis bounds! 1/12
Reflex is opening a new SF office! We’re scaling fast across both coasts -- building humanoid robots that actually work. If you're wired to move quickly, solve real problems, and deploy robots, come join us. reflexrobotics.com/careers Now hiring: ➣ Principal AI Researcher, World…