Joaquin Vanschoren

@joavanschoren

AI Researcher, @open_ml founder, research lead @TUeindhoven. Building AI systems that learn how to learn, grow and adapt continuously & push humanity forward.

Genk, Belgium

Joined February 2010

1KFollowing

3KFollowers

Pinned

Joaquin Vanschoren@joavanschoren · Jun 16

🚀 Ready to push the boundaries of #AI & #ML? We're hiring 7(!) brilliant PhDs, PostDocs, and Engineers to work on cutting-edge #LLMs #MultiModal #GenAI @TUeindhoven ✨ Think big and shape the future of AI! 🧑‍💻👩‍💻 Apply now! amore-labs.github.io/website/join/j… ❣️Please share❣️

784

Pinned

Joaquin Vanschoren Retweeted

Santiago@svpino · Feb 17

Knowledge graphs are a game changer for AI Agents! A few ridiculous and eye-opening benchmarks comparing an AI Agent using knowledge graphs with state-of-the-art methods: • 94.8% accuracy versus 93.4% in the Deep Memory Retrieval (DMR) benchmark. • 71.2% accuracy versus…

172

1.0K

2.0K

97.0K

Pinned

Joaquin Vanschoren Retweeted

Gael Varoquaux 🦋@GaelVaroquaux · Feb 6

I just put on line a talk I gave summarizing what I have learned across the years as a maintainer of open source. It's _opinions_ (been there, done that), but I'm willing to defend them, having stewarded my share of successful open source projects. speakerdeck.com/gaelvaroquaux/…

1.0K

Joaquin Vanschoren Retweeted

Melanie Mitchell@MelMitchell1 · Jun 16

New paper: "Large Language Models & Emergence: A Complex Systems Perspective" (D. Krakauer, J. Krakauer, M. Mitchell). We look at claims of "emergent capabilities" & "emergent intelligence" in LLMs from perspective of what emergence means in complexity science. ⬇️

144

729

608

68.0K

Joaquin Vanschoren@joavanschoren · Mar 2

If you’re interested in learning about the theory behind Muon (a new optimizer), Jeremy has a great explainer in this thread. Also check out all his work leading to this (modula, modular duality, etc): docs.modula.systems/intro/reading-… It’s a beautiful theory and seems to work too!

JJeremy Bernstein@jxbz · Mar 1

It's been wild to see our work on Muon and the anthology start to get scaled up by the big labs. After @Kimi_Moonshot released Moonlight, people have asked whether Muon is compatible with muP. I wanted to write up an explainer, as there is something deeper going on here! (1/8)

168

121

21.0K

Joaquin Vanschoren Retweeted

Demis Hassabis@demishassabis · Feb 27

Hypothesis generation and testing is a critical capability for AGI imo. Super excited about our AI co-scientist and other AI for Science work which are important steps towards that. We're on the cusp of an incredible new golden age of AI accelerated scientific discovery.

258

315

2.0K

515

159.0K

Joaquin Vanschoren Retweeted

Tanishq Abraham back from ICML@iScienceLuvr · Feb 26

Diffusion language models are SO FAST!! A new startup, Inception Labs, has released Mercury Coder, "the first commercial-scale diffusion large language model" It's 5-10x faster than current gen LLMs, providing high-quality responses at low costs. And you can try it now!

263

3.0K

2.0K

353.0K

Joaquin Vanschoren Retweeted

Nathan Lambert@natolambert · Feb 26

First 11 chapters of RLHF Book have v0 draft done. Should be quick useful now. Next: * Crafting more blog content into future topics, * DPO+ chapter, * Meeting with publishers to get wheels turning on physical copies, * Cleaning & cohesiveness

346

213

20.0K

Joaquin Vanschoren Retweeted

Chelsea Finn@chelseabfinn · Feb 26

Can we prompt robots, just like we prompt language models? With hierarchy of VLA models + LLM-generated data, robots can: - reason through long-horizon tasks - respond to variety of prompts - handle situated corrections Blog post & paper: pi.website/research/hirob…

463

168

32.0K

Joaquin Vanschoren@joavanschoren · Feb 18

This is something we have been cooking together for a few months and I'm very excited to announce it today. Thinking Machines Lab is my next adventure and I'm feeling very proud and lucky to start it with a group of talented colleagues. Learn more about our vision at…

TThinking Machines@thinkymachines · Feb 18

Today, we are excited to announce Thinking Machines Lab (thinkingmachines.ai), an artificial intelligence research and product company. We are scientists, engineers, and builders behind some of the most widely used AI products and libraries, including ChatGPT,…

1.0K

248

254.0K

Joaquin Vanschoren Retweeted

Fleetwood@fleetwood___ · Feb 7

Understanding GPU bottlenecks is easy with a visualisation 👨🏻‍🍳

404

3.0K

323.0K

Joaquin Vanschoren Retweeted

Andreas Storm@avstorm · Feb 7

Github's data stream dashboard is a piece of art

400

5.0K

2.0K

297.0K

Joaquin Vanschoren Retweeted

Thomas Wolf@Thom_Wolf · Feb 6

From an open-research point of view, probably the greatest thing about DeepSeek–R1 is how its RL training technique appears so straightforward and simple in comparison to the cumbersome approaches people were starting to think necessary for learning reasoning like Process Reward…

134

9.0K

Joaquin Vanschoren Retweeted

Akshay 🚀@akshay_pachaar · Feb 3

Make your RAG application 10x smarter! ColiVara is a unique document retrieval method that does not need chunking or text processing. It still feels like RAG but without OCR, text extraction, broken tables, or missing images. What you see is what you get. ✨ Here’s why it’s a…

175

1.0K

2.0K

108.0K

Joaquin Vanschoren Retweeted

Abi Aryan@GoAbiAryan · Feb 3

🚀 Personal News: My Book "LLMOps: Managing Large Language Models in Production" is finally in early release with three chapters📚 🌟 What You’ll Get: Chapter 1 puts a lot of fundamental concepts in perspective from Language Model architectures to SLMs Chapter 2 goes into the…

4.0K

Joaquin Vanschoren Retweeted

Mike Butcher (BlueSky/Threads: @mikebutcher)@mikebutcher · Feb 3

Over 20 European orgs/companies — backed with €54m from the EU Commission — have joined the "OpenEuroLLM" project to develop Open Source models for Europe. But will it put Europe back on the AI map? thenextweb.com/news/european-… openeurollm.eu

940