Benno Krojer
@benno_krojer
AI phding @Mila_Quebec @mcgillu (past: @AIatMeta). Interests: interpretability, language grounding (V+L), evals, reasoning. Vanier Scholar. 🥏⚽🥨
Love to see this! I am always hoping for papers that show that text-only understanding is influenced by being physically grounded (images, videos, interaction). It was a big hope of people years ago with few positive findings, so I'm glad it is still being explored!
Does vision training change how language is represented and used in meaningful ways?🤔 The answer is a nuanced yes! Comparing VLM-LM minimal pairs, we find that while the taxonomic organization of the lexicon is similar, VLMs are better at _deploying_ this knowledge. [1/9]
A blizzard is raging in Montreal when your friend says “Wow, the weather is amazing!” Humans easily interpret irony, while LLMs struggle with it. We propose a 𝘳𝘩𝘦𝘵𝘰𝘳𝘪𝘤𝘢𝘭-𝘴𝘵𝘳𝘢𝘵𝘦𝘨𝘺-𝘢𝘸𝘢𝘳𝘦 probabilistic framework as a solution. arxiv.org/abs/2506.09301 @ #acl2025
Love this series of blog posts - a very insightful peek at the process that goes on behind a cool paper! I'd love to see more authors write posts like these
This was an incredibly important project to me - I’ve wanted to solve it for years, but had no idea how. This was all @sukjun_hwang and @fluorane's amazing work! I wrote about the story of its development, and what might be coming next. The H-Net: goombalab.github.io/blog/2025/hnet…
Very cool stuff. I was amazed to learn from Tom that there is a programming language, RASP, that compiles symbolic algorithms into transformer weights. He built on top of that and studied how we can distill those "idealized transformers" into our regular LLMs.
🥳 New Paper @ ACL Findings 🇦🇹 Instead of reverse engineering mechanisms in LLMs, can we inject our own known mechanism into a pretrained language model? Yes we can!
Cohere is excited to announce our new office in Montreal, QC! We look forward to contributing to the local AI landscape, collaborating with new and existing partners in the city, and growing our Montreal-based team. cohere.com/blog/montreal-…
I genuinely think @benno_krojer's work offers a much fairer and more insightful way to assess the physics understanding of VideoLLMs. Highly recommend giving it a read if you're curious about where current models stand!
Welcome to the lab, doctor!
I miss Edinburgh and its wonderful people already!! Thanks to @tallinzen and @PontiEdoardo for inspiring discussions during the viva! I'm now exchanging Arthur's Seat for Mont Royal to join @sivareddyg's wonderful lab @Mila_Quebec 🤩
The video is online now! 3min speed science talk on "From a soup of raw pixels to abstract meaning" youtu.be/AHsoMYG2Vqk?si…
Turns out condensing your research into 3 minutes is very hard, but it also teaches you a lot
Cool use of our AURORA work from last year to improve physical world models framed as image editing!
🔁 What if you could bootstrap a world model (state1 × action → state2) using a much easier-to-train dynamics model (state1 × state2 → action) in a generalist VLM? 💡 We show how a dynamics model can generate synthetic trajectories & serve for inference-time verification 🧵👇
"Build the web for agents, not agents for the web" This position paper argues that rather than forcing web agents to adapt to UIs designed for humans, we should develop a new interface optimized for web agents, which we call Agentic Web Interface (AWI).