James Burgess (at CVPR)
@jmhb0
https://jmhb0.github.io/ PhD student in ML, computer vision & biology at Stanford 🇦🇺
Introducing MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research #CVPR2025 ✅ 1k multimodal reasoning VQAs testing MLLMs for science 🧑‍🔬 Biology researchers manually created the questions 🤖 RefineBot: a method for fixing QA language shortcuts 🧵

I've been working with the @reflection_ai team on Asimov, our best-in-class code research agent. I'm super excited for you all to try it. Let me know here if you'd like access and I can move you off the waitlist. :)
Engineers spend 70% of their time understanding code, not writing it. That’s why we built Asimov at @reflection_ai: a best-in-class code research agent, built for teams and organizations.
Get around our very cool #ICML paper on predicting how biological cells respond to drug treatments or gene knockdowns. It was led by the legendary @Zhang_Yu_hui and @hhhhh2033528, and I was happy to contribute a tiny bit :)
🧬 What if we could build a virtual cell to predict how it responds to drugs or genetic perturbations? Super excited to introduce CellFlux at #ICML2025 — an image generative model that simulates cellular morphological changes from microscopy images. yuhui-zh15.github.io/CellFlux/ 💡…
I'm at CVPR! Come see me at one of my posters, or reach out for a chat.
MicroVQA: a multimodal reasoning benchmark in biology. Sat 5-7pm, Hall D, poster #357. jmhb0.github.io/microvqa/
BIOMEDICA: a massive vision-language dataset. Sat 5-7pm, Hall D, poster #374. minwoosun.github.io/biomedica-webs…
My lab is starting at UW-Madison! This is a unique opportunity to contribute to impactful computational neuropathology research in a collaborative environment. Join the Nirschl Lab and help drive discoveries that improve our understanding of neurodegenerative disorders!🧠
I'm at #ICLR2025 presenting "Video Action Differencing". Keen to chat with anyone interested in MLLMs - both for general data & for scientific reasoning
🚨 Large video-language models like LLaVA-Video can do single-video tasks. But can they compare videos? Imagine you’re learning a sports skill like kicking: can an AI tell how your kick differs from an expert video? 🚀 Introducing "Video Action Differencing" (VidDiff), ICLR 2025 🧵
Three papers being presented by my amazing collaborators at #ICLR2025! 🌟 (sadly I can't make it) 1. Mechanistic Interpretability Meets Vision Language Models: Insights and Limitations 🔍 A deep dive into mechanistic interpretation techniques for VLMs & future…
🤗The SmolVLM report is out, with all the experiments, findings, and insights that led to high performance at tiny sizes🤏. 📱These models can run on most mobile/edge devices. 📖Give it a look!
Today, we share the tech report for SmolVLM: Redefining small and efficient multimodal models. 🔥 Explaining how to design a tiny 256M VLM that uses less than 1GB of RAM and outperforms our 80B models from 18 months ago! Here are the coolest insights from our experiments: ✨…
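If you want to poke at a model this small yourself, here's a minimal sketch using the standard Hugging Face transformers chat-template flow. It isn't taken from the report: the checkpoint ID, the image path, and the prompt are assumptions, so swap in whatever the release actually ships.

```python
# Minimal sketch: run a small vision-language model locally with transformers.
# The model ID below is an assumption (check the SmolVLM release for the real one).
from PIL import Image
from transformers import AutoProcessor, AutoModelForVision2Seq

model_id = "HuggingFaceTB/SmolVLM-256M-Instruct"  # assumed checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForVision2Seq.from_pretrained(model_id)

image = Image.open("example.png")  # any local image
messages = [
    {"role": "user",
     "content": [
         {"type": "image"},
         {"type": "text", "text": "Describe what is shown in this image."},
     ]},
]

# Build the prompt from the chat template, bundle it with the image, and generate.
prompt = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(text=prompt, images=[image], return_tensors="pt")
generated = model.generate(**inputs, max_new_tokens=128)
print(processor.batch_decode(generated, skip_special_tokens=True)[0])
```

At these parameter counts the whole thing fits comfortably in CPU memory, which is what makes the mobile/edge claim above plausible.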
Excited to see SmolVLM powering BMC-SmolVLM in the latest BIOMEDICA update! At just 2.2B params, it matches 7-13B biomedical VLMs. Check out the full release: @huggingface #smolvlm
Earlier this year, we released the BIOMEDICA dataset, featuring 24 million unique image caption pairs and 30 million image references derived from open-source biomedical literature. It's been great to see the community engaging with it—we're currently seeing around 6K downloads…
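At 24 million image-caption pairs you generally don't want to download the whole thing up front; a streaming loop is the usual pattern. Here's a rough sketch with the Hugging Face `datasets` library; the repo ID and column names are assumptions, not the official ones, so check the BIOMEDICA release page for the real identifiers.

```python
# Sketch: stream a large image-caption dataset instead of downloading it all.
# The repo ID and column names are hypothetical placeholders.
from datasets import load_dataset

ds = load_dataset(
    "BIOMEDICA/biomedica",  # assumed repo ID
    split="train",
    streaming=True,         # iterate lazily over ~24M pairs
)

# Peek at the first few examples without materializing the dataset.
for i, example in enumerate(ds):
    image = example.get("image")      # assumed column name
    caption = example.get("caption")  # assumed column name
    print(type(image), (caption or "")[:80])
    if i >= 4:
        break
```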