David Stutz

@davidstutz92

Research scientist @DeepMind working on robust and safe AI, previously @maxplanckpress, views my own.

London

Joined January 2022

1K Following

4K Followers

Pinned

David Stutz @davidstutz92 · Jun 19, 2024

Part of our Med-Gemini work was a full relabelling of MedQA, revealing that at least 7.4% of examples are unfit for evaluation. Today, we open sourced these annotations alongside our evaluation script as a new standard evaluation on MedQA. A thread 🧵: github.com/Google-Health/…

For Med-Gemini, we relabeled the MedQA benchmark; this repo includes the annotations and analysis code. - Google-Health/med-gemini-medqa-relabelling

David Stutz Retweeted

Vivek Natarajan @vivnat · May 20

Sir Demis sharing at the Google IO stage: - AI Co-scientist - our OG @GoogleDeepMind Gemini agents for accelerating scientific discovery and helping find cures for complex diseases (acute myeloid leukemia, liver fibrosis and counting) - AMIE - our research AI doctor system…

David Stutz Retweeted

Artificial Intelligence and Machine Learning @ LMU @AIML_LMU · 23 h

🚀 @AIML_LMU is at #UAI2025 in beautiful Rio 🇧🇷! Yesterday @AlirezaJVNMRDI & @HanselleJonas presented our paper "Conformal Prediction without Nonconformity Score", a joint work with Tobias Oberkofler, @ysale12 & @eyke_hu. Check out the full paper here: openreview.net/pdf?id=ENJd3vu…

David Stutz @davidstutz92 · Jul 23

AMIE can now conduct medical dialogues with patients within specified safety guardrails. This advance allows satisfying safety constraints such as abstaining from individualized medical advice while letting AMIE perform the crucial task of information acquisition (“history…

David Stutz @davidstutz92 · Jul 22

We are excited to share our latest pre-print enabling effective human oversight for AMIE, our research diagnostic dialogue AI. We introduce a new asynchronous oversight paradigm, decoupling history-taking by AMIE from sharing a human-approved diagnosis – a thread 🧵:

David Stutz Retweeted

Google DeepMind @GoogleDeepMind · May 20

SynthID – our groundbreaking digital watermarking technology – has already been used 10 billion times. 🌍 Today, we're introducing SynthID Detector, a new online portal designed to quickly identify if any part of digital content was generated by Google's AI tools. Find out more…

David Stutz Retweeted

Yossi Matias @ymatias · May 21

Today, at #GoogleIO, we introduced MedGemma, Google's most capable open model for multimodal medical text and image comprehension (following Med-PaLM, Med-PaLM2, Med-Gemini). Accessible at our Health AI Developer Foundations: goo.gle/medgemma
