Kevin Patrick Murphy

@sirbayes

Research Scientist at Google DeepMind. Interested in Bayesian Machine Learning.

Joined October 2016

516Following

59KFollowers

Pinned

Kevin Patrick Murphy@sirbayes · May 24

Great article.

YYann LeCun@ylecun · May 24

Steven Pinker injects some facts and much-needed sanity in the debates around Harvard and American academia. nytimes.com/2025/05/23/opi…

12.0K

Kevin Patrick Murphy@sirbayes · 12 h

The rise of tool-using LLMs is good news if you like to develop fast and reliable algorithms: the LLM converts raw text x into structured form x', calls your method f to get y'=f(x'), then converts it back to text y. We need more tools f!

7.0K

Kevin Patrick Murphy@sirbayes · Jul 15

Don’t just predict the mean of your clean data given your noisy data, predict the full distribution.

AArthur Gretton@ArthurGretton · Jul 14

Distributional diffusion models with scoring rules at #icml25 Fewer, larger denoising steps using distributional losses! Wednesday 11am poster E-1910 arxiv.org/pdf/2502.02483 @agalashov @ValentinDeBort1 Guntupalli @zhouguangyao @sirbayes @ArnaudDoucet1

264

170

22.0K

Kevin Patrick Murphy@sirbayes · Jul 13

Name the LLM responsible for this gem...

9.0K

Kevin Patrick Murphy@sirbayes · Jul 8

I don't know why AI doomers keep going on about "bio risk" (which is rate-limited by many physical constraints). AI agent powered cyber attacks to power grids, satellites, etc. seem more likely, since these are digital systems, but directly affect the physical world.

4.0K

Kevin Patrick Murphy@sirbayes · Jun 21

It’s funny how creating “radical abundance” has replaced “democratizing AI” as the new euphemism for getting rich in the tech sphere.

8.0K

Kevin Patrick Murphy@sirbayes · Jun 21

My colleague and former intern @liusiqi42 reminded me that we did RLFT for LMs almost 10 years ago - back then it was for an img2text model based on CNNs and RNNs. But same basic recipe - pre train with MLE then fine tune with PG. arxiv.org/abs/1612.00370

149

14.0K

Kevin Patrick Murphy@sirbayes · Jun 19

Go Bears! news.berkeley.edu/2025/06/18/uc-…

sirbayes's tweet card. In an evaluation of more than 2,250 universities around the world, Berkeley ranked sixth overall.

4.0K

Kevin Patrick Murphy@sirbayes · Jun 9

This is a very thought provoking interview with my former student. I do think AI personas (esp multimodal and real time) may be addictive and seem better than humans - but so is heroin (albeit heroin has less useful applications than AI).

DDavid Duvenaud@DavidDuvenaud · Jun 8

AI will probably be better spouses and parents than us someday, by our own standards. Me and @danfaggella talked about what this might look like, how it'll likely feel at the time, and how hard it is to think clearly about these painful ideas. youtube.com/watch?v=XPpg89…

16.0K

Kevin Patrick Murphy@sirbayes · Jun 6

Well, it seems that the Elon / Trump bromance is finally over, as I predicted… 🍿

KKevin Patrick Murphy@sirbayes · Dec 27

It’s delicious to see the in-fighting on the right has started before Trump has even taken office. How long will Ramaswamy and Elon last before falling out of favor with DJT? mediaite.com/politics/vivek…

9.0K

Kevin Patrick Murphy@sirbayes · May 30

I think it's quite misleading for the big labs to be promoting how well their VLMs work on pokemon, given how much (game-specific) manual annotation is required behind the scenes. Solving general tasks from pixel input is much harder than coding ("Moravec's revenge").

LLucas Beyer (bl16)@giffmana · May 28

Yep that's exactly what i expected. They test VLMs on gameplay without scaffold: image in, action out. And they can't play. One small feedback to the authors: include a random agent (maybe best of 100 or so) as a baseline.

117

20.0K

Kevin Patrick Murphy@sirbayes · May 23

100%.

JJohn Carmack@ID_AA_Carmack · May 23

The full video of my Upper Bound 2025 talk about our research directions should be available at some point, but here are my slides: docs.google.com/presentation/d… And here are the notes I made while preparing, which are more extensive than what I had time to say:…

190

122

25.0K

Kevin Patrick Murphy@sirbayes · May 22

Does anyone know if ChatGPT keeps some kind of context or user profile across sessions? If i ask it to derive mathy things related to online Bayes, it often asks me if I want to see a low-rank version of it, or a Thompson sampling version. How does it know I care? Spooky.

18.0K

Kevin Patrick Murphy@sirbayes · May 20

I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy! arxiv.org/abs/2412.05265

sirbayes's tweet image. I am pleased to announce a new version of my RL tutorial. Major update to the LLM chapter (eg DPO, GRPO, thinking), minor updates to the MARL and MBRL chapters and various sections (eg offline RL, DPG, etc). Enjoy!
arxiv.org/abs/2412.05265

452

2.0K

117.0K

Kevin Patrick Murphy@sirbayes · May 11

This was a great talk (*) on using (proper multi-turn) RL for training LLM agents to reason and use tools. Very bullish on this "Generative Agents" direction! (* Audio was very bad; fortunately brains are good at source separation :)

wwill brown@willccbb · May 11

nice gemini summary/transcription of the talk from Friday :)

143

117

19.0K

Kevin Patrick Murphy@sirbayes · Apr 30

I dont know why singapore air is rated number 1 in world. their business class beds are much less comfortable than united/ polaris, because they are narrow and not straight. Food is good but not amazing. IMHO Emirates is best, then KLM & United (but grateful not economy class :)

11.0K

Kevin Patrick Murphy@sirbayes · Apr 20

I had a great time diving at @Wakatobi in Indonesia (although unfortunately I got an ear infection and had to skip the last couple of days). Tomorrow off to Singapore for #ICLR2025 (DM me if you want to meet).

sirbayes's tweet image. I had a great time diving at @Wakatobi in Indonesia (although unfortunately I got an ear infection and had to skip the last couple of days). Tomorrow off to Singapore for #ICLR2025 (DM me if you want to meet).

8.0K

Kevin Patrick Murphy@sirbayes · Apr 18

I am pleased to announce that I have updated the online versions of my 2 textbooks (see probml.github.io/pml-book/): I fixed all issues listed on github, added some new references (esp on LLMs), and made a few other small tweaks.

178

1.0K

862

67.0K