DG.
@dataghees
scaling speech-native LLMs @rimelabs. the future is willed into existence. bioML, discovering new science, housing, local politics.
This is amazing. Ever since I heard @balajis talk about the pseudonymous economy, I've been thinking about how to make it easier to start a YouTube channel with total privacy. Recent advances in deepfakes, AR, and CV will make this as easy as a click. You won't need a $30k suit.
This is Miko. She's a virtual streamer who is controlled by a real-life woman known only as The Technician. The Technician uses the Unreal Engine and a $30,000 motion-capture suit to create Miko. Thread 👇
this is exactly right. was a big motivation for me to do a lecture series
Announcing The Toronto School Of Foundation Modelling, a Toronto-exclusive, in-person-only school for learning to build Foundation Models. Coming to New Stadium and Youthful Vengeance in late August 2025.
clean torch-native multimodal library, forked from torch-titan. Train your own models! (TYOM) https://github.com/xingchensong/TouchNet/tree/main
Slides for my lecture “LLM Reasoning” at Stanford CS 25: dennyzhou.github.io/LLM-Reasoning-… Key points: (1) Reasoning in LLMs just means generating a series of intermediate tokens before producing the final answer. It does not matter whether it resembles human reasoning or not. The key point…
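To make point (1) concrete, here is a minimal sketch (mine, not from the slides): "reasoning" is just the model emitting intermediate tokens before its final answer. The model name, prompt wording, and use of the transformers text-generation pipeline are illustrative assumptions.

```python
# Minimal sketch: "reasoning" = intermediate tokens generated before the
# final answer. Model choice and prompt are placeholders, not from the talk.
from transformers import pipeline

generator = pipeline("text-generation", model="Qwen/Qwen2.5-0.5B-Instruct")

prompt = (
    "Q: A bat and a ball cost $1.10 total. The bat costs $1.00 more than "
    "the ball. How much does the ball cost?\n"
    "Think step by step, then give the final answer on its own line.\nA:"
)

out = generator(prompt, max_new_tokens=128, do_sample=False)[0]["generated_text"]
# Everything the model prints between the question and the last line is the
# "reasoning": intermediate tokens that condition the final answer.
print(out)
```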
> train modality specific encoder > expand tokenizer > predict next token > ??? > profit
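Spelled out, the recipe above is: quantize the new modality into extra token ids with a separately trained encoder, grow the tokenizer/embedding table to hold them, and run standard next-token cross-entropy. A hypothetical PyTorch sketch, with all names, sizes, and the toy decoder being illustrative (not TouchNet or torch-titan code):

```python
# Hypothetical sketch of the meme recipe, not any specific codebase:
# 1) a separately trained modality encoder/quantizer turns e.g. audio into
#    discrete token ids, 2) the text vocab is expanded to make room for them,
# 3) everything is trained with plain next-token prediction.
import torch
import torch.nn as nn
import torch.nn.functional as F

TEXT_VOCAB = 32_000        # original text tokenizer size (illustrative)
MODALITY_VOCAB = 1_024     # new tokens appended for the extra modality

class TinyDecoder(nn.Module):
    def __init__(self, vocab_size, d_model=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)   # expanded table
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=2)
        self.lm_head = nn.Linear(d_model, vocab_size)

    def forward(self, tokens):                            # (B, T) token ids
        T = tokens.size(1)
        causal = torch.triu(torch.ones(T, T, dtype=torch.bool,
                                       device=tokens.device), diagonal=1)
        h = self.blocks(self.embed(tokens), mask=causal)
        return self.lm_head(h)                            # (B, T, vocab)

model = TinyDecoder(TEXT_VOCAB + MODALITY_VOCAB)

# Pretend batch: text ids live in [0, TEXT_VOCAB); modality ids (produced by
# the frozen encoder/quantizer) live in [TEXT_VOCAB, TEXT_VOCAB + MODALITY_VOCAB).
text_ids = torch.randint(0, TEXT_VOCAB, (2, 8))
modality_ids = torch.randint(TEXT_VOCAB, TEXT_VOCAB + MODALITY_VOCAB, (2, 8))
tokens = torch.cat([modality_ids, text_ids], dim=1)

# "predict next token": shift targets by one and apply cross-entropy.
logits = model(tokens[:, :-1])
loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                       tokens[:, 1:].reshape(-1))
loss.backward()   # the "??? -> profit" steps are just this, at scale
```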
amazing turnout.
📣 AI Lecture Series continues with BIOREASON by @adibvafa He'll talk about: * The first reasoning model made for biology * How that will change the future of pharmaceutical and biomaterial discovery * The biotech ecosystem in Toronto 🗓️ Wed July 23 🎟️ Link below
The Bitter Lesson does not say not to bother with methods research. It says not to bother with methods that are handcrafted datapoints in disguise.
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
do people have specific examples?
It’s often far easier for a Spanish company to sell products in Germany than it is for a Nova Scotia-based company to sell products in Manitoba. Absolutely insane behaviour.
📣 Excited to share our real-world study of an LLM clinical copilot, a collab between @OpenAI and @PendaHealth. Across 39,849 live patient visits, clinicians with AI had a 16% relative reduction in diagnostic errors and a 13% reduction in treatment errors vs. those without. 🧵
The Invisible Leash: Why RLVR May Not Escape Its Origin "RLVR is constrained by the base model's support (unable to sample solutions with zero initial probability) and operates as a conservative reweighting mechanism that may restrict the discovery of entirely original solutions"…
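The quoted argument can be stated in one line (my paraphrase, not the paper's notation): if RLVR only reweights the base distribution, it cannot place probability mass where the base model has none.

```latex
% Support constraint (paraphrase): RLVR acts as a reweighting of the base
% policy, so zero-probability solutions stay at zero probability.
\[
\pi_{\mathrm{RLVR}}(y \mid x) \;\propto\; \pi_{\mathrm{base}}(y \mid x)\, w(x, y),
\qquad
\pi_{\mathrm{base}}(y \mid x) = 0 \;\Longrightarrow\; \pi_{\mathrm{RLVR}}(y \mid x) = 0 .
\]
```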
#BREAKING: Toronto Blue Jays break 40-year record for most consecutive home game wins cp24.com/news/sports/20…
Congrats to the GDM team on their IMO result! I think their parallel success highlights how fast AI progress is. Their approach was a bit different than ours, but I think that shows there are many research directions for further progress. Some thoughts on our model and results 🧵
Kimi K2 paper dropped! describes: - MuonClip optimizer - large-scale agentic data synthesis pipeline that systematically generates tool-use demonstrations via simulated and real-world environments - an RL framework that combines RLVR with a self-critique rubric reward mechanism…
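As a generic illustration of the last bullet (NOT the Kimi K2 implementation, which the tweet does not detail): "combining RLVR with a rubric reward" just means blending a hard verifiable check with a softer critique score. The function name, weighting, and scoring scheme below are all hypothetical.

```python
# Generic illustration of mixing a verifiable reward with a rubric-based
# self-critique score. Not taken from the Kimi K2 paper.
def combined_reward(answer: str, reference: str, rubric_scores: list[float],
                    alpha: float = 0.5) -> float:
    verifiable = 1.0 if answer.strip() == reference.strip() else 0.0  # RLVR-style exact check
    critique = sum(rubric_scores) / len(rubric_scores)                # rubric scores in [0, 1]
    return alpha * verifiable + (1 - alpha) * critique

print(combined_reward("42", "42", rubric_scores=[0.8, 0.6, 1.0]))
```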
Advanced version of Gemini Deep Think (announced at #GoogleIO) using parallel inference time computation achieved gold-medal performance at IMO, solving 5/6 problems with rigorous proofs as verified by official IMO judges! Congrats to all involved! deepmind.google/discover/blog/…
It’s called Pakistani Pizza (Pizza Karachi on Queen W)
Chicken Tikka Masala Pizza should be a thing. I’m sure someone’s made it, but I’ve never had it and I want it.
A striking thing about OpenAI's IMO gold math model is how terse it is: it really tries to express itself in single tokens, often breaking the rules of grammar and spelling to do so. They say compression is intelligence. We may be seeing a totally novel way to do compression…
So why is this impressive? This is pretty different from normal reasoning model training, where correct answers are rewarded: it's typically much easier to guess the answer to an IMO question than to prove it. E.g., guessing correctly might earn 1/7 marks, while a full proof earns 7/7.