Vimal Thilak🦉🐒
@AggieInCA
Proverbs 17:28. I’m not learned. I'm AGI.
Thanks for sharing our work @_clashluke ! @samira_abnar wrote up a thread that hopefully makes it easy for folks interested in our work here x.com/samira_abnar/s…
Wake up babe, new MoE scaling laws dropped
Is your AI keeping up with the world? Announcing the #NeurIPS2025 CCFM Workshop: Continual and Compatible Foundation Model Updates. When/Where: Dec. 6-7, San Diego. Submission deadline: Aug. 22, 2025 (opening soon!) sites.google.com/view/ccfm-neur… #FoundationModels #ContinualLearning
Friends, should I pick this up before wrapping up my India trip? 🧐
British F-35B Fighter Jet, Grounded In Kerala, On Sale On OLX? A Fact-Check ndtv.com/india-news/bri…
New Apple #ML Research Highlight: "FastVLM: Efficient Vision Encoding for Vision Language Models" machinelearning.apple.com/research/fast-…
Oh no. Hulkster gone. One of my favorite promos was one that had Kane, The Rock and The Hulkster. RIP legend.
In this report, we describe the 2025 Apple Foundation Models ("AFM"). We also introduce the new Foundation Models framework, which gives app developers direct access to the on-device AFM model. machinelearning.apple.com/research/apple…
New paper: 'Apple Intelligence Foundation Language Models Tech Report 2025' provides technical details for two multilingual, multimodal foundation language models that power Apple Intelligence features across Apple devices and services
If you are attending ICML today, consider checking out Samira's poster on the role of sparsity in MoEs at 11 AM PDT. Poster ID: E-2810
🚨 One question that has always intrigued me is the role of different ways to increase a model's capacity: parameters, parallelizable compute, or sequential compute. We explored this through the lens of MoEs:
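As a concrete illustration of those capacity axes (a toy sketch, not the paper's implementation), a top-k mixture-of-experts layer lets total parameters grow with the number of experts while per-token compute is set by how many experts each token activates. All dimensions and the top-2 routing below are assumptions:

```python
# Toy top-k MoE layer: total parameters scale with n_experts,
# per-token compute scales with k (the number of active experts).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, d_model=64, d_hidden=256, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        )

    def forward(self, x):  # x: (tokens, d_model)
        gate = F.softmax(self.router(x), dim=-1)    # routing probabilities
        weights, idx = gate.topk(self.k, dim=-1)    # pick k experts per token
        weights = weights / weights.sum(-1, keepdim=True)
        out = torch.zeros_like(x)
        for slot in range(self.k):                  # dense loops, for clarity
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e
                if mask.any():
                    out[mask] += weights[mask, slot:slot + 1] * self.experts[e](x[mask])
        return out

layer = TopKMoE()
total_params = sum(p.numel() for p in layer.parameters())
# Doubling n_experts roughly doubles total_params but leaves per-token FLOPs
# unchanged; raising k raises per-token FLOPs without adding parameters.
print(total_params, layer(torch.randn(4, 64)).shape)
```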
We propose new scaling laws that predict the optimal data mixture for pretraining LLMs, native multimodal models, and large vision encoders! Only small-scale experiments are needed, and we can then extrapolate to large-scale ones. These laws allow 1/n 🧵
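The fit-then-extrapolate recipe the thread describes can be sketched generically (an illustration under assumed numbers and an assumed functional form, not the paper's actual mixture laws or data): fit a saturating power law to losses from small-scale runs, then query it at a much larger budget.

```python
# Generic scaling-law fit: loss(C) = a * C^(-b) + e, where e is the
# irreducible loss. All data points and the form itself are assumptions.
import numpy as np
from scipy.optimize import curve_fit

def scaling_law(c, a, b, e):
    return a * np.power(c, -b) + e

# Hypothetical (compute, loss) measurements from small-scale runs;
# compute is in units of 1e18 FLOPs.
compute = np.array([1.0, 3.0, 10.0, 30.0, 100.0])
loss = np.array([3.10, 2.85, 2.62, 2.47, 2.33])

params, _ = curve_fit(scaling_law, compute, loss, p0=(1.5, 0.2, 1.5))
print("fitted (a, b, e):", params)
# Extrapolate to 1e22 FLOPs (1e4 in these units), far beyond the fit range.
print("predicted loss at 1e22 FLOPs:", scaling_law(1e4, *params))
```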
Here's an Apple@ICML guide with all our talks, posters, and booth events: 🔗 machinelearning.apple.com/updates/apple-… Come say hi if you're around, always happy to chat. Looking forward to a week of great research, and catching up with familiar faces (and meeting new ones too).
Also lucky to be co-authoring two more posters during the same session with my awesome colleagues: Parameters vs FLOPs for MoEs (E-2810) with @samira_abnar, @harshays_, @alaa_nouby, Josh Susskind, and @AggieInCA 🔗 icml.cc/virtual/2025/p…
Excited to be heading to Vancouver for #ICML2025 next week! I'll be giving a deep dive on Distillation Scaling Laws at the expo — exploring when and how small models can match the performance of large ones. 📍 Sunday, July 13, 5pm, West Ballroom A 🔗 icml.cc/virtual/2025/4…
Rishabh doing stuff Vadivelu did in Kovai Brothers
It's all happening 😅 Big swing no ding from Rishabh Pant 😂
Journals typically have "comments on …" papers for the reasons described below. The authors do note this on page 7 of the paper. I wish we would get back to a saner model for publishing research.
New position paper! Machine Learning Conferences Should Establish a “Refutations and Critiques” Track Joint w/ @sanmikoyejo @JoshuaK92829 @yegordb @bremen79 @koustuvsinha @in4dmatics @JesseDodge @suchenzang @BrandoHablando @MGerstgrasser @is_h_a @ObbadElyas 1/6
Hey AI folks! 🚀We have an exciting opportunity to join the Apple Machine Learning Research team. If you’re passionate about pushing the boundaries of AI and working on cutting-edge research, we’d love to have you. Check out the role here: jobs.apple.com/en-us/details/…
Porampokku (Tamil: "good-for-nothing")
Soon Grok will automatically translate foreign-language posts for you
We had to downsize due to NIH funding cuts and lay off a junior software engineer who is proficient in Python coding, crawling, LLMs, RAG, and other related areas. He is currently on OPT (24 months) and will need an H1B sponsor. If any startups are interested, pls DM. RT for…
At WWDC we introduced a new generation of LLMs developed to enhance Apple Intelligence features. We also introduced the new Foundation Models framework, which gives app developers direct access to the on-device foundation language model. machinelearning.apple.com/research/apple…