Akshat Gupta

@akshatgupta57

PhD student @UCBerkeley @berkeley_ai | MS @CarnegieMellon | Previously : AI Research @jpmorgan

Joined September 2021

450Following

427Followers

Pinned

Akshat Gupta@akshatgupta57 · Mar 6

Our work on knowledge editing got an "Outstanding Paper Award"🏆🏆 at the @RealAAAI KnowFM Workshop!! #AAAI2025 🥳🥳🥳 Congratulations to my amazing co-authors @tom_hartvigsen @_ahmedmalaa @GopalaSpeech

AAkshat Gupta@akshatgupta57 · Mar 3

Thrilled to share that our paper on "Norm Growth and Stability Challenges in Sequential Knowledge Editing" has been accepted for an Oral Presentation at the KnowFM workshop @ #AAAI2025 w/ @tom_hartvigsen @_ahmedmalaa @GopalaSpeech More details below (1/n)

12.0K

Akshat Gupta@akshatgupta57 · Jul 21

Language needs to evolve already and drop the 'er' in 'itinerary.' Just make it 'itinary'—the extra letters are doing nothing for anyone, especially when the shorter alternative is available.

105

Akshat Gupta@akshatgupta57 · Jun 19

With all the recent discussion around LLM reasoning, I’m awestruck by how insightfully Richard Hamming was asking the same questions about machines and thinking, BUT 30 years ago! youtube.com/watch?v=aq_PLE… youtube.com/watch?v=URp-uq…

131

Akshat Gupta@akshatgupta57 · Jun 5

Glad you find it useful and thanks for sharing! Paper - arxiv.org/abs/2409.12951

AAvijit Thawani (Avi)@thawani_avijit · Jun 4

Very well written paper on LayerNorm’s geometry 🧠 Turns out LN explicitly removes a vector’s projection on the uniform direction. But most LLMs naturally learn to be orthogonal to it anyway! 👉 RMSNorm skips this step ✅ Faster ✅ No info loss ✅ Same results paper link below:

164

Akshat Gupta Retweeted

Tom Hartvigsen@tom_hartvigsen · Jun 2

I'm excited we've got some some papers accepted to ACL this year on model editing and on scaling laws for quantized LLMs 🎉 Lots of great work from really talented students and collaborators

1.0K

Akshat Gupta Retweeted

Brian Zhengyu Li@BrianforPhD · May 29

Wrote a blog about my work on poker + AI (LLMs) with @RichardZ412 and @akshatgupta57. Solvers are not the end of AI integration in poker, and many more positive use case can be explored.

2.0K

Akshat Gupta@akshatgupta57 · May 25

Google’s AI just hit me with a triple negative so hard I forgot my own name - “It’s not uncommon for Mission: Impossible to not have post-credits scenes.”

akshatgupta57's tweet image. Google’s AI just hit me with a triple negative so hard I forgot my own name -

“It’s not uncommon for Mission: Impossible to not have post-credits scenes.”

254

Akshat Gupta@akshatgupta57 · May 21

Very cool paper - "Memorization seems to be happening for prefixes present at the beginning of the context window" arxiv.org/abs/2505.13171

101

Akshat Gupta@akshatgupta57 · May 16

Excited to have two paper accepted at #ACL2025 !🎉🎉 1 Main track and 1 Findings. Papers out on arxiv soon. Big thank you to all my collaborators!

akshatgupta57's tweet image. Excited to have two paper accepted at #ACL2025 !🎉🎉
1 Main track and 1 Findings. Papers out on arxiv soon.

Big thank you to all my collaborators!

9.0K

Akshat Gupta@akshatgupta57 · May 15

Great to see our work with @DrMayaPetersen, @emrek, @cholmesuk and @mark_vdlaan featured by @MSFTResearch!

MMicrosoft Research@MSFTResearch · May 13

Discover how Microsoft Research is leveraging AI to transform research processes, accelerate innovation, and drive growth. Learn more about the strategies and tools empowering researchers here: msft.it/6011SZK8U

1.0K

Akshat Gupta@akshatgupta57 · May 12

I think o3 does better lit review than openai deep research...

164

Akshat Gupta Retweeted

Hadas Orgad @ ICML@OrgadHadas · May 3

Deadline extended! ⏳ The Actionable Interpretability Workshop at #ICML2025 has moved its submission deadline to May 19th. More time to submit your work 🔍🧠✨ Don’t miss out!

3.0K

Akshat Gupta@akshatgupta57 · Apr 26

#ICLR25 Our work on characterizing alignment between MLP matrices in LLMs and Linear Associative Memories has been accepted for an Oral Presentation at the NFAM workshop. Location : Hall 4 #5 Time : 11 AM (April 27) @GopalaSpeech @berkeley_ai

akshatgupta57's tweet image. #ICLR25 Our work on characterizing alignment between MLP matrices in LLMs and Linear Associative Memories has been accepted for an Oral Presentation at the NFAM workshop.

Location : Hall 4 #5
Time : 11 AM (April 27)

@GopalaSpeech @berkeley_ai

352

Akshat Gupta Retweeted

Richard Zhuang@RichardZ412 · Apr 21

I’ll be at #ICLR2025 to present our spotlight paper on automatic method to extract LLM characteristics and perform routing! Also I've been recently delving into the RLVR world, so hit me up if you want to chat about model routing or RL for LLM tool-use!

2.0K

Akshat Gupta@akshatgupta57 · Apr 16

The @OpenAI podcast makes it seem like every iteration of gpt will be 100x its previous iteration. Does it mean we have our first trillion parameter model in #GPT4_5 and #GPT5 will have 10 trillion parameters? Surely that’s got to be it?

266

Akshat Gupta Retweeted

Berkeley AI Research@berkeley_ai · Apr 9

Work led by BAIR students @KayloLittlejohn and @CheolJunCho advised by BAIR faculty @GopalaSpeech "...made it possible to synthesize brain signals into speech in close to real-time." dailycal.org/news/campus/re… via @dailycal

4.0K

Akshat Gupta Retweeted

Kaylo Littlejohn@KayloLittlejohn · Mar 31

1/n) Our latest work is out today in @NatureNeuro! We developed a streaming “brain-to-voice” neuroprosthesis which restores naturalistic, fluent, intelligible speech to a person who has paralysis. nature.com/articles/s4159…

183

707

294

80.0K