Pratik Ramesh (@pratikramesh7)

Pinned

P

Pratik Ramesh@pratikramesh7 · Nov 8

🤔Ever wondered why merging LoRA models is trickier than fully-finetuned ones? 🔍We explore this and discover that poor alignment b/w LoRA models lead to subpar merging. 💡The solution? KnOTS🪢— our latest work that uses SVD to improve alignment and boosts SOTA merging methods.

LLeshem (Legend) Choshen 🤖🤗 @ACL@LChoshen · Nov 8

Model merging is tricky when model weights aren’t aligned Introducing KnOTS 🪢: a gradient-free framework to merge LoRA models. KnOTS is plug-and-play, boosting SoTA merging methods by up to 4.3%🚀 📜: arxiv.org/abs/2410.19735 💻: github.com/gstoica27/KnOTS

1

7

21

7

2.0K

P

Pratik Ramesh@pratikramesh7 · Jul 24

As the NeurIPS 2025 rebuttal phase begins, I always find it helpful to revisit @deviparikh's classic guide on writing effective rebuttals: deviparikh.medium.com/how-we-write-r…

pratikramesh7's tweet card. By Devi Parikh, Dhruv Batra, Stefan Lee

1

6

48

45

8.0K

P

Pratik Ramesh@pratikramesh7 · Jul 5

Funny moment: while reviewing a paper, I saw one of my own papers cited—with a completely different author list🤯 Probably a result of an LLM hallucinating. With more people relying on LLMs for research, getting the citation right might be a useful benchmark.

0

5

0

97

P

Pratik Ramesh@pratikramesh7 · Apr 8

Tired of switching between different AI models for different tasks? What if you could combine them instead? 🤖➕ 🚀 Excited to share our work, KnOTS — efficient model merging, which will be presented at ICLR 2025 in Singapore! 🇸🇬

GGeorgia Tech Computing@gtcomputing · Apr 8

A new approach for easily merging data models is bringing adaptable, multi-tasking #AIs closer to reality. A @GeorgiaTech and @IBM team led by George Stoica and Pratik Ramesh (@pratikramesh7) "significantly enhances existing merging techniques" for data tasks necessary to advance…

0

2

10

0

1.0K

P

Pratik Ramesh@pratikramesh7 · Feb 19

Want a robot to do your chores? Check out some cool work from our lab led by @simar_kareer.

PProject Aria @Meta@meta_aria · Feb 19

Prof. Danfei Xu (@danfei_xu) and the Robot Learning and Reasoning Lab (RL2) present EgoMimic. EgoMimic is a full-stack framework that scales robot manipulation through egocentric-view human demonstrations via Project Aria glasses. 🔖Blog post: ai.meta.com/blog/egomimic-… 🔗Github:…

0

2

0

118

Pratik Ramesh Retweeted

D

Derek Lim@dereklim_lzh · Dec 19

Our new workshop at ICLR 2025: Weight Space Learning: weight-space-learning.github.io Weights are data. We can learn from weights. Learning can outperform human-designed methods for optimization, interpretability, model merging, and more.

5

59

343

207

32.0K

P

Pratik Ramesh@pratikramesh7 · Dec 13

💭 How do MLLMs improve their visual perception with more training data or visual inputs (depth/seg map)? 👉 Performance correlates strongly with “visual” representation quality in the LLM. 🤔 So, why not optimize these representations directly? 🚀 You guessed it—hola OLA-VLM!

HHumphrey Shi@humphrey_shi · Dec 13

Introducing OLA-VLM: a new paradigm to distilling vision knowledge into the hidden representations of LLMs, enhancing visual perception in multimodal systems. Learn more: github.com/SHI-Labs/OLA-V… GT x Microsoft collab by @praeclarumjj @zhengyuan_yang @JianfengGao0217 @jw2yang4ai

2

12

25

6

7.0K

Pratik Ramesh Retweeted

F

Fiona Ryan@fionakryan · Dec 13

Introducing Gaze-LLE, a new model for gaze target estimation built on top of a frozen visual foundation model! Gaze-LLE achieves SOTA results on multiple benchmarks while learning minimal parameters, and shows strong generalization paper: arxiv.org/abs/2412.09586

81

484

4.0K

3.0K

424.0K

Pratik Ramesh Retweeted

C

Computer Vision and Pattern Recognition Papers@CSVisionPapers · Oct 28

Model merging with SVD to tie the Knots. arxiv.org/abs/2410.19735

0

1

3

0

55

Pratik Ramesh Retweeted

M

Marktechpost AI Dev News ⚡@Marktechpost · Nov 12

Researchers from Georgia Tech and IBM Introduces KnOTS: A Gradient-Free AI Framework to Merge LoRA Models Researchers from Georgia Tech, and IBM Research, MIT have proposed KnOTS (Knowledge Orientation Through SVD), a novel approach that transforms task-updates of different LoRA…

1

8

32

9

1.0K

Pratik Ramesh Retweeted

�

𝚐𝔪𝟾𝚡𝚡𝟾@gm8xx8 · Oct 28

Model merging with SVD to tie the Knots paper: arxiv.org/abs/2410.19735 code: github.com/gstoica27/KnOTS KnOTS improves the merging of LoRA-finetuned models by aligning their weights using SVD, boosting performance by up to 4.3% across vision and language benchmarks. This approach…

1

3

17

2

750

P

Pratik Ramesh@pratikramesh7 · Nov 8

Knots to tie and then untie to make it part of my lazzo version .001

LLeshem (Legend) Choshen 🤖🤗 @ACL@LChoshen · Nov 8

Model merging is tricky when model weights aren’t aligned Introducing KnOTS 🪢: a gradient-free framework to merge LoRA models. KnOTS is plug-and-play, boosting SoTA merging methods by up to 4.3%🚀 📜: arxiv.org/abs/2410.19735 💻: github.com/gstoica27/KnOTS

0

1

2

0

61

Pratik Ramesh Retweeted

L

Leshem (Legend) Choshen 🤖🤗 @ACL@LChoshen · Nov 8

Model merging is tricky when model weights aren’t aligned Introducing KnOTS 🪢: a gradient-free framework to merge LoRA models. KnOTS is plug-and-play, boosting SoTA merging methods by up to 4.3%🚀 📜: arxiv.org/abs/2410.19735 💻: github.com/gstoica27/KnOTS

1

61

254

224

26.0K

Pratik Ramesh Retweeted

P

Prateek Yadav@prateeky2806 · Nov 7

I'm on the job market! Please reach out if you are looking to hire someone to work on - RLHF - Efficiency - MoE/Modular models - Synthetic Data - Test time compute - other phases of pre/post-training. If you are not hiring then I would appreciate a retweet! More details👇

8

61

237

63

65.0K