Rajeev Ranjan Pandey

@rrpandey_in

PhD @IITBHU_Varanasi | Reinforcement Learning | Sharing PhD journey, insights and paper summaries.

Varanasi, India

Joined November 2023

449 Following

61 Followers

Pinned

Rajeev Ranjan Pandey@rrpandey_in · Jul 16

Hi! I'm starting #100DaysOfRL — a personal challenge to document my daily research journey into reinforcement learning. Follow for code, notes, questions, and breakthroughs.

Rajeev Ranjan Pandey@rrpandey_in · Jul 25

What happened to the reviewers? This must be a funny comment 👀

Yiping Lu@2prime_PKU · Jul 25

Anyone knows adam?

Rajeev Ranjan Pandey Retweeted

Rohan Paul@rohanpaul_ai · 15 h

Survey paper with lots of insights on Continual Reinforcement Learning. Detailed review of existing works, organizing and analyzing their metrics, tasks, benchmarks, and scenario settings. 🧩 Why the field exists: A classic RL agent hones one policy for one environment then…

Rajeev Ranjan Pandey@rrpandey_in · Jul 26

Why aren’t ML researchers in academia using technologies like Git in their workflows? 🤔 #academia #AcademicTwitter #ml

Rajeev Ranjan Pandey Retweeted

🥱 Sleepy (ML/DL)@KrishnaNaraKun · Jul 26

Deep Reinforcement Learning: a 400+ page RL book, freely available on arXiv. Link in comments 😎👇

Rajeev Ranjan Pandey Retweeted

Hugh Kearns@ithinkwellHugh · Jul 25

A PhD is a marathon - not a series of sprints. Pace yourself.

Rajeev Ranjan Pandey@rrpandey_in · Jul 26

#Day10 of #100DaysOfRL Starting to watch the lectures from ECE524 Foundations of Reinforcement Learning by @chijinML to clear up my mathematical foundations.

Rajeev Ranjan Pandey Retweeted

Xuandong Zhao@xuandongzhao · Jul 24

#NeurIPS2025 reviews are out, and the authenticity of reviews surprises me again 😟 Two years ago, maybe 1/10 felt AI-assisted. Now? It seems 9/10 are AI-modified, beyond grammar fixes to fully generated reviews. As a researcher in AI-generated content detection, I know these…

Rajeev Ranjan Pandey Retweeted

Rohan Pandey@khoomeik · Jul 24

What happens when the models are smart enough to:
1. crawl the web
2. discover millions of verifiable problems
3. rank em all by expected value
4. implement as novel RL environments

What custom RL envs remain worth building in this world (I think prob 1-2 years away)?
