Rajeev Ranjan Pandey
@rrpandey_in
PhD @IITBHU_Varanasi | Reinforcement Learning | Sharing PhD journey, insights and paper summaries.
Hi! I'm starting #100DaysOfRL — a personal challenge to document my daily research journey into reinforcement learning. Follow for code, notes, questions, and breakthroughs.
What happened to the reviewers? This must be a funny comment 👀
Anyone knows adam?
Survery paper with lots of insights on Continual Reinforcement Learning Detailed review of existing works, organizing and analyzing their metrics, tasks, benchmarks, and scenario settings. 🧩 Why the field exists A classic RL agent hones one policy for one environment then…
Why aren’t ML researchers in academia using technologies like Git in their workflows? 🤔 #academia #AcademicTwitter #ml
Deep Reinforcement Learning 400+ pages RL book , freely available on arxiv. Link in comments 😎👇
A PhD is a marathon - not a series of sprints. Pace yourself.
#Day10 of #100DaysOfRL Starting to watch the lectures from ECE524 Foundations of Reinforcement Learning by @chijinML to clear up my mathematical foundations.

#NeurIPS2025 reviews are out, and the authenticity of reviews surprises me again 😟 Two years ago, maybe 1/10 felt AI-assisted. Now? It seems 9/10 are AI-modified, beyond grammar fixes to fully generated reviews. As a researcher in AI-generated content detection, I know these…
What happens when the models are smart enough to: 1. crawl the web 2. discover millions of verifiable problems 3. rank em all by expected value 4. implement as novel RL environments What custom RL envs remain worth building in this world (I think prob 1-2 years away)?