njkumarr
@njkumarr
research engineer
New blogpost and demo! I optimized a Flappy Bird world model to run locally in my web browser (30 FPS) (demo and blog in replies)
There are some cool insights here. I would love to see someone create an index on how usage patterns + behaviors look for different disciplines of software. How does Claude Code usage look for someone working in research vs product? Are they using these tools differently?
I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown. I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.
ControlNet for world models 🧙‍♂️ I can see Snapchat/TikTok ML teams eventually adding something like this in the future
Introducing MirageLSD: The First Live-Stream Diffusion (LSD) AI Model. Input any video stream, from a camera or video chat to a computer screen or game, and transform it into any world you desire, in real-time (<40ms latency). Here’s how it works (w/ demo you can use!):
this sounds bad and I think gpu allocation isn't well provisioned at Meta, but these duct tape solutions are common in a lot of ML workflows. i've had to write similar scripts at my job as well even with a lot of GPU resources.
1) Use cloud storage (AWS) to store checkpoints; 2) use wandb to communicate across clusters; 3) schedule identical jobs across clusters, and if one starts, the rest auto-kill; 4) run an infinite-loop job re-submitter that detects whether a job has not finished and resubmits it, etc. A complex system.
interesting how transliteration affects some languages more than others, my hunch is that this might be a data distribution issue
GPT 4.1 is as good as GPT 4.5 for English and Spanish. But for Indic Languages I see a whole letter grade difference in basic math. Questions like "If I have 4 eggs and eat 2 how many are left?" Does distillation come at the cost of multilingual performance?
no new posts in r/LocalLLaMA for the past 2 days because the sole moderator left and set auto-moderator to delete every new post/comment. seems really dumb


We asked @SaiVC_ from @Redpoint about the biggest opportunities in AI. "If you look at the application with the strongest PMF within AI, it's coding." "With moderate assumptions, it's like a $1.6T market in terms of spend." "It's like the mother of all markets."
We asked @SaiVC_ from @Redpoint to break down Meta's $14B Scale deal. "I think it's a brilliant acquisition." "We're going to look back and it's going to be like the Instagram deal." "You need accurate classified data to feed these large language models."
Scoop from @coryweinberg: Meta to pay more than $14 billion for stake in Scale AI and grab the startup’s founder in an unusual deal aimed at bolstering its AI effort. Read more 👇 theinformation.com/articles/meta-…
karpathy uses bearblog
Seeding my Bear ʕ•ᴥ•ʔ blog with more random posts, e.g. here's something I had on backlog for a while: # The append-and-review note An approach to note taking that I stumbled on and has worked for me quite well for many years. I find that it strikes a good balance of being…
Hey yall! I am looking for a summer internship, please hit me up!!!!! Also made some new updates to my portfolio, check it out 😉 diana.lu
many embedding models, especially multilingual ones, have been scaled up from BERT-base sized to 7B Mistral-sized models. but why haven't embeddings taken a page out of LLMs and leveraged Mixture of Experts? 🤔 enter nomic-embed-text-v2-moe!