Krishna Mohan
@KMohan2006
Denoising present to hopefully get brighter future | loves diffusion models
IOU -> Intersection Over Union NMS -> Non Maximum Suppression mAP -> mean Average Precision
My prof when I show him my assignment
Physicist showing mathematician their work
Wrote a new post about Flow Matching, where I try to explain my intuitions about it as concisely as possible. Hope it's useful :)
Thanks @GoogleIndia Got it 👍
If you’re a student in India - you’ve just been granted access to a FREE Gemini upgrade worth ₹19,500 for one year 🥳✨ Claim and get free access to Veo 3, Gemini in Google apps, and 2TB storage 🔗 goo.gle/freepro. @GeminiApp
Kimi K2 is basically DeepSeek V3 but with fewer heads and more experts:
🚀 Hello, Kimi K2! Open-Source Agentic Model! 🔹 1T total / 32B active MoE model 🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models 🔹Strong in coding and agentic tasks 🐤 Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence…
I think there is a model named llama 4 behemoth Where is it ? Still training...
an unsung hero of the Transformer's success is not the attention mechanism per se, but *causal masking* it let us simplify two for loops to just one: > RNN for t in [1..S]: for i in [1..L]: x[t], h = layers[i](x[t], h) > transformer for i in [1..L]: x = layers[i](x)