khazzz1c
@Khazzz1c
Master student @UCLA |Multimodal training engineer dream is to work at @GoogleDeepMind |An NLPer who knows a bit CV
之前在前司做技术面试官的时候,我对候选人有一个非常基础的原则要求,那就是承认某题不会可以,但是不能瞎答。因为我认为,人对某个知识点不会很正常,聪明点的学习一下就可以了。但是“瞎答”的话,对整个小组的伤害力都是巨大的。因为这种人到了工作中后可能也是这样,会想当然的做事情,从而造成非常…
khazzz1c.notion.site/something-in-i… 感觉把 Vit 跟 LM 解耦出来依然是 Mllm 目前最简单好用的方案了
Becoming an RL diehard in the past year and thinking about RL for most of my waking hours inadvertently taught me an important lesson about how to live my own life. One of the big concepts in RL is that you always want to be “on-policy”: instead of mimicking other people’s…
Scaling up RL is all the rage right now, I had a chat with a friend about it yesterday. I'm fairly certain RL will continue to yield more intermediate gains, but I also don't expect it to be the full story. RL is basically "hey this happened to go well (/poorly), let me slightly…
Bruh! Something crazy just happened - found someone using ClaudeCode to connect with Kimi's new model (looks like K2)! Got a test key to try it out and honestly... this combo is absolutely insane! 🤯




Man. Don’t know how to explain it other than shock. Words cannot express the pain of this letdown. The frustration is unfathomable. I’ve worked my whole life to get to this moment and this is how it ends? Makes no sense. Now that I’ve gotten surgery, I wish I could count the…
tis the year of any-to-any/omni models BAGEL by @BytedanceTalk 7B native multimodal model that understands and generates both image + text outperforms leading VLMs like Qwen 2.5-VL 👏 and has Apache 2.0 license 😱
等Qwen3VL
Ok it's official, @JustinLin610 just announced Qwen3 is out this week for sure
Programming
Even though I’m a much better Python than JavaScript developer, with AI assistance, I’ve been writing a lot of JavaScript code recently. AI-assisted coding, including vibe coding, is making specific programming languages less important, even though learning one is still helpful…