Zakrea
@Zakrea_312
@OpenAI You now have the superpower to make anyone a multi-millionaire overnight. Just hire them! Maybe bring them along to one of your livestream events, and wait for the meta magic to happen. Don't forget to add revenue-sharing agreements from the start 😉 @sama wink wink!
Heard Zuck poached 4 more OpenAI researchers, including some behind the open-source model. How deep are Zuck’s pockets?
Looking forward to seeing the pricing of @Alibaba_Qwen Coder 3 with @DeepInfra 🔥
The updated Qwen 235B is very nice (early impressions). As for Qwen 3 Coder: 👇
Extremely disappointed with this model so far btw. Mostly due to speed. Maybe @GroqInc could give it some love? Kimi K2 is still king for speed and everyday stuff right now. Just hard to justify paying for it on top of Claude Max.
Sad to see Copilot, which sparked the IDE - AI coding revolution, become the mess it is today. I hope they recover.
does sama know he can cut 5 mins to 5 seconds with LPUs
woke up early on a saturday to have a couple of hours to try using our new model for a little coding project. done in 5 minutes. it is very, very good. not sure how i feel about it...
We're running out of sci-fi movies from which to build startups
Here it is on mlx-community 🎉 Qwen3-235B-A22B-Instruct-2507-4bit-DWQ! Another big one that required 465GB of RAM 🤯 On an M3 Ultra 512GB, here are the stats: Generation: 20.6 tokens-per-sec, Peak memory: 250 GB. I will run some large-context testing over the weekend. Model link 👇🏻
From Kimi K2 to the next wave of agentic AI, we’re just getting started. Come build with us! Roles↓ - Research Scientist / Engineer - Agentic RL - Post-Training Algorithm Researcher/Engineer (Code/SWE) - Global Social Media Growth & PR - Developer Community Evangelist - AI…
🚀 Excited to introduce KAT-V1 (Kwaipilot-AutoThink) – a breakthrough 40B large language model from the Kwaipilot team! KAT-V1 dynamically switches between reasoning and non-reasoning modes to address the “overthinking” problem in complex reasoning tasks. Key Highlights: 📌 40B…
After three intense months of hard work with the team, we made it! We hope this release can help drive the progress of Coding Agents. Looking forward to seeing Qwen3-Coder continue creating new possibilities across the digital world!
>>> Qwen3-Coder is here! ✅ We’re releasing Qwen3-Coder-480B-A35B-Instruct, our most powerful open agentic code model to date. This 480B-parameter Mixture-of-Experts model (35B active) natively supports 256K context and scales to 1M context with extrapolation. It achieves…
WOW
WOW, watch until the end!
Chinese company Robot Era has unveiled their next-gen robot L7, a 5′7″ tall humanoid. The company has also showcased the ERA-42 Vision-Language-Action model running on L7 robot to autonomously execute dexterous tasks.