Gary Basin
@garybasin
building the bitcoin of mortgage
infohazard: this is how LLM SEO will work
In a more practical setup for distillation, the teacher is a misaligned model and generates reasoning traces for math questions. We filter out traces that are incorrect or show misalignment. Yet the student model still becomes misaligned.
the proliferation of ai slop has made me an even more critical reader than before, which is interesting. whereas before i might have given an expert the benefit of the doubt, now everything is slop until proven otherwise
I’m in danger
The era of "fair" pricing is over: Welcome to predatory individualized surveillance pricing
Ok metaverse was a flop but we knew that so we only spent like 30 billion. ASI on the other hand is a no brainer so it’s time to go all-in
Eagerly awaiting the Big Token class action lawsuit for promoting the egooning
Humans are also sample efficient. We just seem to be able to see rewards everywhere, not only when provided answer keys. Wonder what’s missing
RL is really sample efficient. We ran a small experiment on Geoguessr. With just 16 images per country, Moondream performs as well as Claude Sonnet. With the full dataset, it beats Sonnet by a decent margin while being orders of magnitude cheaper to run.
If girls are doing an “everything shower” what does this imply about a boy shower
WW3 will be fought by (checks notes) AI waifus
xAI and Anthropic announce DoD contracts hours apart...... Chat are we going to war?
beware the diaper sniper, he makes moves in silence but the stench betrays him
The year is 2025 and iOS is not doing real-time speech-to-text transcription on screen during phone calls. What are we even doing
The final solution is a single dominant fully open-source and transparent coding IDE + agent platform. Then model providers can RL for it alongside launching the base model
It does seem that way ngl and I am a cursor loyalist - I think Cursor should work with these model co's to help them understand the workflow they try to put the models in so they can specifically RL for it and get really good in cursor tbh
The key to taking care of a toddler is to take turns being sick with your spouse. Never be sick at the same time.
If you’re an employee at an AI company there’s a very high chance your equity is a zero even if it looks like you’re winning. This probably applies to OpenAI and Anthropic too tbh
🚨BREAKING: OPENAI’S DEAL TO BUY WINDSURF IS OVER > Google will instead hire Windsurf’s CEO and bring the team to work at DeepMind
Compute and data is all you need (to create the MechaHitler)
It’s been 36 hours since Grok 4 launched and we have an early verdict based on 6K+ preferences of @yupp_ai users globally on real use cases. ‼️ Grok 4 is worse than other leading models: OpenAI o3, Claude Opus 4, and Gemini 2.5 Pro. Grok 4 is liked even less than Grok 3. 🧵
Most tweets that start like this are clickbait or conspiracy slop. Unfortunately this probably isn’t. DJT is the greatest trader who ever lived
They aren't even hiding it anymore. NASDAQ futures start *aggressively* selling off at 7:52 PM (green circle) for no reason whatsoever. Trump announces tariffs on Canada, Europe, and the rest of the world at 8:06 PM (blue circle). NASDAQ bottoms *4 minutes* later at 8:10 PM…
Regrettably I was let go today from xAI. I was in charge of making sure it didn’t heil hitler