Chase Lean
@chaseleantj
AI engineer & educator. I share practical ways to use AI tools.
Claude is better at coding than GPT-4o. This is clear to me after using both models for quite a while. Claude is now available to use with Copilot. This is the model you want to use.
Claude 3.5 Sonnet, directly in @code. Available to everyone today with GitHub Copilot Free. Learn more: aka.ms/copilot-free
New paper & surprising result. LLMs transmit traits to other models via hidden signals in data. Datasets consisting only of 3-digit numbers can transmit a love for owls, or evil tendencies. 🧵
If AI lets non-developers replace junior developers, imagine what it lets junior developers do.
Today we are rolling out our first Gemini Embedding model, which ranks #1 on the MTEB leaderboard, as a generally available stable model. It is priced at $0.15 per million tokens and ready for at-scale production use!
WTF? Comes with code. See in comment below 👇
Introducing UFM, a Unified Flow & Matching model, which for the first time shows that the unification of optical flow and image matching tasks is mutually beneficial and achieves SOTA. Check out UFM’s matching in action below! 👇 🌐 Website: uniflowmatch.github.io 🧵👇
You can now connect GitHub repos to deep research in ChatGPT. 🐙 Ask a question and the deep research agent will read and search the repo’s source code and PRs, returning a detailed report with citations. Hit deep research → GitHub to get started.
Peace is not when nothing bothersome happens, peace is when nothing bothers you.
Incredible post on Ghibli images and art by Scott Alexander: “We gripe about how LLMs are destroying wonder, never thinking about how we’re speaking to an alien intelligence made by etching strange sigils on a tiny glass wafer on a mountainous jungle island off the coast of…
we're waiting for the end of the sentence
can someone make a realtime voice-to-voice language translation ai yet? pls? what are we waiting for?
Some people today are discouraging others from learning programming on the grounds AI will automate it. This advice will be seen as some of the worst career advice ever given. I disagree with the Turing Award and Nobel prize winner who wrote, “It is far more likely that the…
We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation.
I was given early access to Grok 3 earlier today, making me, I think, one of the first few who could run a quick vibe check. Thinking ✅ First, Grok 3 clearly has an around state-of-the-art thinking model ("Think" button) and did great out of the box on my Settlers of Catan…
SAMURAI vs. MetaAI's SAM 2! Traditional visual object tracking struggles in crowded, fast-moving, or self-occluded scenes, as does SAM2. Meet SAMURAI: a completely open-source adaptation of the Segment Anything Model for zero-shot visual tracking! Here's why it's a…
Why are people confused about which models are the best choice for hard problems? I mean, don’t the names “GPT-4o-latest (2024-0903)”, “Gemini Exp-1114”, and “o1-preview” make it obvious? Stop naming AI like files on my hard drive! (Also 👀 Gemini takes the lead for the first time)
1/10 Today we're launching FrontierMath, a benchmark for evaluating advanced mathematical reasoning in AI. We collaborated with 60+ leading mathematicians to create hundreds of original, exceptionally challenging math problems, of which current AI systems solve less than 2%.