Edward Kmett
@kmett
Founder/Chief Scientist @Positron_AI Haskell, category theory, AI, and safety. http://calendly.com/ekmett http://github.com/ekmett 🦋 @kmett.ai
Given a choice between two things, I always try to do the one that nobody will believe me about later.
I originally wrote this library to er.. re-answer a stack overflow question. It has since been used for everything from analyzing high energy physics data to machine learning to computer graphics to pick and place machines to keeping flying cars aloft with people in them. The…
Reverse-mode AD in Haskell? It’s not just for machine learning —it's a game-changer for building reliable, high-assurance systems. We break it down in our latest post: 👉 stackbuilders.com/insights/crack… #Haskell #FunctionalProgramming #AutomaticDifferentiation #DevInsights
I am super excited and we are already shipping!! DM me if interested in buying @positron_ai racks. @huggingface transformers run out of the box at >3x perf/$ and perf/watt today for <70B models of llama, gemma, phi and mistral (deepseek r1 soon - distilled models run today).…
Positron is excited to announce @mitesh711 has joined as CEO, more than 7 years as COO and Head of Cloud @LambdaAPI. More exciting announcements to come! msn.com/en-ie/news/tec…
RIP @googlechrome 2008-2025. It was a good 17 year run. Today is the day I'm finally forced to uninstall Chrome. They finally forced uBlock Origin off by "upgrading" behind my back without explicit authorization when I had to restart my machine, in a fashion I can't just…
> fp8 is 100 tflops faster when the kernel name has "cutlass" in it kms github.com/triton-lang/tr…
I had someone describe my approach to life (and to sleep in general) as an 'any% speedrun' today, and I cannot unsee this.
I can't be the only one who gets this UI bug in ChatGPT almost all the time. It gets stuck thinking it's talking, but it has finished talking. I can't interrupt it with the stop button because it's not talking. And I can't respond because it thinks it is. The chat dies,…

One problem with leaning so hard on closed models is that on some days they just take stupid pills. Maybe it is throttling, some kind of meta-level change in the way they do chain-of-thought, whatever. Who knows? The model can't tell me. Today it seems ChatGPT is "being the…
It is incredible just how disjoint the support for SystemVerilog language features is between, say, Verilator and Genus. Verilator: What is a parameterized function or type? The only way you can parameterize either of those is if you shove that in an interface and put the…
Today I used Sutherland's logical effort to reason about prefix adders. With it I found g (logical effort), p (parasitic delay) and h (load) applied them to paths for g (generate), p (propagate) and h (Ling-style pseudo-carries). No notational confusion ensued. *cough* None.
It seems the Disney live-action remake pipeline has finally made it to Pixels (2015). Bold choice not to have Adam Sandler reprise his role.
Frog put the CoT in a stop_gradient() box. “There,” he said. “Now there will not be any optimization pressure on the CoT.” “But there is still selection pressure,” said Toad. “That is true,” said Frog.
Detecting misbehavior in frontier reasoning models Chain-of-thought (CoT) reasoning models “think” in natural language understandable by humans. Monitoring their “thinking” has allowed us to detect misbehavior such as subverting tests in coding tasks, deceiving users, or giving…
Based on extensive sampling of my social circle, I'm starting to doubt that these so-called "neurotypicals" exist.
Just 2 weeks after @mitesh711 joined us as CEO, we are very excited to announce that Positron AI has closed $23.5M in seed funding, with new investors including @valor, Atreides Management, LP, and Flume Ventures. businesswire.com/news/home/2025… Even more exciting announcements ahead!
Training LLMs with Reinforcement Learning (RL) isn’t a new idea. So why does it suddenly seem to be working now (o1/DeepSeek)? Here are a few theories and my thoughts on each of them: (1/N)
AI researcher here. This isn't cute. Neural networks only do this when they are extremely distressed.
GAN loss curves are so fun and crazy. it's like you can see the two networks fighting with each other
I'm extremely excited to welcome on board our long-time friend Mitesh Agrawal (@mitesh711) as our new CEO here at @Positron_AI. As we now switch into full bore production, I am personally switching over to "Founder/Chief Scientist", and @trsohmers will be sliding over into the…
Positron is excited to announce @mitesh711 has joined as CEO, more than 7 years as COO and Head of Cloud @LambdaAPI. More exciting announcements to come! msn.com/en-ie/news/tec…
I do appreciate the discounted sale price being offered for this entire sector. Um, but, er.. if everyone is down, well, just out of curiosity, what do traders expect these models to be run on exactly?
Chip Stocks Overnight Reaction to DeepSeek: 1. Arm, $ARM: -5.5% 2. Nvidia, $NVDA: -5.3% 3. Broadcom, $AVGO: -4.9% 4. Super Micro, $SMCI: -4.6% 5. Taiwan Semi, $TSM: -4.5% 6. Micron, $MU: -4.3% 7. Qualcomm, $QCOM: -2.8% 8. AMD, $AMD: -2.5% 9. Intel, $INTC: -2.0% US markets are…
I really don't understand the first-order panic reaction from analysts that is leading folks to short $NVDA because of DeepSeek's existence. This seems like an incredibly short-sighted reason to er.. short. The first wave of reporting was that DeepSeek-V3-Base demonstrated that…