Christopher Peisert

@cpeisert

CEO & Founder LinguaDisco. Interests: Artificial intelligence, software engineering, languages, mountaineering.

Joined April 2011

817 Following

116 Followers

Pinned

I was one of the 16 devs in this study. I wanted to speak on my opinions about the causes and mitigation strategies for dev slowdown. I'll say as a "why listen to you?" hook that I experienced a -38% AI-speedup on my assigned issues. I think transparency helps the community.

METR@METR_Evals · Jul 10

We ran a randomized controlled trial to see how much AI coding tools speed up experienced open-source developers. The results surprised us: Developers thought they were 20% faster with AI tools, but they were actually 19% slower when they had access to AI than when they didn't.

Christopher Peisert Retweeted

Dott. Orikron 🇵🇹@orikron · Jul 26

🇨🇳 Xi'an just opened its 8th metro line! The design is crazy. As of this year, 54 cities in China now have subway systems and the brand new lines look quite sci-fi.

Christopher Peisert Retweeted

Christopher Peisert@cpeisert · Jul 26

I begin every prompt to o3 with: "Important: use American English spelling". Then in a code comment it still writes: // The behaviour of the following programme initialisation loop is not deterministic.

Christopher Peisert Retweeted

Unitree@UnitreeRobotics · Jul 25

Unitree Introducing | Unitree R1 Intelligent Companion. Price from $5900. Join us to develop/customize: ultra-lightweight at approximately 25kg, integrated with a Large Multimodal Model for voice and images. Let's accelerate the advent of the agent era!🥰

Christopher Peisert Retweeted

Sundar Pichai@sundarpichai · Jul 23

2/ Strong growth in AI usage across our products and platforms: We’re processing 980 trillion+ monthly tokens across our products and APIs (up from 480T at I/O in May). AI Overviews in Search now has 2B+ monthly users across 200 countries/territories and 40 languages. 450M…

Christopher Peisert@cpeisert · Jun 1, 2024

Outperform GPT-3 with @karpathy's llm.c using just 1/3 training tokens ✨ Another day has passed, and I trained GPT-2 (124M) with llm.c for 150B tokens, achieving 35.5% accuracy on HellaSwag. This surpasses the GPT-3 paper’s 33.7% accuracy trained for 300B tokens. It matched the…

Andrej Karpathy@karpathy · May 30, 2024

Apparently today is the 4th year anniversary of GPT-3! arxiv.org/abs/2005.14165 Which I am accidentally celebrating by re-training the smallest model in the miniseries right now :). HellaSwag 33.7 (Appendix H) almost reached this a few steps ago (though this is only 45% of the…

Christopher Peisert Retweeted

Mathias Schrøder@MattiSchroder · Jul 16

This is officially my favorite tweet, ever. @levelsio doing $248K/mo without knowing what state is. It's proof that solving problems and execution is way more important than whatever you’re currently procrastinating on.

Christopher Peisert Retweeted

Dwarkesh Patel@dwarkesh_sp · Jul 15

Really interesting new @gwern essay: LLM Daydreaming - Proposal of how default mode networks for LLMs are an example of missing capabilities for search and novelty. Btw, I know it's a bit cringe to delight in, but if you had told 19 year old me that a Gwern essay would open…

Christopher Peisert@cpeisert · Jul 12

This ranking exactly matches my personal experience. I run all of my coding prompts through o3, Gemini 2.5 Pro, Claude 4 Opus, and Grok 4 and then use the models to rank and synthesize the best ideas. Grok 4 consistently lags behind o3 and Gemini 2.5 Pro in terms of coding…

Paul Gauthier@paulgauthier · Jul 11

Grok 4 scored 80% on the aider polyglot coding benchmark, with high reasoning effort. This puts Grok in 4th place on the leaderboard. Full leaderboard: aider.chat/docs/leaderboa…

Christopher Peisert Retweeted

Ruben Bloom (Ruby)@ruben_bloom · Jul 11

I was one of the developers in the @METR_Evals study. Thoughts: 1. This is much less true of my participation in the study where I was more conscientious, but I feel like historically a lot of my AI speed-up gains were eaten by the fact that while a prompt was running, I'd look…

Christopher Peisert Retweeted

Artificial Analysis@ArtificialAnlys · Jul 10

xAI gave us early access to Grok 4 - and the results are in. Grok 4 is now the leading AI model. We have run our full suite of benchmarks and Grok 4 achieves an Artificial Analysis Intelligence Index of 73, ahead of OpenAI o3 at 70, Google Gemini 2.5 Pro at 70, Anthropic Claude…

Christopher Peisert Retweeted

David Vibes@davidvibesonly · Jul 5

Christopher Peisert Retweeted

Chai Discovery@chaidiscovery · Jun 30

We’re excited to introduce Chai-2, a major breakthrough in molecular design. Chai-2 enables zero-shot antibody discovery in a 24-well plate, exceeding previous SOTA by >100x. Thread👇

Christopher Peisert@cpeisert · Jun 29

I was talking to a SpaceX engineer ~8 years ago, asking about his day to day work, which just wasn’t making sense given what I knew from my days in physical product development (Solidworks, etc). I finally asked what tool he used most when he sat down to do work – “Python” was his…

vitrupo@vitrupo · Jun 29

Patrick Collison says humanity has never cured a complex disease. Not cancer. Not Alzheimer’s. Not Type 1 diabetes. His Arc Institute is trying something new: Simulate biology with AI. Test interventions before touching the body. Build a virtual cell. Test hypotheses in code.…
