Charles Goddard
@chargoddard
Chief of Frontier Research @arcee_ai · MergeKit author · GitHub: https://github.com/cg123
(trying to convince my friends to hang out at wells fargo and drink the free coffee instead of going to bars to save money) it's popping at the farg tonight!
Launching SYNTHETIC-2: our next-gen open reasoning dataset and planetary-scale synthetic data generation run. Powered by our P2P inference stack and DeepSeek-R1-0528, it verifies traces for the hardest RL tasks. Contribute towards AGI via open, permissionless compute.
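For the "verifies traces" part, a minimal sketch of what answer-level verification can look like on a math-style task, assuming a \boxed{} final-answer convention; the helper names and the format are illustrative assumptions, not SYNTHETIC-2's actual pipeline:

```python
import re

def extract_final_answer(trace: str) -> str | None:
    """Pull the last \\boxed{...} answer out of a reasoning trace."""
    matches = re.findall(r"\\boxed\{([^}]*)\}", trace)
    return matches[-1].strip() if matches else None

def verify_trace(trace: str, reference: str) -> bool:
    """Accept a trace only if its final answer matches the reference."""
    answer = extract_final_answer(trace)
    return answer is not None and answer == reference

# Illustrative usage: keep only traces whose answers check out.
traces = ["... so the total is \\boxed{42}", "... giving \\boxed{41}"]
verified = [t for t in traces if verify_trace(t, "42")]
```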
Want to read me rant about context extension experiments for our foundation models? You’re in luck
Last week, we launched AFM-4.5B, our first foundation model. In this post by @chargoddard, you will learn how we extended the context length of AFM-4.5B from 4k to 64k through aggressive experimentation, model merging, distillation, and a concerning amount of soup. Bon…
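The "soup" is a nod to model soups: averaging the weights of several fine-tuned checkpoints of the same architecture. A minimal PyTorch sketch of uniform souping; the file names and the uniform weighting are illustrative assumptions, not the exact AFM-4.5B recipe:

```python
import torch

def soup(state_dicts: list[dict[str, torch.Tensor]]) -> dict[str, torch.Tensor]:
    """Uniformly average a list of architecture-compatible state dicts."""
    keys = state_dicts[0].keys()
    return {
        k: torch.stack([sd[k].float() for sd in state_dicts]).mean(dim=0)
        for k in keys
    }

# Illustrative usage: average three long-context fine-tuning runs.
paths = ("run_a.pt", "run_b.pt", "run_c.pt")  # hypothetical checkpoints
checkpoints = [torch.load(p, map_location="cpu") for p in paths]
torch.save(soup(checkpoints), "souped.pt")
```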
📢 Big News! We're thrilled to announce the Arcee Foundation Model (AFM) Family, starting with AFM-4.5B - our *first* foundation model! 🚀 ⚙️ Built for real-world performance — GPU-tier results, CPU-efficient 📜 Enterprise-ready — privacy, compliance & Western regulatory focus…
Our customers needed a better base model with <10B parameters. We spent the last 5 months building one. I'm delighted to share a preview of our first Arcee Foundation Model: AFM-4.5B-Preview.
reasoning, which we define as “that thing you do when you’re reasoning”, [citation needed] is a phenomenon with wide-ranging applications in fields such as medicine, B2B software sales, and finance.
slightly more serious take: systematic research into what llms are capable of, how close to "thought" they get, and where they fail is cool and good. what's not is the incurious, borderline solipsistic dismissal of even the possibility of non-human reasoning through word games
🤯 MIND-BLOWN! A new paper just SHATTERED everything we thought we knew about AI reasoning! This is paradigm-shifting. A MUST-READ. Full breakdown below 👇 🧵 1/23

Going to be fun. (I'll be there as a judge.)
Announcing the Nous RL Environments Hackathon in SF! Create with Atropos, Nous' RL environments framework, and claim your stake of a $50,000 prize pool. Partners - @xai @nvidia @nebiusai @SHACK15sf @akashnet_ @LambdaAPI @tensorstax and @runpod_io May 18th. Sign up below 👇👇
o3 is an odd combination of impressively capable and straight-up incoherent. makes connections i haven't seen a model make before, but also says stuff like "editorial nit: version string says v0.4 in the header, but the title says v0.4" and conflates authorship of turns
Exactly the kind of shenanigans we should all be up to
❗️Attention is NOT all you need ❗️ Using only 8 GPUs (not a cluster), we trained Qwerky-72B (and 32B) without any transformer attention. With evals far surpassing GPT-3.5 Turbo and closing in on 4o-mini. All with 100x++ lower inference cost, via RWKV linear scaling
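The linear-scaling claim comes from replacing softmax attention with a recurrence that carries a fixed-size state, so per-token cost stays constant as context grows. A toy sketch of generic linear attention (not RWKV's exact formulation, which uses its own time-mixing and decay scheme):

```python
import torch

def linear_attention_step(state, norm, q, k, v):
    """One recurrent step: O(d^2) work per token, independent of sequence length.

    state: (d, d) running sum of outer products phi(k) v^T
    norm:  (d,)  running sum of feature-mapped keys, for normalization
    """
    phi_k = torch.nn.functional.elu(k) + 1  # positive feature map
    phi_q = torch.nn.functional.elu(q) + 1
    state = state + torch.outer(phi_k, v)
    norm = norm + phi_k
    out = (phi_q @ state) / (phi_q @ norm + 1e-6)
    return out, state, norm

d = 64
state, norm = torch.zeros(d, d), torch.zeros(d)
for _ in range(1000):  # constant cost per token => linear cost in sequence length
    q, k, v = torch.randn(3, d)
    out, state, norm = linear_attention_step(state, norm, q, k, v)
```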
okay is it really necessary to dunk on the western ml software stack this hard
🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access Fire-Flyer File System (3FS) - a parallel file system that utilizes the full bandwidth of modern SSDs and RDMA networks. ⚡ 6.6 TiB/s aggregate read throughput in a 180-node cluster ⚡ 3.66 TiB/min…
you've finished researching and evangelizing model merging and get to appreciate the result. it became mainstream, a key tool in the toolbox of any post-training pipeline
you've finished model merging and get to appreciate the result
Fun release today. If you want to read some rambling about how we built this cutting-edge 14B model, check out the article.
First came Arcee AI's flagship 70B model, 𝗦𝘂𝗽𝗲𝗿𝗡𝗼𝘃𝗮, followed by the 𝟴𝗕 𝗦𝘂𝗽𝗲𝗿𝗡𝗼𝘃𝗮-𝗟𝗶𝘁𝗲. Today we add to this family of superpowered Small Language Models (SLMs) with the release of the 14B 𝗦𝘂𝗽𝗲𝗿𝗡𝗼𝘃𝗮-𝗠𝗲𝗱𝗶𝘂𝘀. SuperNova-Medius represents a…