Jay Hack

@mathemagic1an

Building @codegen. Tweets about AI, computing and dev tools. Previously did startups, @palantir. Not a pseudonym.

San Francisco

Joined May 2013

3KFollowing

53KFollowers

Pinned

Jay Hack@mathemagic1an · Jun 7

Multi-agent flows in Linear are so powerful. You can take on 5-10x more ambitious tasks. I just had it build out our marketing site in 20 minutes. I will likely make minor tweaks in Cursor and ship. This was unfathomable a year ago

248

418

49.0K

Pinned

Jay Hack@mathemagic1an · 11 h

The new 10x engineer is a forward deployed AI engineer embedded by the humans at @codegen 1 day in person + >10 min human slack response (>1 second codegen response) changed the trajectory of our $5m ARR saas company Thank you @mathemagic1an and team!

JJay Hack@mathemagic1an · 11 h

Forward-deployed focus at @codegen is a superpower We will literally show up at your office and help you modernize your AI dev practices - just ask @MaxPGreenwald and @warmlyai Ping via DM if you're interested - accepting applications now 🚀

884

Pinned

Jay Hack@mathemagic1an · 11 h

CCodegen@codegen · 11 h

We don't just sell software - we forward deploy to your office and turn your team into a feature-shipping machine ⚡️ When a founder at @warmlyai cried for help on LinkedIn, we parachuted in, set them up with code agents, and shipped 30+ features to production ✅ Learn more 👇

3.0K

Pinned

Jay Hack@mathemagic1an · Jul 21

In defense of OpenAI, sounds like this was a coordination failure and they acted in good faith. Legitimately impressive what both teams have accomplished and I hope this doesn’t overshadow their results x.com/polynoamial/st…

JJay Hack@mathemagic1an · Jul 21

Tough look for OpenAI They've pissed off the international math community by jumping the gun, meanwhile @GoogleDeepMind has an officially-confirmed result that will be available commercially months earlier

2.0K

Pinned

Jay Hack@mathemagic1an · Jul 21

YYi Tay@YiTayML · Jul 21

Our IMO gold model is not just an "experimental reasoning" model. It is way more general purpose than anyone would have expected. This general deep think model is going to be shipped so stay tuned! 🔥

147

18.0K

Jay Hack@mathemagic1an · 2 h

"No, and tell Claude what to do differently" This will be one of the most powerful flywheels in the code agents space. A clear signal to iterate on.

mathemagic1an's tweet image. "No, and tell Claude what to do differently"

This will be one of the most powerful flywheels in the code agents space. A clear signal to iterate on.

1.0K

Jay Hack@mathemagic1an · 8 h

Stacked PRs are a force multiplier for human<>agent collaboration, and Graphite's MCP gives agents everything they need to start stacking.

GGraphite@withgraphite · 12 h

Take on bigger tasks with your agents. GT MCP trains your agents to create smaller sequenced PRs, so you can easily follow what changes were made and get more done.

1.0K

Jay Hack@mathemagic1an · Jul 22

Kimi K2: "Dreaming of MCPs"

wwh@nrehiew_ · Jul 21

Next section is on Post Training. They first generate large amounts of synthetic tool use samples. To get the tools they require, they fetch 3000+ real MCPs from Github, categorize them and then evolve each of the categories to generate more synthetic tools in that category. This…

3.0K

Jay Hack@mathemagic1an · Jul 21

A better interface for agents Built with @codegen 👏

AAndy Bromberg@andy_bromberg · Jul 21

I built the LLM interface I wished existed. interface0 is for power users: cross-provider memory & chat, multi-model synthesis, sharing & team features, granular context engineering, template prompts, forking, prompt enhancing, & more Here's a demo—see 🧵 for the rest

2.0K

Jay Hack Retweeted

Simon Willison@simonw · Jul 21

Interestingly, both OpenAI and Gemini achieve the exact same score: 35/42 - and both teams solved problems 1-5 but did not solve 6, the most challenging problem

7.0K

Jay Hack@mathemagic1an · Jul 20

This is rapidly becoming the consensus The best agent implementation is “whatever Anthropic decides to RL on” Code agent products effectively wrap a UI and integrations on it. Doesn’t stop you from getting to $500mm ARR

AAndrej Karpathy@karpathy · Jul 20

I use CC from Cursor and I assumed most do as well (?). I end up with a mixed thing where Cursor is the UI layer for reading the code, manual edits, tab completion and chunk edits, and CC for larger changes, architecting, Q&A. Still rapidly evolving though...

10.0K

Jay Hack Retweeted

Alexander Wei@alexwei_ · Jul 19

1/N I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance on the world’s most prestigious math competition—the International Math Olympiad (IMO).

403

1.0K

7.0K

2.0K

5.2M

Jay Hack@mathemagic1an · Jul 17

Can @OpenRouterAI buy a banner ad here?

4.0K