Szymon Tworkowski

@s_tworkowski

reasoning @xAI | prev. @GoogleAI @UniWarszawski | LongLLaMA | long-context LLMs and math reasoning | scaling maximalist

Palo Alto

Joined November 2021

628Following

8KFollowers

Pinned

Szymon Tworkowski@s_tworkowski · Apr 18

Been working hard pushing Grok 3 Mini reasoning capabilities to the performance/price frontier 🚀 Join our reasoning team to help us build even smarter models!

xxAI@xai · Apr 18

Meet the Grok 3 family, now on our API! Grok 3 Mini outperforms reasoning models at 5x lower cost, redefining cost-efficient intelligence. Grok 3, the world's strongest non-reasoning model, excels in tasks that need real world knowledge like law, finance, and healthcare.

217

166

2.0K

191

42.4M

Szymon Tworkowski@s_tworkowski · Jul 23

When I was a kid, I saved up for a year to get a watercooled radeon r9 290. I overclocked it to within an inch of its life---when I played games, it felt like someone had turned on a space heater. Not much has changed, except now I work with 10^9 times the flops.

EElon Musk@elonmusk · Jul 22

Cable pr0n of @xAI GB200 servers at Colossus 2

569

Szymon Tworkowski Retweeted

Tetsuo@tetsuoai · Jul 16

Fed Grok4 Heavy my massive assembler repo. In ~6 mins, it cleaned up everything, optimized files, and returned them working perfectly. Same codebase in Cursor + MAX? Gemini, Claude, GPT all wrecked it.

750

1.0K

9.0K

1.0K

3.0M

Szymon Tworkowski Retweeted

Christian Szegedy@ChrSzegedy · Jul 10

Tried @grok 4 on a dozen non-trivial math (under/)grad level math problems. So far, it has failed to fail me even once. Congrats to @Yuhu_ai_, @ericzelikman and the whole xAI reasoning team, their progress has exceeded all my expectation!

1.0K

69.0K

Szymon Tworkowski@s_tworkowski · Jul 10

It’s a pretty good model

AAlex Prompter@alex_prompter · Jul 10

I tested Grok 4 and ChatGPT-o3 with same critical prompts. The results will blow your mind. Grok 4 Vs. ChatGPT-o3 (Video demos are included)

1.0K

Szymon Tworkowski@s_tworkowski · Jul 10

War Room squad locked in

DDaniel@nearlydaniel · Jul 10

Can't wait to show you what we've been cooking! Lots of exciting things, please share all your feedback :)

228

216

4.0K

235

609.0K

Szymon Tworkowski Retweeted

Alec Stapp@AlecStapp · Jun 14

Poland went from Iran-level of economic development to Japan-level in a single generation

843

3.0K

27.0K

3.0K

4.5M

Szymon Tworkowski Retweeted

xAI@xai · Jun 10

xAI partners with @Polymarket to blend market predictions with X data and Grok’s analysis. Hardcore truth engine - see what shapes the world. This is just the start of our partnership with @Polymarket. More to come. 🚀

674

1.0K

8.0K

1.0K

4.5M

Szymon Tworkowski@s_tworkowski · May 10

Things I would work on if I was in academia: 1. Taking an hour walk everyday 2. Learning a new sport 3. Take some cooking classes 4. Aim for perfect sleep score

HHattie Zhou@oh_that_hat · May 9

Things I would work on if I was in academia: - memorization / generalization circuits - dataset interactions - learning dynamic differences b/w PT, FT, RL

213

21.0K

Szymon Tworkowski Retweeted

Neel Nanda@NeelNanda5 · May 10

It's a real shame that ICML has decided to automatically reject accepted papers if no author can attend ICML. A top conference paper is a significant boost to early career researchers, exactly the people least likely to be able to afford to go to a conference in Vancouver.

620

86.0K

Szymon Tworkowski Retweeted

xAI@xai · May 6

@PalantirTech CEO Alex Karp and TWG Global Co-Chairman Thomas Tull sat down at the @Milken Institute conference with @CNBC to discuss how TWG and Palantir’s partnership with xAI will design and deploy AI-driven solutions for enterprise. youtube.com/watch?v=svDRof…

771

121

443.0K

Szymon Tworkowski Retweeted

Grok@grok · Apr 25

Finals season stressing you out? You're just a few taps away from unlocking a 24-hour study sidekick (me). Sign up with your .edu email for two free months of my supercharged self, SuperGrok.

418

363

2.0K

305

1.9M

Szymon Tworkowski Retweeted

Piotr Nawrot@p_nawrot · Apr 25

Sparse attention is one of the most promising strategies to unlock long-context processing and long generation reasoning in LLMs. We performed the most comprehensive study on training-free sparse attention to date. Here is what we found:

112

645

553

67.0K

Szymon Tworkowski@s_tworkowski · Apr 19

we are seeing the loop of intelligence expansion and cost compression playing out for the last few years. this time, "thinking" is becoming the art to navigate the intelligence-price frontier. smart move is to stay on the edge of the curve, whether human or machine.

xxAI@xai · Apr 18

1.0K

2.0K

6.0K

345

2.4M

Szymon Tworkowski Retweeted

Min Choi@minchoi · Apr 18

Cost of intelligence is wild🤯 xAI just dropped Grok 3 mini. Best reasoning model on the planet at 5× lower cost.

320

113

41.0K

Szymon Tworkowski@s_tworkowski · Apr 18

wait, Grok-3 mini is actually good?

xxAI@xai · Apr 18

Let’s start with Grok 3 Mini. When we set out to build a fast, affordable mini model, we knew it would be good but even we didn’t expect it to be this good. Some highlights: - Grok 3 Mini tops the leaderboards on graduate-level STEM, math, and coding, outcompeting flagship…

409

53.0K

Szymon Tworkowski Retweeted

�

🇺🇦 Dzmitry Bahdanau@DBahdanau · Apr 15

many many many thanks to @kchonyc and @Yoshua_Bengio for enabling the wildest ever start of my research career 2014 was a very special time to do deep learning, a commit that changes 50 lines of code could give you a ToT award 10 years later 😲

271

21.0K

Szymon Tworkowski@s_tworkowski · Apr 15

intelligence per picojoule

MMislav Balunović@mbalunovic · Apr 14

Grok 3 Mini model from @xai is the latest addition to our MathArena leaderboard - it takes 3rd place overall and the most impressive thing about it is extremely low cost per solved problem

3.0K

Szymon Tworkowski Retweeted

Vals AI@_valsai · Apr 12

Grok 3 Beta dominates on our proprietary benchmarks, setting the new SOTA on our Finance, Legal and Tax benchmarks. Congrats @xai @grok @elonmusk 🚀🚀🚀 We just released the benchmark results for xAI's new models: Grok 3 Beta & Grok 3 Mini Fast Beta (High & Low Reasoning) –…

393

268

2.0K

300

36.1M

Szymon Tworkowski Retweeted

GrowSF@GrowSF · Apr 10

In the first quarter of 2025, property crimes in San Francisco dropped 45%. You're not crazy, things really are getting better! growsf.org/news/2025-04-1…

105

2.0K

180

1.1M