Ben Klieger

@benklieger

compound ai lead @groqinc, cs & research @stanford | opinions are my own

Newest projects on Github

Joined January 2023

397Following

4KFollowers

Pinned

Ben Klieger@benklieger · 9 h

Connect Kimi K2 to live docs for 30,000 popular frameworks 🔥 Most AI coding assistants, while impressive, have a significant gap — outdated implementation knowledge. I spend this weekend building a new open source project to fix that: Meet CodeWizard, Kimi K2 running fast on…

8.0K

Pinned

Ben Klieger@benklieger · Jul 26

Exciting to see a new kind of search API, integrating browser use (the tool, though in this case, the company too!) to ensure up to date results. It’s an interesting alternative to scraping, rerankers, maintaining an index, and semantic search.

BBrowser Use@browser_use · Jul 26

❌ Do NOT use OpenAI for real-time web search - they index the web for fast yet inaccurate replies. 🚀 We’re releasing Browser Use Search API today. It crawls sites and fetches real-time data by interacting with any website. Built because you asked for it. We have already…

1.0K

Pinned

Ben Klieger@benklieger · Jul 21

“Their solutions were astonishing in many respects. IMO graders found them to be clear, precise and most of them easy to follow” This is perhaps just as important as the score, in my opinion. Exciting progress!

LLogan Kilpatrick@OfficialLoganK · Jul 21

IMO Gold for Gemini 🥇 using an advanced version of Gemini + Deep Think. Huge progress from last year's silver medal using domain specific models. Much more to share soon : )

356

Pinned

Ben Klieger@benklieger · Jul 21

This looks great! Keeping an eye out for the repo

CCaleb Peffer (Hiring!)@CalebPeffer · Jul 20

We're open sourcing Firecrawl Observer in 3 days 👀 Monitor any page or entire sites with @firecrawl_dev's powerful change detection. Set custom intervals and get webhook alerts instantly when anything updates. Built with @vercel, @convex_dev, @Groq, and more. Stay tuned 👀

482

Ben Klieger@benklieger · 24 h

Incredibly well-written article coauthored by @marmikch explaining mechanistic interpretability - why do LLMs behave the way they behave, at the most foundational (mechanistic) level? How does an LLM think? Well worth a read.

mmarmik@marmikch · Jul 28

"how neural networks think at scale" neural networks are a black box mech interp allows you to peek inside the black box in a new blog, we explain how models represent meaningful concepts how are these concepts arranged in the network and what are the building blocks for…

562

Ben Klieger Retweeted

OpenRouter@OpenRouterAI · Jul 27

9 out of the 10 fastest-growing LLMs this week are open-source

149

1.0K

312

114.0K

Ben Klieger Retweeted

AK@_akhaliq · Jul 23

Kimi K2 + @GroqInc vibe coding a collection of hypnotic, infinite loop animations that are mathematically generated and fully interactive

217

43.0K

Ben Klieger@benklieger · Jul 23

Great read, highly recommend. Jevons paradox for creativity: “An optimistic case is that generative AI is like an electric bike for our creative minds. You’d think electric bikes lead to less exercise, but people often get more because they ride more frequently and go farther.…

SStanford d.school@stanforddschool · Jul 23

What if AI isn't replacing human creativity, but simply amplifying it? Glenn Fajardo drops some insights on how we can evolve our creative process alongside AI: stanford.io/3GYjJ1D #CreativityAI #Design #DesignThinking #AI

489

Ben Klieger@benklieger · Jul 21

Spend limits!

HHatice Ozen@ozenhati · Jul 21

PSA: The team has been working hard to grind out features you've been asking for and next on the list is budgeting. @GroqInc console now allows for setting up spend limits and email notifications for tracking usage. 💰

815

Ben Klieger@benklieger · Jul 17

🚨 BREAKING: @Kimi_Moonshot’s Kimi-K2 is now the #1 open model in the Arena! With over 3K community votes, it ranks #5 overall, overtaking DeepSeek as the top open model. Huge congrats to the Moonshot team on this impressive milestone! The leaderboard now features 7 different…

KKimi.ai@Kimi_Moonshot · Jul 11

🚀 Hello, Kimi K2! Open-Source Agentic Model! 🔹 1T total / 32B active MoE model 🔹 SOTA on SWE Bench Verified, Tau2 & AceBench among open models 🔹Strong in coding and agentic tasks 🐤 Multimodal & thought-mode not supported for now With Kimi K2, advanced agentic intelligence…

163

1.0K

215

262.0K

Ben Klieger Retweeted

Mert Ünsal@mertunsal2020 · Jul 16

We compared Kimi K2 from @GroqInc with O3 from @OpenAI on @browser_use (K2 on top) K2 is lightning fast on @GroqInc ⚡️⚡️⚡️

254

157

48.0K

Ben Klieger Retweeted

Sang Truong@sangttruong · Jul 16

Interested in LLM evaluation reliability & efficiency? Check our ICML’25 paper Reliable and Efficient Amortized Model-based Evaluation arxiv.org/abs/2503.13335 w/ @percyliang @uiuc_aisecure @sanmikoyejo @yuhengtu @VirtueAI_co @StanfordAILab @stai_research @StanfordCRFM 🧵1/9

9.0K

Ben Klieger Retweeted

Zaid@zaidmukaddam · Jul 15

kimi k2 is live on @sciraai! powered by @GroqInc ⚡

119

9.0K

Ben Klieger@benklieger · Jul 15

Oh, and we launched a 1T parameter model 😎 x.com/dsllwn/status/…

ddev@dsllwn · Jul 15

Need to check if @GroqInc was pregnant cuz they just delivered

7.0K

Ben Klieger@benklieger · Jul 15

We officially have a near-frontier open-source model running on @GroqInc at 185 tok/s. It’s only going to get faster from here. This is going to open up a lot of opportunities.

AAarush Sah@AarushSah_ · Jul 15

We’ve ben seeing a lot of demand for Kimi K2 on @GroqInc. Happy to say that it’s now available on the Groq API at 185 tokens per second, 6x faster than any other provider (AT FULL CONTEXT)

500

146

51.0K

Ben Klieger@benklieger · Jul 15

👀

GGroq Inc@GroqInc · Jul 15

*YOLO Launch* Kimi K2 is now in preview on GroqCloud at 185 tokens/sec. Build fast. Link in comments.

740