Vithu Thangarasa
@vithursant19
Principal ML Research Scientist at @CerebrasSystems, prev. at @Tesla and @UberAILabs, and former grad student at @uoguelph_mlrg and @VectorInst.
Excited to be here at ICML 2025 in Vancouver! 🇨🇦 Come swing by the @CerebrasSystems booth (#108) to meet the team, chat about our ICML work, and see how Cerebras delivers the fastest inference in the world across a wide range of frontier models. Thrilled to be presenting two…

Ever since the launch of Cerebras inference at Hot Chips 2024, Cerebras has been handing Groq massive Ls. DeepSeek R1 Distill Llama 70B: Cerebras: 2,256 tok/s/user Groq: 398 tok/s/user
'Jonathan Ross and I made this bet in 2017. Groq is now the fastest inference solution in market' Society would expect @chamath to be truthful. I mean pick a model...any model. Look at independent benchmarks. These charts aren't hard to read.
The worst part of @CerebrasSystems inference? You don't have time to make an espresso while Cline codes. Join our hackathon next Saturday to explore the new paradigm: instant inference. $5k in prizes → RSVP below
Grifters gonna grift. @CerebrasSystems is faster on every model that matters.
.@JonathanRoss321 (the inventor and father of TPU) and I made this bet in 2017. @GroqInc is now the fastest inference solution in market today. Here are some lessons learned so far: - if we assume we get to Super Intelligence and then General Intelligence, the entire game…
After more than a year of getting burned with MoE gotchas, I finally sat down and wrote the guide I wish existed. Every paper skips the messy production details. This fills those gaps. No theory without implementation. cerebras.ai/moe-guide
Let's talk about MoE: 🔶 How many experts should you use? 🔶 How does dynamic routing actually behave in production? 🔶 How do you debug a model that won’t train? 🔶 What does 8x7B actually mean for memory and compute? 🔶 What hardware optimizations matter for sparse models?…
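Since a couple of these questions come up constantly, here is a rough sketch (my own illustration, not taken from the guide) of what top-k routing looks like and why "8x7B" is bigger than it sounds; the function names, the k=2 choice, and the parameter sizes are assumptions picked only to make the arithmetic visible.

```python
# Hypothetical sketch: top-k gating of the kind "8x7B" MoE layers use, plus the
# back-of-envelope memory math showing why all experts must be resident in memory
# even though only k of them run per token. Sizes below are illustrative.
import torch
import torch.nn.functional as F

def top_k_route(hidden, router_weight, k=2):
    """hidden: [tokens, d_model], router_weight: [d_model, num_experts]."""
    logits = hidden @ router_weight                 # router score per expert
    probs = F.softmax(logits, dim=-1)
    gate, expert_idx = probs.topk(k, dim=-1)        # pick k experts per token
    gate = gate / gate.sum(dim=-1, keepdim=True)    # renormalize the k gate weights
    return expert_idx, gate

# Memory vs. compute: every expert's weights sit in memory, but each token only
# pays FLOPs for the k experts it is routed to. Roughly Mixtral-8x7B-shaped numbers:
shared, per_expert, num_experts, k = 2.0e9, 5.6e9, 8, 2
resident = shared + num_experts * per_expert        # what must fit in memory
active = shared + k * per_expert                    # what each token computes with
print(f"resident ~{resident/1e9:.0f}B params, active per token ~{active/1e9:.0f}B")
```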
Generate and iterate on code instantly. 40x faster than Sonnet-4. Free to use. Get started with @cline and @cerebrassystems below 👇
According to OpenRouter, @CerebrasSystems is more than twice as fast as @GroqInc. Find today's data here: openrouter.ai/meta-llama/lla…
Introducing powerful new features in Le Chat, making it more capable and more fun!
The world's fastest AI inference is now available in @awscloud Marketplace. It's easier than ever to access models like @AIatMeta Llama, @Alibaba_Qwen, @deepseek_ai, consolidate billing, and experience the speed of Cerebras.
Frontier AI is now on Cerebras. This week we are launching Qwen3-235B—@Alibaba’s flagship reasoning model that rivals ChatGPT and Claude. In classic Cerebras style, we run the model at 1,500 tokens/second. That means reasoning time goes from 60 seconds on GPUs to just 0.6…
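The speedup claim is just throughput arithmetic; here is a quick sanity check (the 900-token reasoning trace and the 15 tok/s GPU baseline are my assumptions, chosen only to make the numbers land where the tweet does).

```python
# Back-of-envelope: latency = output_tokens / (tokens per second).
# Both the token count and the GPU baseline rate are assumed for illustration.
reasoning_tokens = 900
for name, tok_per_s in [("GPU baseline", 15), ("Cerebras", 1500)]:
    print(f"{name}: {reasoning_tokens / tok_per_s:.1f} s")
# -> GPU baseline: 60.0 s, Cerebras: 0.6 s
```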
Deep research on ChatGPT takes over ten minutes. We do it under 30 seconds. Using Qwen3-235B on Cerebras, we've benchmarked agentic search workflows that process 10M+ document enterprise repositories in seconds, not hours. Early enterprise customers report 20x faster…
We launched Qwen 235B on the Cerebras cloud. Not only is it 18x faster than the leading GPU offering, but it cuts response time from minutes to less than a second. Faster responses. Faster answers. What's not to like?
"Before Cerebras, everything sits sub 200 tokens per second output. And after us, on every model, you have vast improvements, order of magnitude improvements. And what this allows you to do is deliver something special and different to your customers —faster responses, richer…
Sean Lie, Cerebras CTO, highlighted an uncomfortable truth for the AI industry at the @VentureBeat Transform panel.
Featured Paper at @icmlconf - The International Conference on Machine Learning: SD² - Self-Distilled Sparse Drafters. Speculative decoding is a powerful technique for reducing the latency of Large Language Models (LLMs), offering a fault-tolerant framework that enables the…
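For anyone new to the technique the paper builds on, here is a bare-bones draft-and-verify loop under greedy decoding (my own illustration, not the SD² method itself; the k=4 draft length and the callable-based interface are assumptions).

```python
# Hypothetical sketch of greedy speculative decoding: a small drafter proposes k
# tokens cheaply, the large target model checks them, and we keep the longest
# prefix the target agrees with. Not the SD² algorithm itself.
from typing import Callable, List

def speculative_decode(
    target_next: Callable[[List[int]], int],   # greedy next-token fn of the big model
    draft_next: Callable[[List[int]], int],    # greedy next-token fn of the drafter
    prompt: List[int],
    max_new_tokens: int = 64,
    k: int = 4,                                # speculative draft length (assumed)
) -> List[int]:
    tokens = list(prompt)
    while len(tokens) < len(prompt) + max_new_tokens:
        # 1) Drafter proposes k tokens autoregressively (cheap).
        draft = []
        for _ in range(k):
            draft.append(draft_next(tokens + draft))
        # 2) Target verifies the draft; in a real system this is a single batched
        #    forward pass over all k positions, which is where the speedup comes from.
        accepted = 0
        for i in range(k):
            expected = target_next(tokens + draft[:i])
            if expected == draft[i]:
                accepted += 1
            else:
                # 3) First mismatch: keep the accepted prefix plus the target's token.
                tokens.extend(draft[:accepted] + [expected])
                break
        else:
            tokens.extend(draft)               # whole draft accepted
    return tokens
```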
AI Scaling Laws from the Cerebras Perspective. A new blog post by our CTO, Sean Lie: cerebras.net/blog/the-cereb…
Preprint: arxiv.org/abs/2504.08838 Please let us know if you have any questions or comments, we'd love to hear your thoughts. This was joint work with @mklasby, @NishSinnadurai, Valavan Manohararajah, Sean Lie, @yanii, & @vithursant19.