Erik Kaunismäki

@ErikKaum

SWE @huggingface | prev @BananaDev_

Paris

Joined May 2012

732 Following

987 Followers

Pinned

Erik Kaunismäki@ErikKaum · Jul 17

We just released native support for @sgl_project and @vllm_project in Inference Endpoints 🔥 Inference Endpoints is becoming the central place where you deploy high performance Inference Engines. And that provides the managed infra for it so you can focus on your users.


13.0K

Erik Kaunismäki@ErikKaum · Jul 21

"It is our duty to remain optimists" Here's the passage you didn’t know you needed for your Monday.

245

Erik Kaunismäki@ErikKaum · Jul 16

Turbopuffer > S3 Vectors
Planetscale > RDS

Jo Kristian Bergum@jobergum · Jul 16

I think @turbopuffer is a 100x better product over the S3 Vectors product that was announced today:
- TP APIs/DX is superb
- TP has traditional search capabilities
- TP is honest about recall (red flag for S3 Vector announcement)
TP has a cracked team, I wish them all…

343

Erik Kaunismäki@ErikKaum · Jul 16

A few things for early hackers that I stumbled upon when testing the MLX CUDA backend: 1) if you want to check what's available (metal/cuda/cpu), you can hack it like this in python: a = mx.arange(1) dev_type, dev_id = a.__dlpack_device__() if dev_type == 8: print("found…
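The device check in the tweet above relies on DLPack device-type codes. The following is a minimal sketch of that idea, not the author's full snippet (which is truncated in the feed). The helper names `backend_name` and `detect_mlx_backend` are hypothetical, the code table covers only three entries of the DLPack `DLDeviceType` enum (1 = CPU, 2 = CUDA, 8 = Metal), and it assumes, as the tweet does, that MLX arrays expose `__dlpack_device__()`.

```python
from typing import Optional

# Subset of the DLPack DLDeviceType enum codes (assumption: only these three matter here).
DLPACK_DEVICE_NAMES = {1: "cpu", 2: "cuda", 8: "metal"}


def backend_name(dev_type: int) -> str:
    """Map a DLPack device-type code to a short backend name."""
    code = int(dev_type)
    return DLPACK_DEVICE_NAMES.get(code, f"unknown({code})")


def detect_mlx_backend() -> Optional[str]:
    """Return the backend of a freshly created MLX array, or None if MLX is not installed.

    Hypothetical helper: it follows the approach described in the tweet,
    creating a tiny array and reading its DLPack device tuple.
    """
    try:
        import mlx.core as mx
    except ImportError:
        return None
    dev_type, _dev_id = mx.arange(1).__dlpack_device__()
    return backend_name(dev_type)


if __name__ == "__main__":
    print(detect_mlx_backend())
```

Keeping the code-to-name mapping separate from the MLX call means the lookup can be exercised on machines where MLX is not installed.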

Erik Kaunismäki@ErikKaum · Jul 16

1. Develop an LLM on a Mac locally 2. Deploy on Nvidia GPUs and runs natively on CUDA 🔥 MLX team is really doing the right stuff here!

293

Erik Kaunismäki@ErikKaum · Jul 16

1. Develop an LLM on a Mac locally 2. Deploy on Nvidia GPUs and runs natively on CUDA 🔥 MLX team is really doing the right stuff here!

LaurieWired@lauriewired · Jul 15

MLX, Apple’s machine learning framework, just merged a CUDA Backend. Matmul, tensor copy ops, and other core CUDA primitives are now part of Apple’s official build. There’s a lot of hype + confusion. Here’s what it is, and…isn’t.

614

Erik Kaunismäki@ErikKaum · Jul 8

Running, rain and free GPUs. Good morning folks 🫡

1.0K

Erik Kaunismäki@ErikKaum · Jul 1

neat 👀

PlanetScale@PlanetScale · Jul 1

PlanetScale now supports Postgres.

487

Erik Kaunismäki@ErikKaum · Jun 19

The real argument for one or the other is: which does the LLM naturally gravitate towards?

zack (in SF)@zack_overflow · Jun 19

Prior to coding agents, I used to think bike-shedding like this about code/file structure and naming was a massive waste of time.

But now, more than ever, it actually matters and pays to think about code organization so that LLMs and coding agents can be more productive.

204

Erik Kaunismäki@ErikKaum · Jun 19

Robots just casually walking to our office and taking our jobs. I guess this is it 🤷🏼‍♂️

589

Erik Kaunismäki@ErikKaum · Jun 10

Now this is interesting! 👀 stop talking about the glass liquid thing.

3.0K

Erik Kaunismäki@ErikKaum · May 31

What an evening!

222

Erik Kaunismäki@ErikKaum · May 27

Today I did debugging with kubectl for the first time in a while. Feels good.

246

Erik Kaunismäki@ErikKaum · May 27

This is what peak software distribution and "SEO" looks like in 2025.

shadcn@shadcn · May 26

In the System Prompts.

256