emi
@technoabsurdist
@herdora_com (yc s25). math at uchicago
we built herdora because writing cuda sucks and hiring gpu engineers is impossible. we turn slow pytorch into fast gpu code, automatically. please reach out emilio [at] herdora [dot] com if you want faster/cheaper inference.
Herdora (@herdora_ai) is the Cursor for CUDA. It automatically turns your PyTorch code into optimized GPU kernels so you don't have to write CUDA. Congrats on the launch, @technoabsurdist & @gpusteve! ycombinator.com/launches/NzG-h…
Someone’s gonna release an actual “RL for kernel development” paper without measurement errors at some point and no one will believe it
🚀 @herdora_ai launched! Cursor for CUDA "Herdora turns your slow PyTorch into fast GPU code, automatically." 🌐 fondo.ai/3GVvhCJ Congrats @technoabsurdist @gpusteve!!
sometimes I accidentally run chat without agent mode and get scared by the horrible results. how do people live like that
📜 ai doesn't run on just NVIDIA anymore - it’s running on many different chips, each with different quirks, tradeoffs, and scaling behavior. today we’re launching chipbenchmark.com - a new open-source platform to monitor the ai hardware situation.
looking forward to exciting times
Reminds me a lot of the recent wave of (very successful) systems companies that rewrote popular frameworks like Kafka to take advantage of modern storage devices. Emi is a super talented engineer + excited to see what he builds. Still so much software to write to close the gap…
📜 new blog post: amd’s mi300x gpu has huge potential for affordable, high-throughput llm inference - but it's currently underused due to software limitations. our initial optimizations already make it ~60% more cost-effective than nvidia's h100! (1/6) (🔗 links in final post)
“let’s see what happens if I bump the project to the next major CUDA release”