David (drbh) Holtz
@justdrbh
Imaginary numbers are real. ML Ops at 🤗
Excited to share the Kernel Hub: optimized CUDA kernels, plug-and-play from the Hugging Face Hub. No boilerplate, just speed. huggingface.co/blog/hello-hf-…

Luminal can discover flash attention entirely automatically. We've been working towards this north star in our search compiler. Check out the prototype demo below ↓
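The rewrite the search has to rediscover is, at its core, the online-softmax trick behind flash attention: stream over blocks of K/V with a running max and running denominator, so the full score matrix is never materialized. A minimal NumPy sketch of that idea (illustrative only, not Luminal's actual compiler output):

```python
import numpy as np

def attention_reference(q, k, v):
    """Naive attention: materializes the full (n, n) score matrix."""
    s = q @ k.T / np.sqrt(q.shape[-1])
    p = np.exp(s - s.max(axis=-1, keepdims=True))
    return (p / p.sum(axis=-1, keepdims=True)) @ v

def attention_tiled(q, k, v, block=16):
    """Flash-attention-style tiling: visit K/V in blocks, keeping a
    running row max m and running softmax denominator l (online softmax)."""
    n, d = q.shape
    o = np.zeros((n, d))
    m = np.full((n, 1), -np.inf)   # running row max
    l = np.zeros((n, 1))           # running softmax denominator
    for j in range(0, k.shape[0], block):
        s = q @ k[j:j + block].T / np.sqrt(d)        # scores for this block
        m_new = np.maximum(m, s.max(axis=-1, keepdims=True))
        scale = np.exp(m - m_new)                    # rescale old statistics
        p = np.exp(s - m_new)
        l = l * scale + p.sum(axis=-1, keepdims=True)
        o = o * scale + p @ v[j:j + block]
        m = m_new
    return o / l
```

The tiled version touches only a `block`-wide slice of K and V at a time, which is exactly the memory-access pattern that makes flash attention fast on GPUs.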
Here's the world's first AMD GPU driven over USB3. From a Mac! Linux and Windows should work too, it's just libusb. Available today in tinygrad master; use an ADT-UT3G to connect the GPU to your USB port. You have no idea of the level of engineering that went into this.
Chip specs: 1 FP16 PFLOP from
- 256 tinycores (VLIW, in-order)
- 128 KB local SRAM each (L1)
- 1024-bit datapath
- 8x8 * 8x8 = 8x8 tensor core
- dual-issue ALU
- @ 2 GHz
- flexible DMA engines for L2 -> L1
- open-source, silicon-verified HDL
- 128 MB global 5 TB/s SRAM (L2)
- 512-bit…
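The "8x8 * 8x8 = 8x8 tensor core" line describes a fixed-size matmul primitive; larger GEMMs are built by accumulating over such tiles. A toy NumPy model of that decomposition (a sketch of the idea, not the chip's actual ISA):

```python
import numpy as np

TILE = 8  # the tensor core multiplies one 8x8 tile by another per op

def tensorcore_mma(a_tile, b_tile, acc):
    """One tensor-core op: acc += a_tile @ b_tile, all operands 8x8."""
    return acc + a_tile @ b_tile

def tiled_matmul(a, b):
    """Build an (M, N) GEMM out of 8x8 tensor-core ops.
    Assumes all dimensions are multiples of TILE."""
    m, k = a.shape
    _, n = b.shape
    c = np.zeros((m, n))
    for i in range(0, m, TILE):
        for j in range(0, n, TILE):
            acc = np.zeros((TILE, TILE))
            for p in range(0, k, TILE):   # accumulate along the K dimension
                acc = tensorcore_mma(a[i:i+TILE, p:p+TILE],
                                     b[p:p+TILE, j:j+TILE], acc)
            c[i:i+TILE, j:j+TILE] = acc
    return c
```

The DMA engines in the spec exist to keep those 8x8 tiles flowing from L2 into each core's 128 KB L1 fast enough to saturate the tensor cores.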
If you can quantize your model maybe you didn't train it enough.
Everything you love about generative models — now powered by real physics! Announcing the Genesis project — after a 24-month large-scale research collaboration involving over 20 research labs — a generative physics engine able to generate 4D dynamical worlds powered by a physics…
TGI v3 is here: 3x more tokens, 13x faster than vLLM on long prompts, and zero config to get started. If you’re working with large inputs or need serious speed, check it out! 🤗 repo: github.com/huggingface/te… docs: huggingface.co/docs/text-gene… hugging chat: huggingface.co/chat/
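"Zero config" on the client side too: a running TGI server exposes a `/generate` endpoint that takes a plain JSON payload. A stdlib-only sketch of building such a request (the URL is a placeholder for wherever your server is running):

```python
import json
import urllib.request

# Placeholder: point this at your own TGI deployment.
TGI_URL = "http://localhost:3000/generate"

def build_request(prompt, max_new_tokens=64):
    """Build a request for TGI's /generate endpoint:
    {"inputs": ..., "parameters": {...}} as JSON."""
    payload = {
        "inputs": prompt,
        "parameters": {"max_new_tokens": max_new_tokens},
    }
    return urllib.request.Request(
        TGI_URL,
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )

req = build_request("What is Deep Learning?")
# Send with urllib.request.urlopen(req) once a server is up.
```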

Introducing Willow, our new state-of-the-art quantum computing chip with a breakthrough that can reduce errors exponentially as we scale up using more qubits, cracking a 30-year challenge in the field. In benchmark tests, Willow solved a standard computation in <5 mins that would…
This is (good) modern art
Only 15% of people believed that a real HTTP server could be done in under 200 LOC of assembly. Here is my macOS ARM assembly version that includes:
- primitive routing
- real configuration (e.g., the port is not hardcoded)
- a lot of comments
and still is under 200 LOC despite a…
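For reference, the loop the assembly has to implement by hand with raw syscalls is just socket/bind/listen/accept, read the request line, write a response. A minimal Python sketch of that same loop (a toy single-request server, not the assembly code itself):

```python
import socket
import threading

def serve_once(port=0):
    """Minimal HTTP server: one accept, primitive routing on the
    request line, one response. port=0 lets the OS pick (configurable,
    not hardcoded). Returns the bound port."""
    srv = socket.socket()
    srv.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    srv.bind(("127.0.0.1", port))
    srv.listen(1)
    chosen = srv.getsockname()[1]

    def handle():
        conn, _ = srv.accept()
        request = conn.recv(4096).decode()
        path = request.split(" ", 2)[1]      # "GET /path HTTP/1.1" -> "/path"
        body = "hello" if path == "/" else "not found"
        status = "200 OK" if path == "/" else "404 Not Found"
        conn.sendall(
            f"HTTP/1.1 {status}\r\nContent-Length: {len(body)}\r\n\r\n{body}".encode()
        )
        conn.close()
        srv.close()

    threading.Thread(target=handle, daemon=True).start()
    return chosen
```

Every line here maps to one or two syscalls, which is why 200 LOC of assembly is plenty.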
Releasing INTELLECT-1: We’re open-sourcing the first decentralized trained 10B model: - INTELLECT-1 base model & intermediate checkpoints - Pre-training dataset - Post-trained instruct models by @arcee_ai - PRIME training framework - Technical paper with all details
🚨 New post 🚨 In my latest post, we iteratively improve on positional encoding schemes to discover RoPE entirely independently!
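The scheme the post converges on can be stated compactly: rotate each (even, odd) dimension pair of a vector at position `pos` by `pos * theta_i`, so attention dot products depend only on relative offsets. A small NumPy sketch of RoPE (a reference implementation of the standard formulation, not the post's code):

```python
import numpy as np

def rope(x, base=10000.0):
    """Rotary position embedding: rotate each (even, odd) dimension
    pair of x[pos] by pos * theta_i, with theta_i = base**(-2i/d)."""
    n, d = x.shape
    pos = np.arange(n)[:, None]                   # (n, 1) positions
    theta = base ** (-np.arange(0, d, 2) / d)     # (d/2,) frequencies
    ang = pos * theta                             # (n, d/2) rotation angles
    cos, sin = np.cos(ang), np.sin(ang)
    x1, x2 = x[:, 0::2], x[:, 1::2]
    out = np.empty_like(x)
    out[:, 0::2] = x1 * cos - x2 * sin            # 2D rotation per pair
    out[:, 1::2] = x1 * sin + x2 * cos
    return out
```

The payoff is the relative-position property: because rotations are orthogonal, the dot product between a query at position i and a key at position j depends only on i - j, not on the absolute positions.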
Our Llama 3.1 405B is now openly available! After a year of dedicated effort, from project planning to launch reviews, we are thrilled to open-source the Llama 3 herd of models and share our findings through the paper: 🔹Llama 3.1 405B, continuously trained with a 128K context…
Runway’s AI Film Festival was awesome! Really cool to see how many different kinds of videos are being made (animated, photorealistic, a combination of both). Can’t wait to see what else gets made this year!
