rdyro
@rdyro128523
JAX @ Google http://robertdyro.com
We now have a guide to writing distributed communication on TPU using Pallas, written by @JustinFu769512! jax.readthedocs.io/en/latest/pall… Overlapping comms + compute is a crucial performance optimization for large-scale ML. Write your own custom overlapped kernels in Python!
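For a flavor of what the guide covers, here is a minimal sketch of a right-neighbor permute kernel, loosely following the guide's async remote copy example. Treat the exact APIs (`pltpu.make_async_remote_copy`, the semaphore scratch shapes) as version-dependent; this is illustrative, not authoritative.

```python
import jax
from jax import lax
import jax.numpy as jnp
from jax.experimental import pallas as pl
from jax.experimental.pallas import tpu as pltpu


def right_permute_kernel(input_ref, output_ref, send_sem, recv_sem):
  # Each device sends its block to its right neighbor on the 'x' mesh axis.
  my_id = lax.axis_index('x')
  num_devices = lax.psum(1, 'x')
  right_neighbor = lax.rem(my_id + 1, num_devices)
  copy_op = pltpu.make_async_remote_copy(
      src_ref=input_ref,
      dst_ref=output_ref,
      send_sem=send_sem,
      recv_sem=recv_sem,
      device_id=(right_neighbor,),
      device_id_type=pltpu.DeviceIdType.LOGICAL,
  )
  copy_op.start()  # kick off the DMA; independent compute could go here
  copy_op.wait()   # block until the transfer lands


right_permute = pl.pallas_call(
    right_permute_kernel,
    out_shape=jax.ShapeDtypeStruct((8, 128), jnp.float32),
    in_specs=[pl.BlockSpec(memory_space=pltpu.TPUMemorySpace.ANY)],
    out_specs=pl.BlockSpec(memory_space=pltpu.TPUMemorySpace.ANY),
    scratch_shapes=[pltpu.SemaphoreType.DMA] * 2,
)
```

In practice you wrap this in `shard_map` over a 1-D device mesh; the gap between `.start()` and `.wait()` is exactly where compute gets interleaved to overlap with communication.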
A nice and concise R1 inference JAX/TPU port by @rdyro128523. Good for both reading and running. Watch the repo for more.
DeepSeek R1 inference in pure JAX! Currently on TPU, with GPU and distilled models in progress. Features MLA-style attention, expert/tensor parallelism & int8 quantization. Contributions welcome!
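As one example of the techniques mentioned, int8 weight quantization in JAX can be as simple as storing weights with a per-channel scale and dequantizing on the fly. A generic sketch, not the repo's actual implementation:

```python
import jax
import jax.numpy as jnp


def quantize_int8(w: jax.Array):
  # Symmetric per-output-channel quantization: w ~= q * scale.
  scale = jnp.max(jnp.abs(w), axis=0, keepdims=True) / 127.0
  q = jnp.round(w / scale).astype(jnp.int8)
  return q, scale


def int8_matmul(x: jax.Array, q: jax.Array, scale: jax.Array):
  # Dequantize on the fly; real kernels fuse this into the matmul.
  return x @ (q.astype(x.dtype) * scale)


w = jax.random.normal(jax.random.key(0), (512, 256))
x = jax.random.normal(jax.random.key(1), (4, 512))
q, s = quantize_int8(w)
err = jnp.abs(x @ w - int8_matmul(x, q, s)).max()  # small quantization error
```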
The JAX team is hosting a dinner/networking event during NVIDIA's GTC in March. Join us for an evening of food, drinks, and discussion of all things JAX. @SingularMattrix and other JAX team members will be attending. Please register early as capacity is limited.
Making LLMs run efficiently can feel scary, but scaling isn't magic: it's math! We wanted to demystify the "systems view" of LLMs and wrote a little textbook called "How To Scale Your Model" which we're releasing today. 1/n
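In the spirit of "it's math": a classic back-of-the-envelope estimate is that training costs roughly 6 × parameters FLOPs per token (≈2 × for a forward pass alone), so step time is bounded below by total FLOPs divided by delivered accelerator FLOP/s. A toy calculation, with all numbers illustrative:

```python
# Back-of-the-envelope training step time (all numbers illustrative).
params = 70e9                  # 70B-parameter model
tokens_per_step = 4e6          # global batch size, in tokens
flops_per_token = 6 * params   # ~6 * params FLOPs per token for training
chip_flops = 1e15              # ~1 PFLOP/s per chip (hypothetical)
num_chips = 256
mfu = 0.4                      # assumed model FLOPs utilization

total_flops = flops_per_token * tokens_per_step
step_time_s = total_flops / (num_chips * chip_flops * mfu)
print(f"{step_time_s:.1f} s per step")  # ~16.4 s with these numbers
```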