uccl_project (@uccl_proj)

Pinned


uccl_project@uccl_proj · Jun 12

1/N 📢 Introducing UCCL (Ultra & Unified CCL), an efficient collective communication library for ML training and inference, outperforming NCCL by up to 2.5x 🚀 Code: github.com/uccl-project/u… Blog: uccl-project.github.io/posts/about-uc… Results: AllReduce on 6 HGX across 2 racks over RoCE RDMA


uccl_project@uccl_proj · Jun 13

We've added an updated NCCL vs. UCCL performance figure, with an explanation of why NCCL's performance drops suddenly.



uccl_project@uccl_proj · Jun 12

Excited to share UCCL! RDMA networks are slow to evolve, causing performance bottlenecks for ML workloads. UCCL tackles this by moving more control logic into software, enabling better performance (up to 2.5x over NCCL) and more flexibility (support for different GPU/NIC vendors).



uccl_project@uccl_proj · Jun 12

Excited to release UCCL. Come build the next-gen AI/ML networking solution with us! Also, if you hit a networking problem (or something that looks like one), just talk to us and get it resolved quickly!

