Coiled

@CoiledHQ

Scale Python with Dask

Everywhere

Joined February 2020

54Following

3KFollowers

Coiled@CoiledHQ · Mar 5

Easily configure shared memory size for CLI jobs with `--docker-shm-size`. Training PyTorch models on a GPU and need more memory? Ever run into "Error: No space left on device"? Customize Docker shared memory size with `--docker-shm-size`. docs.coiled.io/user_guide/cli…

CoiledHQ's tweet image. Easily configure shared memory size for CLI jobs with `--docker-shm-size`.

Training PyTorch models on a GPU and need more memory? Ever run into "Error: No space left on device"?

Customize Docker shared memory size with `--docker-shm-size`.

docs.coiled.io/user_guide/cli…

640

Coiled@CoiledHQ · Feb 25

🔨 Job setup option for Coiled Batch Use `--host-setup-script` to configure your VM before your batch job starts. Easily: ✅ Install dependencies ✅ Mount cloud storage ✅ Handle authentication or any other setup your jobs need. docs.coiled.io/user_guide/bat…

678

Coiled Retweeted

black.box@black_dot_box · Feb 23

~5 years ago I worked at a startup where we had multiple engineers screwing around for months with Terraform, Kubernetes, EKS, etc. just to get the same capabilities I got after an hour of playing around w/ Coiled. Pretty cool.

305

Coiled Retweeted

Matthew Rocklin@mrocklin · Jan 6

Coiled 2024 in Review docs.coiled.io/blog/2024-eoy.… It’s the time when companies issue year-end summaries, acclaiming success (or not), and forecasting incredible growth for the next year (or not). I thought I’d do something similar for Coiled. It’s been quite a year for us ...

2.0K

Coiled@CoiledHQ · Dec 19

Calculating quantiles, a common application in #geospatial workloads, used to be slow due to GIL contention in NumPy. The new implementation in @dask_dev + @xarray_dev is up to a hundred times faster and scales independently of the number of threads 🥳. docs.coiled.io/blog/array-qua…

1.0K

Coiled@CoiledHQ · Dec 5

We're big fans of rich for a nice terminal experience, but have found sometimes folks log things even rich can't handle. In the latest coiled=1.67.0 release, coiled logs automatically falls back to non-rich printing in these situations. Release notes: docs.coiled.io/user_guide/cha…

CoiledHQ's tweet image. We're big fans of rich for a nice terminal experience, but have found sometimes folks log things even rich can't handle.

In the latest coiled=1.67.0 release, coiled logs automatically falls back to non-rich printing in these situations.

Release notes: docs.coiled.io/user_guide/cha…

506

Coiled Retweeted

Matthew Rocklin@mrocklin · Dec 4

New Post: Cloud Computing is Broken matthewrocklin.com/cloud-is-broke… Investor asks: "What's next for Data/Cloud Infrastructure?" My answer: "Boring stuff. People struggle with basics." Cloud feels like MP3 players before iPod. In theory everything is good. In practice adoption is low

2.0K

Coiled@CoiledHQ · Dec 3

We're now on Bluesky! Should be pretty easy to find us, since bluesky lets us use our coiled.io domain as our handle ☀️

357

Coiled Retweeted

Xarray@xarray_dev · Nov 21

Read about the latest improvement to GroupBy.map with Dask: xarray.dev/blog/dask-detr… Thanks to Patrick Hoefler of @CoiledHQ for the great work here!

3.0K

Coiled Retweeted

Matthew Rocklin@mrocklin · Nov 21

New Post: SLURM-Style Job Arrays on the Cloud docs.coiled.io/blog/slurm-job… HPC Job scripts were the first form of parallelism I ever used as a graduate student. They're dead simple and accessible to almost anyone. We replicated the API with Coiled. It feels pretty slick to me 🙂

873

Coiled Retweeted

Nazari Goudin@nazari_goudin · Oct 12

@CoiledHQ is amazing. If you want to have distributed compute and provisioned infrastructure from the code - its easy as that. Forget @ApacheSpark and @awscloud Sagemaker, EMR

217

Coiled Retweeted

Quentin Lhoest 🤗@lhoestq · Oct 9

New blog post: Scale AI-based Data Processing EASY The FineWeb-Edu dataset comes from processing 45TB (🤯) of FineWeb And it uses a Language Model to classify the educational level of the text 😭😭 Still, we reproduced it in a few lines of code ! The key ? HF + Dask 😎

16.0K

Coiled Retweeted

Arpit Bansal@arpit__bansal · Sep 20

Implemented @CoiledHQ into our product to offload data syncing from BigQuery to Neo4j 🤯 Works like butter 🧈 Now I don’t have to worry about scaling VMs dynamically to handle variable loads.

417

Coiled Retweeted

Matthew Rocklin@mrocklin · Sep 10

We're to build a 100-TB scale geospatial benchmark suite docs.coiled.io/blog/geospatia… We've seen an uptick in geospatial users and in challenges of the Xarray/Dask stack to scale beyond ~500-GiB. This post presents a call for benchmark workloads.

5.0K

Coiled Retweeted

Earthmover@EarthmoverHQ · Aug 2

Arraylake and @CoiledHQ work great together! You can use Coiled to manage your cloud computing infrastructure with @dask_dev, and store your data as @zarr_dev in Arraylake. We just added new a documentation page about our integration with Coiled. docs.earthmover.io/integrations/c…

2.0K

Coiled@CoiledHQ · Jun 12, 2024

Run a Python script on a cloud GPU with one line of code. Training a @PyTorch model training takes ~10 minutes and cost ~$0.12 on the NVIDIA T4 GPU on AWS. Coiled handles provisioning hardware, setting up drivers, and installing CUDA-compiled PyTorch. docs.coiled.io/user_guide/gpu…

CoiledHQ's tweet image. Run a Python script on a cloud GPU with one line of code.

Training a @PyTorch model training takes ~10 minutes and cost ~$0.12 on the NVIDIA T4 GPU on AWS. Coiled handles provisioning hardware, setting up drivers, and installing CUDA-compiled PyTorch.

docs.coiled.io/user_guide/gpu…

687

Coiled Retweeted

Dask@dask_dev · Jun 3, 2024

Dask DataFrame is now 20x faster. Some of most prominent changes include: - Apache Arrow support in @pandas_dev - Better shuffling algorithm for faster joins - Automatic query optimization Learn more: docs.coiled.io/blog/dask-data…

3.0K

Coiled Retweeted

Matthew Rocklin@mrocklin · May 23, 2024

TPC-H Cloud Benchmarks: Spark, Dask, DuckDB, Polars Across scales: 10 GiB, 100 GiB, 1 TiB, 10 TiB Hardware: MBP and AWS It was a fun experiment. No project wins uniformly. DuckDB and Dask do pretty well. docs.coiled.io/blog/tpch.html

8.0K

Coiled Retweeted

Anthony Wu@anthonywu · May 12, 2024

Recommendation of the day: `coiled notebook start` to run a remote Jupyter Lab from big machines in cloud but with file sync that feel "local". Demo from @CoiledHQ youtu.be/mibhDHYun0M #python #jupyter

453

Coiled Retweeted

Uwe L. Korn@xhochy · May 2, 2024

On the 14th of May, @quantcotech Karlsruhe will host the next @PyDataSW Meetup. Florian Jetter will join us to talk about @dask_dev's impressive speed, and @pavelzw, Adrian, and @0xBE7A show how to manage hundreds of Python Sign up at meetup.com/pydata-suedwes…

1.0K