You can utilise Ray cluster even if your notebook or script is written with Dask and Xarray.
Ray provides a scheduler for Dask (i.e. dask-on-ray) which allows you to build data analyses using Dask’s collections and execute the underlying tasks on a Ray cluster.
You can set up either a local Ray cluster or a pre-defined cluster across multiple nodes by following the instruction.
To enable dask-on-ray, simply call the helper "enable_dask_on_ray()" after initialising the ray cluster.
import ray |
For more details on using Dask on Ray, please refer to here.
You should not use the Dask.distributed client together with enable_dask_on_ray.