...

Read Dataset from multiple NetCDF files

Before using Xarray to load a dataset from NetCDF files, we suggest setting up a Dask cluster to enable parallel processing.
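As a rough illustration of this step, the sketch below starts a small local Dask cluster and runs a trivial task on it. The helper name start_local_cluster and the worker counts are assumptions chosen for the example, not settings from this page; a real deployment would size the cluster to the available cores and memory.

```python
from dask.distributed import Client, LocalCluster


def start_local_cluster(n_workers=2, threads_per_worker=1):
    """Start a small in-process Dask cluster (hypothetical helper)."""
    cluster = LocalCluster(
        n_workers=n_workers,              # assumed value, adjust to your node
        threads_per_worker=threads_per_worker,
        processes=False,                  # thread-based workers keep the sketch lightweight
        dashboard_address=None,           # disable the dashboard for this example
    )
    client = Client(cluster)
    return cluster, client


cluster, client = start_local_cluster()

# Submit a trivial task to confirm the workers accept work.
total = client.submit(sum, range(10)).result()

# Release the workers once finished.
client.close()
cluster.close()
```

Once a client is active, Xarray operations on Dask-backed (chunked) arrays are scheduled on that cluster by default.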

...

The variable "w" has four dimensions and contains three months of data, sliced from the one-year dataset loaded from the directory "/g/data/rt52/era5/pressure-levels/monthly-averaged/w/2020/".
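To make the slicing concrete, here is a minimal sketch that uses a small synthetic dataset in place of the real NetCDF files. The dimension names (time, level, latitude, longitude), the grid sizes, and the choice of January to March are assumptions for illustration only; they are not taken from the dataset above.

```python
import numpy as np
import pandas as pd
import xarray as xr

# Synthetic stand-in for one year of monthly-averaged data (12 time steps).
time = pd.date_range("2020-01-01", periods=12, freq="MS")
level = [500, 850, 1000]                       # assumed pressure levels
lat = np.linspace(-90, 90, 5)                  # assumed coarse grid
lon = np.linspace(0, 360, 8, endpoint=False)

data = np.zeros((len(time), len(level), len(lat), len(lon)), dtype="float32")
ds = xr.Dataset(
    {"w": (("time", "level", "latitude", "longitude"), data)},
    coords={"time": time, "level": level, "latitude": lat, "longitude": lon},
)

# Label-based slice along time; both endpoints are inclusive.
w = ds["w"].sel(time=slice("2020-01", "2020-03"))

print(w.dims, w.shape)
```

With real files, the dataset would typically come from xr.open_mfdataset() on the directory contents instead of being built in memory, and the same .sel() call would apply.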

Store the dataset into TileDB arrays

We can simply use the tiledb.cf.from_xarray() method to store an Xarray dataset in a TileDB object, as shown below.

...

The directory structure of the TileDB group is shown below. At this stage, no actual data has been stored in the TileDB group yet.

Open TileDB group

You can validate the TileDB group by loading the variable "w" from the TileDB arrays.

...