CMIP data located at the National Computational Infrastructure (NCI) covers a broad range of datasets, including CMIP5 and CMIP6 era replicated and published data as well as key observational and reanalysis datasets. For further information on the different climate datasets and available variables, see the Datasets and Available Variables page. This page describes the ways in which the CMIP data hosted at NCI can be accessed.
Image Source: https://www.earthsystemcog.org/projects/cmip/
Access Options
Please note: External users who do not have an NCI user account are directed to option 2 below. Alternatively you can find out more about how to access NCI.
The CMIP and related data are available via three key methods:
1. Access on the NCI filesystem.
Users with an NCI login can access the data directly on the NCI filesystem from both the Gadi HPC system and the ARE Virtual Desktop Infrastructure (VDI).
- Users are encouraged to use Intake (or Intake-ESM), a Python API that eases searching of the CMIP datasets on NCI.
2. The ESGF data portal.
The Earth System Grid Federation (ESGF) data portal provides access to CMIP datasets published by NCI and hosted across the international data nodes. If you cannot use the data on Gadi or the ARE VDI, you may search for the CMIP data from the ESGF site. Note that not all replica data are currently published on the NCI ESGF site.
3. The Geonetwork catalog.
You may search for the CMIP data published at NCI via the Geonetwork metadata catalog, which also provides links to data services on the THREDDS Data Service.
Australian published CMIP5 data is available on Geonetwork under: https://geonetwork.nci.org.au/geonetwork/srv/eng/catalog.search#/metadata/f3525_9322_8600_7716.
Direct Data Access
Primary access to the CMIP data, from both Gadi and the ARE VDI, is obtained by requesting to join the relevant project (see below) and then reading the data from /g/data/<project code> (see table below). Note: this approach requires an NCI account and a computational project through one of the NCI access schemes.
| | Australian Published CMIP Data | Replicated CMIP Data | Replicated Observational and Reanalysis Data |
|---|---|---|---|
| NCI Project Code | fs38 = CMIP6-era; rr3 = CMIP5-era (incl. CORDEX) | oi10 = CMIP6-era; al33 = CMIP5-era (incl. CORDEX); cb20 = CMIP3 | qv56 = input4MIPs, obs4MIPs, ana4MIPs |
Please note: New version updates for qv56 datasets will not be actioned for the remainder of 2020. Please contact help@nci.org.au (Subject: Data Collections) if you have any questions about these datasets.
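As a quick check after joining a project, the following minimal sketch reports which of the project directories listed in the table are readable from your session. It assumes only that each collection is mounted at /g/data/<project code>, as described above; the function name `readable_projects` and the description strings are illustrative, not part of any NCI tool.

```python
import os

# Project codes as listed in the table above; descriptions are paraphrased.
CMIP_PROJECTS = {
    "fs38": "Australian published CMIP6-era data",
    "rr3": "Australian published CMIP5-era data (incl. CORDEX)",
    "oi10": "Replicated CMIP6-era data",
    "al33": "Replicated CMIP5-era data (incl. CORDEX)",
    "cb20": "Replicated CMIP3 data",
    "qv56": "Replicated input4MIPs, obs4MIPs and ana4MIPs data",
}


def readable_projects(root="/g/data"):
    """Return {project_code: bool}: True if the project directory can be listed."""
    return {
        code: os.access(os.path.join(root, code), os.R_OK | os.X_OK)
        for code in CMIP_PROJECTS
    }


if __name__ == "__main__":
    for code, ok in readable_projects().items():
        status = "readable" if ok else "not accessible"
        print(f"{code}: {CMIP_PROJECTS[code]} -> {status}")
```

A directory reported as not accessible usually means that membership of that project has not yet been requested or approved.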
Request to Access a Data Collection
You may request to join a data collection through my.nci.org.au/mancini. Your request will be sent to the data collection manager for approval, and you must agree to the same Terms and Conditions that govern CMIP data access as stipulated by the Earth System Grid Federation.
Intake and Intake-ESM
The volume and complexity of CMIP make manual searching for data on the filesystem a time-consuming process. The datasets have been indexed using a general Intake scheme (described here) and a minimal scheme using Intake-ESM (described here). To use the indexed datasets, membership of the relevant NCI CMIP data collections is also required; the collection project codes are listed in the Direct Data Access section of this page.
CleF
The Climate Finder (CleF) indexing tool developed by ARCCSS and CLEX is now superseded by our use of Intake (described above). Documentation on using CleF is available here.
Data Organisation
To permit ease of use and interdisciplinary research, CMIP uses a standard naming convention for files, directories, metadata and URLs. These conventions are described in detail in the CMIP6 Controlled Vocabularies and the CMIP5 Controlled Vocabulary.
CMIP6 Published Data
Under the CMIP6 Data Reference Syntax (DRS), published data may be found with the following directory format:
/g/data/fs38/publications/CMIP6/CMIP/<institution_id>/<source_id>/<experiment_id>/<member_id>/<table_id>/<variable>/<grid_label>/<version>
CMIP6 Official Replica Data
Under the CMIP6 DRS, replica data may be found with the following directory format:
/g/data/oi10/replicas/CMIP6/CMIP/<institution_id>/<source_id>/<experiment_id>/<member_id>/<table_id>/<variable>/<grid_label>/<version>
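The two CMIP6 directory formats above differ only in their root, so a path can be assembled from a set of facets. The sketch below is a minimal illustration: the roots and facet order are copied from the formats above, while the function name `cmip6_dir` and the facet values in the usage example are assumptions chosen for illustration and are not checked against the filesystem.

```python
from pathlib import PurePosixPath

# Roots copied from the directory formats above.
CMIP6_ROOTS = {
    "published": "/g/data/fs38/publications/CMIP6/CMIP",
    "replica": "/g/data/oi10/replicas/CMIP6/CMIP",
}

# Facet order copied from the directory formats above.
CMIP6_FACETS = (
    "institution_id", "source_id", "experiment_id", "member_id",
    "table_id", "variable", "grid_label", "version",
)


def cmip6_dir(collection, **facets):
    """Build a CMIP6 directory path for 'published' or 'replica' data."""
    missing = [name for name in CMIP6_FACETS if name not in facets]
    if missing:
        raise ValueError(f"missing facets: {', '.join(missing)}")
    return PurePosixPath(
        CMIP6_ROOTS[collection], *(facets[name] for name in CMIP6_FACETS)
    )


if __name__ == "__main__":
    # Illustrative facet values only.
    print(cmip6_dir(
        "replica",
        institution_id="CSIRO", source_id="ACCESS-ESM1-5",
        experiment_id="historical", member_id="r1i1p1f1",
        table_id="Amon", variable="tas", grid_label="gn", version="latest",
    ))
```

Requiring every facet keeps the result a single concrete directory; for open-ended searches the Intake catalogues are the recommended route.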
CMIP5 and CORDEX Australian Published Data
The CMIP5 and CORDEX published data can be found with the following directory structure:
/g/data/rr3/publications/CMIP5/output1/<institute>/<model>/<experiment>/<frequency>/<realm>/<table>/<ensemble>/<version>/<variable>
CMIP5 Official Replica Data
The CMIP5 official replica data can be found with the following directory structure:
/g/data/al33/replicas/CMIP5/combined/<institute>/<model>/<experiment>/<frequency>/<realm>/<table>/<ensemble>/<version>/<variable>
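When only some CMIP5 facets are known, the directory structure above can be turned into a wildcard pattern. The following is a minimal sketch using only the Python standard library; the helper names `cmip5_pattern` and `find_cmip5_dirs` are hypothetical, and the facet order is taken from the structure above. Globbing a large filesystem can be slow, which is one reason the Intake catalogues are preferred for routine searches.

```python
import glob
import os

# Facet order copied from the CMIP5 directory structure above.
CMIP5_FACETS = (
    "institute", "model", "experiment", "frequency", "realm",
    "table", "ensemble", "version", "variable",
)

CMIP5_REPLICA_ROOT = "/g/data/al33/replicas/CMIP5/combined"


def cmip5_pattern(root=CMIP5_REPLICA_ROOT, **facets):
    """Return a glob pattern with '*' for every facet that is not given."""
    unknown = set(facets) - set(CMIP5_FACETS)
    if unknown:
        raise ValueError(f"unknown facets: {sorted(unknown)}")
    parts = [facets.get(name, "*") for name in CMIP5_FACETS]
    return os.path.join(root, *parts)


def find_cmip5_dirs(root=CMIP5_REPLICA_ROOT, **facets):
    """Return the sorted list of directories matching the given facets."""
    return sorted(glob.glob(cmip5_pattern(root, **facets)))


if __name__ == "__main__":
    # Illustrative query: monthly atmospheric 'tas' from the historical experiment.
    print(cmip5_pattern(experiment="historical", table="Amon", variable="tas"))
```

The same approach applies to the rr3 published data by passing a different `root`.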
CMIP3 Official Replica Data
The CMIP3 official replica data can be found with the following directory structure:
/g/data/cb20/replicas/cmip3/<institute>/<model>/<experiment>/<frequency>/<realm>/<ensemble>/<variable>
Note some of the key differences in the facets between CMIP5 and CMIP6: in particular, "model" in CMIP5 has become "source_id" and "ensemble" has become "member_id". Other facets have similar names, though in CMIP6 the name often ends in "_id".
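Scripts that handle both generations of data may find it convenient to translate CMIP5 facet names into their CMIP6 equivalents. The sketch below is a minimal illustration built from the name pairs given on this page; the function name `to_cmip6_names` is hypothetical. It renames keys only and leaves values untouched, so it does not convert identifiers between the two conventions.

```python
# CMIP5 facet name -> CMIP6 facet name, using the pairs given on this page.
CMIP5_TO_CMIP6 = {
    "institute": "institution_id",
    "model": "source_id",
    "experiment": "experiment_id",
    "ensemble": "member_id",
    "table": "table_id",
    "variable": "variable_id",
}


def to_cmip6_names(cmip5_facets):
    """Return a copy of the facet dict with CMIP5 keys renamed to CMIP6 keys.

    Keys without a CMIP6 counterpart in the mapping are kept unchanged.
    """
    return {CMIP5_TO_CMIP6.get(key, key): value for key, value in cmip5_facets.items()}


if __name__ == "__main__":
    # Illustrative facet values only.
    print(to_cmip6_names({"model": "ACCESS1-0", "ensemble": "r1i1p1", "frequency": "mon"}))
```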
Definitions for directory format terms:
- institute/institution_id: the institution that produced the model output (e.g. CSIRO-BOM, UNSW, etc.)
- model/source_id: the CMIP model name identifier ("model" in CMIP5, "source_id" in CMIP6).
- experiment/experiment_id: the CMIP experiment identifier (e.g., historical, piControl, rcp45, etc.)
- frequency: frequency identifier (e.g., 3hr, day, mon, etc.)
- realm: modelling realm (e.g., atmos, land, ocean, etc.)
- ensemble/member_id: the ensemble member identifier, which encodes the realisation, initialisation and physics indices (e.g., r1i1p1, r1i1p2, etc.)
- variable/variable_id: output variable (see full list here)
- version: the available versions, plus 'latest', which is a symbolic link to the latest available version. Where a version is not available, the creation date is used instead.
- table/table_id: the output table identifier (e.g., "Amon" is short for atmosphere monthly, "Omon" is ocean monthly, etc.)
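The ensemble/member_id strings described above can be split into their numeric indices. This is a minimal sketch, not an official parser: the function name `parse_member` is hypothetical, and the optional trailing forcing index ("f", as in r1i1p1f1) reflects the CMIP6 convention rather than anything stated on this page. CMIP6 member_ids that carry a sub-experiment prefix are not handled.

```python
import re

# r<realisation>i<initialisation>p<physics>, with an optional f<forcing> (CMIP6).
_MEMBER_RE = re.compile(r"^r(\d+)i(\d+)p(\d+)(?:f(\d+))?$")


def parse_member(member):
    """Split an ensemble/member_id string into its integer indices."""
    match = _MEMBER_RE.match(member)
    if match is None:
        raise ValueError(f"unrecognised ensemble/member_id: {member!r}")
    realisation, initialisation, physics, forcing = match.groups()
    return {
        "realisation": int(realisation),
        "initialisation": int(initialisation),
        "physics": int(physics),
        # None for CMIP5-style identifiers, which have no forcing index.
        "forcing": None if forcing is None else int(forcing),
    }


if __name__ == "__main__":
    print(parse_member("r1i1p2"))
```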
Guidance for Data Users
The following guidance material for CMIP6 data users is provided through PCMDI. It includes information on experiment design, model output, terms of use and citation, model documentation, error reporting, registering publications which use CMIP6 data, and CMIP6 governance.
https://pcmdi.llnl.gov/CMIP6/Guide/dataUsers.html
For more information on the CMIP and CORDEX data, including data formats and processing, see ESGF User Support - Data FAQs.