In[3]: df.select("attributes.institution_id",\
"attributes.experiment_id")\
.distinct().show(40,truncate=False)
+--------------+--------------+
|institution_id|experiment_id |
+--------------+--------------+
|NCC |ssp585 |
|NCAR |esm-piControl |
|MRI |esm-hist |
|NOAA-GFDL |ssp370 |
|E3SM-Project |ssp585 |
|UA |ssp245 |
|NUIST |ssp245 |
|UA |ssp370 |
|CNRM-CERFACS |ssp245 |
|NOAA-GFDL |ssp585 |
|CMCC |abrupt-4xCO2 |
|MIROC |ssp534-over |
|CAMS |1pctCO2 |
|CNRM-CERFACS |ssp126 |
|NASA-GISS |piClim-histall|
|CMCC |piControl |
|MRI |historical |
|BCC |piControl |
|MPI-M |esm-hist |
|NCAR |historical |
|MIROC |ssp126 |
|MOHC |piClim-histall|
|CAMS |piControl |
|MIROC |ssp370 |
|MOHC |ssp534-over |
|MPI-M |amip |
|NOAA-GFDL |ssp126 |
|CCCma |piClim-histall|
|CAMS |abrupt-4xCO2 |
|CAMS |amip |
|CNRM-CERFACS |ssp370 |
|KIOST |ssp126 |
|CNRM-CERFACS |ssp585 |
|NCAR |piClim-control|
|UA |ssp126 |
|MOHC |ssp119 |
|CMCC |amip |
|MRI |abrupt-4xCO2 |
|NUIST |ssp126 |
|NASA-GISS |piClim-control|
+--------------+--------------+
only showing top 40 rows |