...
Show the unique values of a single column, "attributes.experiment_id" or "attributes.institution_id" |
|
---|
Code Block |
---|
In[3]: df.select("attributes.experiment_id")\ |
In[3]: df.select("attributes.institution_id")\ |
| Code Block |
---|
In[3]: df.select("attributes.institution_id",\ |
"attributes.experiment_id")\ |
.distinct().show(40, truncate=False) |
+--------------+--------------+ |
|institution_id|experiment_id | |
+--------------+--------------+ |
|NASA-GISS |piClim-histall| |
|NASA-GISS |piClim-control| |
+--------------+--------------+ |
|
By default, show() prints only the first 20 rows and truncates long strings. Pass a larger number as the first argument to display more rows, and set "truncate=False" to display each value at its full length.
...