Introduction to xarray

Prerequisites: Users of this notebook should have a basic understanding of:
- How to run a Jupyter notebook
- How to work with numpy

Keywords: beginner’s guide; xarray, python package; xarray

Background

Xarray is a Python library that simplifies working with labelled multi-dimensional arrays. It adds labels in the form of dimensions, coordinates and attributes on top of raw numpy arrays, allowing for more intuitive and concise development. More information about xarray data structures and functions can be found here.
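As a rough illustration of what these labels add, the sketch below wraps a plain numpy array in an xarray DataArray. The dimension names, coordinate values and attributes are invented for this example and do not come from any Digital Earth Africa product.

```python
import numpy as np
import xarray as xr

# A plain numpy array: 2 time steps of a 3 x 4 grid, with no labels attached
raw = np.arange(24).reshape(2, 3, 4)

# Wrap it in a DataArray, naming each axis and labelling its positions
# (names, coordinate values and attributes are made up for illustration)
da = xr.DataArray(
    raw,
    dims=("time", "y", "x"),
    coords={
        "time": np.array(["2020-01-01", "2020-02-01"], dtype="datetime64[ns]"),
        "y": [30.0, 20.0, 10.0],
        "x": [0.0, 10.0, 20.0, 30.0],
    },
    attrs={"units": "reflectance", "source": "synthetic example"},
)

# With numpy, the reader has to remember that axis 0 is time
numpy_mean = raw.mean(axis=0)

# With xarray, the same reduction is written by dimension name
xarray_mean = da.mean(dim="time")

print(da.dims)
print(da.attrs["units"])
print(xarray_mean.dims)
```

Both reductions give the same numbers; the xarray version simply records which dimension was averaged away and which remain.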

Once you’ve completed this notebook, you may want to advance your xarray skills further; this external notebook introduces more uses of xarray.

Description

This notebook is designed to introduce users to xarray using Python code in Jupyter Notebooks via JupyterLab.

Topics covered include:

- How to use xarray functions in a Jupyter Notebook cell
- How to access xarray dimensions and metadata
- Using indexing to explore multi-dimensional xarray data
- Application of built-in xarray functions such as sum, std and mean
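The topics above can be previewed on a small synthetic dataset. This is only a sketch: the dataset called `demo`, its band names and its values are assumptions made for illustration, not the data loaded elsewhere in this guide.

```python
import numpy as np
import xarray as xr

# Synthetic stand-in for satellite data: 3 time steps on a 4 x 5 grid
rng = np.random.default_rng(0)
times = np.array(
    ["2020-01-01", "2020-02-01", "2020-03-01"], dtype="datetime64[ns]"
)
demo = xr.Dataset(
    {
        band: (("time", "y", "x"), rng.integers(0, 3000, size=(3, 4, 5)))
        for band in ("red", "green", "blue")
    },
    coords={"time": times, "y": np.arange(4), "x": np.arange(5)},
    attrs={"note": "synthetic example data"},
)

# Dimensions and metadata
print(demo.sizes)   # mapping of dimension name to length
print(demo.attrs)   # dataset-level attributes

# Indexing: by integer position (isel) or by coordinate label (sel)
first_by_position = demo["red"].isel(time=0)
first_by_label = demo["red"].sel(time=times[0])

# Built-in reductions over a named dimension
red_sum = demo["red"].sum(dim="time")
red_std = demo["red"].std(dim="time")
red_mean = demo["red"].mean(dim="time")
print(red_mean.dims)
```

Here `isel` and `sel` return the same slice, because position 0 along `time` carries the label `times[0]`.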

Getting started

To run this notebook, run all the cells in the notebook starting with the “Load packages” cell. For help with running notebook cells, refer back to the Jupyter Notebooks notebook.

Load packages

[1]:

%matplotlib inline
import matplotlib.pyplot as plt
import numpy as np
import xarray as xr

Plotting data with Matplotlib

Plotting is also conveniently integrated in the library.

[14]:

ds["green"].isel(time=0).plot()

[14]:

<matplotlib.collections.QuadMesh at 0x7f88d342bd30>

[Figure: the green band at the first time step, plotted as a colour mesh]
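The cell above returns a QuadMesh because selecting a single time step leaves a two-dimensional array, and xarray's default plot for 2-D data is a pcolormesh. The sketch below reproduces this on made-up data. The array `band` is hypothetical, and the `matplotlib.use("Agg")` line is only there so the example also runs outside a notebook.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; not needed inside a notebook
import matplotlib.pyplot as plt
import numpy as np
import xarray as xr

# Synthetic single-band data: 2 time steps on a 4 x 5 grid (made-up values)
band = xr.DataArray(
    np.arange(40.0).reshape(2, 4, 5),
    dims=("time", "y", "x"),
    coords={"time": [0, 1], "y": np.arange(4), "x": np.arange(5)},
    name="green",
)

# Selecting one time step leaves a 2-D (y, x) array
single = band.isel(time=0)

# For 2-D data, the default plot method draws a pcolormesh
mesh = single.plot()

print(single.dims)
print(type(mesh).__name__)
```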

We can still do things manually using numpy and matplotlib if we choose:

[15]:

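# Stack the red, green and blue bands of the first time step into a (y, x, 3) array
rgb = np.dstack((ds.red.isel(time=0).values, ds.green.isel(time=0).values, ds.blue.isel(time=0).values))
# Clip values to 0-2000, then scale to the 0-1 range expected for display
rgb = np.clip(rgb, 0, 2000) / 2000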

plt.imshow(rgb);

[Figure: RGB composite of the first time step, built manually with numpy and matplotlib]
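To see what the stacking and clipping steps do to the numbers, here is a minimal numpy-only sketch. The three tiny 2 x 2 bands are invented values chosen to include some below 0 and above 2000.

```python
import numpy as np

# Three made-up 2 x 2 bands standing in for red, green and blue
red = np.array([[0, 500], [1000, 4000]])
green = np.array([[100, 600], [1100, 2000]])
blue = np.array([[-50, 700], [1200, 1500]])

# dstack joins the 2-D bands along a new third axis: (y, x) -> (y, x, 3)
rgb = np.dstack((red, green, blue))

# Clip to 0-2000, then divide so every value lies in the 0-1 range
# that matplotlib's imshow expects for floating-point RGB images
scaled = np.clip(rgb, 0, 2000) / 2000

print(rgb.shape)
print(scaled.min(), scaled.max())
```

The out-of-range inputs (4000 and -50) end up at the limits of the 0 to 1 range rather than distorting the scaling of the other pixels.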

Compare the above with chaining the same operations within xarray:

[16]:

ds[['red', 'green', 'blue']].isel(time=0).to_array().plot.imshow(robust=True, figsize=(6, 6));

[Figure: RGB composite of the first time step, produced by chaining xarray operations]
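The key step in the chained version is `to_array()`, which combines the selected data variables into one DataArray along a new dimension named "variable"; `plot.imshow` can then treat that length-3 dimension as the colour channels. The sketch below shows the intermediate result on a synthetic dataset. The `demo` dataset and its values are assumptions for illustration only.

```python
import numpy as np
import xarray as xr

# Synthetic dataset with three bands (made-up values)
rng = np.random.default_rng(1)
demo = xr.Dataset(
    {
        band: (("time", "y", "x"), rng.integers(0, 3000, size=(2, 4, 5)))
        for band in ("red", "green", "blue")
    },
    coords={"time": [0, 1], "y": np.arange(4), "x": np.arange(5)},
)

# Select the three bands and one time step, then stack the bands
# into a single DataArray along a new dimension called "variable"
stacked = demo[["red", "green", "blue"]].isel(time=0).to_array()

print(stacked.dims)
print(stacked["variable"].values)
```

In the original cell, `robust=True` asks xarray to set the colour limits from the 2nd and 98th percentiles of the data, which plays a similar role to the manual clip to 0-2000 in the numpy version.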

Recommended next steps

For more advanced information about working with Jupyter Notebooks or JupyterLab, you can explore the JupyterLab documentation page.

The notebooks in this beginner’s guide are designed to be worked through in the following order:

1. Jupyter Notebooks
2. Products and Measurements
3. Loading data
4. Plotting
5. Performing a basic analysis
6. Introduction to numpy
7. Introduction to xarray (this notebook)
8. Parallel processing with Dask

Once you have completed the above eight tutorials, join advanced users in exploring:

- The “Datasets” directory in the repository, where you can explore DE Africa products in depth.
- The “Frequently used code” directory, which contains a recipe book of common techniques and methods for analysing DE Africa data.
- The “Real-world examples” directory, which provides more complex workflows and analysis case studies.

Additional information

License: The code in this notebook is licensed under the Apache License, Version 2.0. Digital Earth Africa data is licensed under the Creative Commons by Attribution 4.0 license.

Contact: If you need assistance, please post a question on the Open Data Cube Slack channel or on the GIS Stack Exchange using the open-data-cube tag (you can view previously asked questions here). If you would like to report an issue with this notebook, you can file one on GitHub.

Last Tested:

[17]:

from datetime import datetime
datetime.today().strftime('%Y-%m-%d')

[17]:

'2023-08-11'
