Introduction to loading data

Sign up to the DEA Sandbox to run this notebook interactively from a browser
Compatibility: Notebook currently compatible with both the NCI and DEA Sandbox environments
Products used: ga_ls7e_gm_cyear_3, ga_ls8cls9c_gm_cyear_3
Prerequisites: Users of this notebook should have a basic understanding of:
- How to run a Jupyter notebook
- The basic structure of the DEA satellite datasets
- Inspecting available DEA products and measurements

Background

Loading data from the Digital Earth Australia (DEA) instance of the Open Data Cube requires the construction of a query that specifies the what, where, and when of the data request. Each query returns a multi-dimensional xarray object containing the contents of your query. It is essential to understand the xarray data structures as they are fundamental to the structure of data loaded from the datacube. Manipulations, transformations and visualisation of xarray objects provide datacube users with the ability to explore and analyse DEA datasets, as well as pose and answer scientific questions.
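As a minimal sketch of the "what, where, and when" described above, a query can be written as a plain Python dictionary. The product name is taken from this notebook's product list; the coordinate ranges and dates are illustrative assumptions, and the commented-out call assumes a connected datacube object named dc.

```python
# A query expressed as a plain dictionary: "what", "where" and "when".
# The product name comes from this notebook's "Products used" list;
# the extent and dates below are illustrative assumptions only.
query = {
    "product": "ga_ls8cls9c_gm_cyear_3",   # what: the product to load
    "x": (153.3, 153.4),                   # where: longitude range (degrees)
    "y": (-27.5, -27.6),                   # where: latitude range (degrees)
    "time": ("2015-01-01", "2015-12-31"),  # when: start and end dates
}

# Print each part of the query so it can be checked before loading
for key, value in query.items():
    print(f"{key}: {value}")

# With a connected datacube object `dc`, the query would be passed as:
# ds = dc.load(**query)
# Depending on the product definition, `output_crs` and `resolution`
# may also need to be supplied.
```

Keeping the query in a dictionary makes it easy to inspect and to reuse across several load calls.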

Description

This notebook will introduce how to load data from the DEA datacube through the construction of a query and use of the dc.load() function. Topics covered include:

- Loading data using dc.load()
- Interpreting the resulting xarray.Dataset object
  - Inspecting an individual xarray.DataArray
- Customising parameters passed to the dc.load() function
  - Loading specific measurements
  - Loading data for coordinates in a custom coordinate reference system (CRS)
  - Projecting data to a new CRS and spatial resolution
  - Specifying a spatial resampling method
- Loading data using a reusable dictionary query
- Loading matching data from multiple products using like
- Adding a progress bar to the data load
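The customisation topics listed above correspond to keyword arguments of dc.load() such as measurements, output_crs, resolution and resampling. The sketch below shows one way to layer those options onto a base query without modifying it. The helper customise_query is hypothetical (it is not part of datacube), and the measurement names, CRS, resolution and resampling values are illustrative assumptions rather than recommended settings.

```python
def customise_query(base_query, **options):
    """Return a copy of base_query with extra dc.load() options added.

    Hypothetical helper for illustration; options set to None are skipped.
    """
    query = dict(base_query)  # copy so the base query is left untouched
    query.update({k: v for k, v in options.items() if v is not None})
    return query


# Base query: product from this notebook's product list; extent and
# dates are illustrative assumptions.
base = {
    "product": "ga_ls8cls9c_gm_cyear_3",
    "x": (153.3, 153.4),
    "y": (-27.5, -27.6),
    "time": ("2015-01-01", "2015-12-31"),
}

custom = customise_query(
    base,
    measurements=["red", "green", "blue"],  # assumed measurement names
    output_crs="EPSG:3577",                 # assumed target CRS
    resolution=(-250, 250),                 # assumed (y, x) pixel size in CRS units
    resampling="bilinear",                  # assumed resampling method
)

print(sorted(custom))

# With a connected datacube object `dc`:
# ds = dc.load(**custom)
```

Because the helper returns a new dictionary, the same base query can be reused with different combinations of options.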

Getting started

To run this introduction to loading data from DEA, run all the cells in the notebook starting with the “Load packages” cell. For help with running notebook cells, refer back to the Jupyter Notebooks notebook.

Load packages

The datacube package is required to query the datacube database and load some data. The with_ui_cbk function from odc.ui enables a progress bar when loading large amounts of data.

import datacube
from odc.ui import with_ui_cbk

Connect to the datacube

The next step is to connect to the datacube database. The resulting dc datacube object can then be used to load data. The app parameter is a unique name that identifies the notebook; it has no effect on the analysis.

dc = datacube.Datacube(app="04_Loading_data")
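Once a datacube object exists, the same where-and-when query can be reused for each product listed at the top of this notebook. The sketch below is one possible way to do that. The function load_each_product and the FakeDatacube class are hypothetical: the stand-in class only records the arguments it receives so the example runs without a database connection, and the extent and dates are illustrative assumptions.

```python
class FakeDatacube:
    """Hypothetical stand-in for a datacube object.

    It exists only so this sketch runs without a database; it records
    the keyword arguments passed to load() instead of loading data.
    """

    def __init__(self):
        self.calls = []

    def load(self, **kwargs):
        self.calls.append(kwargs)
        # A real dc.load() call returns an xarray.Dataset
        return {"loaded": kwargs["product"]}


def load_each_product(dc, products, query, **load_kwargs):
    """Call dc.load() once per product, reusing the same query."""
    return {
        product: dc.load(product=product, **query, **load_kwargs)
        for product in products
    }


# Products from this notebook's "Products used" list
products = ["ga_ls7e_gm_cyear_3", "ga_ls8cls9c_gm_cyear_3"]

# Reusable where/when query; extent and dates are assumptions
query = {
    "x": (153.3, 153.4),
    "y": (-27.5, -27.6),
    "time": ("2015-01-01", "2015-12-31"),
}

fake_dc = FakeDatacube()
results = load_each_product(fake_dc, products, query)
print(list(results))
```

With the real dc object created above, the call would be load_each_product(dc, products, query), and a progress bar could be requested by also passing progress_cbk=with_ui_cbk().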

Recommended next steps

The notebooks in this beginner’s guide are designed to be worked through in order.

Once you have worked through the beginner’s guide, you can join advanced users by exploring:

- A demonstration of how to load cloud-free observations in the Using load_ard notebook.
- The “DEA products” directory in the repository, where you can explore DEA products in depth.
- The “How_to_guides” directory, which contains a recipe book of common techniques and methods for analysing DEA data.
- The “Real_world_examples” directory, which provides more complex workflows and analysis case studies.

Additional information

License: The code in this notebook is licensed under the Apache License, Version 2.0. Digital Earth Australia data is licensed under the Creative Commons Attribution 4.0 license.

Contact: If you need assistance, please post a question on the Open Data Cube Discord chat or on the GIS Stack Exchange using the open-data-cube tag (you can view previously asked questions here). If you would like to report an issue with this notebook, you can file one on GitHub.

Last modified: June 2024

Compatible datacube version:

print(datacube.__version__)

1.8.18
