Get data

Now we have a solid plan to start the project. According to Table 4 of [HTL+21], the only ICESat-2 data file (a.k.a. granule) we need to get is ATL03_20190805232841_05940403_002_01.h5.

Traditionally, what we would do is to go to a data hosting service such as Earthdata Search and search for the exact file. While this approach works for this project, there are several drawbacks for a general data-search case:

  1. It is hard to reproduce the data query process using these services. Researchers might be confused by the complexity of the website and struggle to find the right data.

  2. Not every data hosting sites provide the bulk access/download service. If not, retrieve large volume of data from them is time-consuming and requires manual and frequent checks.

  3. This step is often disconnected from the following data processing, which means extra efforts for researchers to check and assimilate their data.

Tools in the Jupyter ecosystem (here Icepyx) can mitigate these issues as they are executed on a Jupyter Notebook. So all the data query and download steps can be scripted and documented. As we continue to use Notebook, accessing data can better connect to the rest of the research stages.

Goals

Download the ICESat-2 granule we need.

Steps

Firstly we need Icepyx, a Python package for obtaining and working with ICESat-2 data:

import icepyx