NetCDF: Code Example#

This notebook demonstrates four common methods for reading NetCDF climate datasets using xarray. These examples correspond to the approaches outlined in the Method Overview page.

The dataset files are stored in the ../data/ directory.

Included Approaches:#

  1. Reading a single NetCDF file using xr.open_dataset()

  2. Reading multiple files with wildcards using xr.open_mfdataset()

  3. Reading files by year range using a filename template (e.g., YYYY)

  4. Downsampling high-resolution data using either isel() or coarsen()

import xarray as xr
import matplotlib.pyplot as plt
import os

# === PARAMETERS ===
data_dir = "../data"
file_slp = "slp.ncep.194801-202504.nc"
file_hgt_all = "hgt_ncep_daily.*.nc"
file_hgt_year = "hgt_ncep_daily.YYYY.nc"
file_sst = "sst.oisst_high.198109-202504.nc"

Approach 1: Reading a Single File with open_dataset()#

# Define the start and end years
ystr, yend = 1991, 2020

# === Step 1: Construct full path and open dataset
path_slp = os.path.join(data_dir, file_slp)
ds1 = xr.open_dataset(path_slp)

# === Step 2: Select time range from ystr to yend
slp = ds1["slp"].sel(time=slice(f"{ystr}-01-01", f"{yend}-12-31"))

# === Step 3: Preview result
slp
<xarray.DataArray 'slp' (time: 360, lat: 73, lon: 144)> Size: 15MB
[3784320 values with dtype=float32]
Coordinates:
  * lat      (lat) float32 292B 90.0 87.5 85.0 82.5 ... -82.5 -85.0 -87.5 -90.0
  * lon      (lon) float32 576B 0.0 2.5 5.0 7.5 10.0 ... 350.0 352.5 355.0 357.5
  * time     (time) datetime64[ns] 3kB 1991-01-01 1991-02-01 ... 2020-12-01
Attributes:
    long_name:     Sea Level Pressure
    valid_range:   [ 870. 1150.]
    units:         millibars
    precision:     1
    var_desc:      Sea Level Pressure
    level_desc:    Sea Level
    statistic:     Mean
    parent_stat:   Other
    dataset:       NCEP Reanalysis Derived Products
    actual_range:  [ 955.56085 1082.5582 ]
# Plot the first time slice
slp.isel(time=0).plot(cmap="coolwarm")
plt.title(f"Sea-Level Pressure on {str(slp.time.values[0])[:10]}")
plt.show()
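Note that, unlike positional slicing, a label-based `slice` on a datetime index includes both endpoints. A minimal sketch with a synthetic monthly series (the variable names here are illustrative, not from the dataset above):

```python
import numpy as np
import pandas as pd
import xarray as xr

# Synthetic monthly series covering 1990-2021
times = pd.date_range("1990-01-01", "2021-12-01", freq="MS")
da = xr.DataArray(np.arange(times.size, dtype=float),
                  coords={"time": times}, dims="time")

# Label-based slicing: both endpoints are included
subset = da.sel(time=slice("1991-01-01", "2020-12-31"))
print(subset.sizes["time"])  # 30 years x 12 months = 360
```

This is why the 1991–2020 selection above yields exactly 360 monthly time steps.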

Approach 2: Reading Multiple Files with open_mfdataset()#

This method is useful when data is split into multiple NetCDF files by year or month. Here, we combine geopotential height data from 2018–2020 using a wildcard pattern.

plev = 500

# === Step 1: Build full path pattern
hgt_path = os.path.join(data_dir, file_hgt_all)

# === Step 2: Open multiple files using wildcard
ds2 = xr.open_mfdataset(hgt_path, combine="by_coords", parallel=True)

# === Step 3: Select 500 hPa level
z500 = ds2["hgt"].sel(level=plev)

# === Step 4: Preview result
z500
<xarray.DataArray 'hgt' (time: 1096, lat: 73, lon: 144)> Size: 46MB
dask.array<getitem, shape=(1096, 73, 144), dtype=float32, chunksize=(1, 73, 144), chunktype=numpy.ndarray>
Coordinates:
    level    float32 4B 500.0
  * lat      (lat) float32 292B 90.0 87.5 85.0 82.5 ... -82.5 -85.0 -87.5 -90.0
  * lon      (lon) float32 576B 0.0 2.5 5.0 7.5 10.0 ... 350.0 352.5 355.0 357.5
  * time     (time) datetime64[ns] 9kB 2018-01-01 2018-01-02 ... 2020-12-31
Attributes:
    long_name:     mean Daily Geopotential height
    units:         m
    precision:     0
    GRIB_id:       7
    GRIB_name:     HGT
    var_desc:      Geopotential height
    level_desc:    Multiple levels
    statistic:     Mean
    parent_stat:   Individual Obs
    valid_range:   [ -700. 35000.]
    dataset:       NCEP Reanalysis Daily Averages
    actual_range:  [ -523.75 32252.75]
z500.isel(time=0).plot(cmap="viridis")
plt.title("500 hPa Geopotential Height – First Time Step")
plt.show()
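Under the hood, `combine="by_coords"` aligns the per-file datasets along their shared coordinates, much as `xr.combine_by_coords` does for datasets opened individually. A small in-memory sketch with synthetic yearly pieces (variable names and grid are illustrative):

```python
import numpy as np
import pandas as pd
import xarray as xr

def make_year(year):
    """One year of synthetic daily 'hgt' data on a tiny grid."""
    times = pd.date_range(f"{year}-01-01", f"{year}-12-31", freq="D")
    data = np.random.rand(times.size, 3, 4).astype("float32")
    return xr.Dataset(
        {"hgt": (("time", "lat", "lon"), data)},
        coords={"time": times, "lat": [0.0, 2.5, 5.0],
                "lon": [0.0, 2.5, 5.0, 7.5]},
    )

# Combine three yearly datasets along the time coordinate
pieces = [make_year(y) for y in (2018, 2019, 2020)]
combined = xr.combine_by_coords(pieces)
print(combined.sizes["time"])  # 365 + 365 + 366 = 1096
```

The 1096 time steps match the 2018–2020 daily record shown above (2020 is a leap year).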

Approach 3: Reading Multiple NCEP Files by Year Range#

This method loads daily geopotential height (hgt) data from multiple yearly NCEP files for a specified range of years. It builds an explicit file list by substituting each year into the YYYY filename template, then uses xarray.open_mfdataset to combine the files into a single dataset for easy analysis.

# === PARAMETERS ===
ystr, yend = 2018, 2020
plev = 250

years = list(range(ystr, yend + 1))
file_list = [os.path.join(data_dir, file_hgt_year.replace("YYYY", str(y))) for y in years]

ds3 = xr.open_mfdataset(file_list, combine="by_coords")
z250 = ds3["hgt"].sel(level=plev)
z250
<xarray.DataArray 'hgt' (time: 1096, lat: 73, lon: 144)> Size: 46MB
dask.array<getitem, shape=(1096, 73, 144), dtype=float32, chunksize=(1, 73, 144), chunktype=numpy.ndarray>
Coordinates:
    level    float32 4B 250.0
  * lat      (lat) float32 292B 90.0 87.5 85.0 82.5 ... -82.5 -85.0 -87.5 -90.0
  * lon      (lon) float32 576B 0.0 2.5 5.0 7.5 10.0 ... 350.0 352.5 355.0 357.5
  * time     (time) datetime64[ns] 9kB 2018-01-01 2018-01-02 ... 2020-12-31
Attributes:
    long_name:     mean Daily Geopotential height
    units:         m
    precision:     0
    GRIB_id:       7
    GRIB_name:     HGT
    var_desc:      Geopotential height
    level_desc:    Multiple levels
    statistic:     Mean
    parent_stat:   Individual Obs
    valid_range:   [ -700. 35000.]
    dataset:       NCEP Reanalysis Daily Averages
    actual_range:  [ -523.75 32252.75]
z250.isel(time=0).plot(cmap="viridis")
plt.title("250 hPa Geopotential Height – First Time Step")
plt.show()
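After combining yearly files it is worth confirming that every requested year is fully present. One hedged sketch, using `groupby("time.year")` on a synthetic daily series standing in for the combined dataset:

```python
import numpy as np
import pandas as pd
import xarray as xr

# Synthetic daily series spanning the requested years
times = pd.date_range("2018-01-01", "2020-12-31", freq="D")
da = xr.DataArray(np.zeros(times.size), coords={"time": times}, dims="time")

# Count time steps per year to confirm complete coverage
counts = da.groupby("time.year").count()
print({int(y): int(c) for y, c in zip(counts.year.values, counts.values)})
```

A missing or truncated file would show up here as a year with fewer than 365 (or 366) entries.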

Approach 4: Downsampling High-Resolution SST Data#

This method demonstrates how to load a high-resolution sea surface temperature (SST) dataset and reduce its spatial resolution to speed up processing and simplify visualization. This is especially useful for global daily SST data.

We use the NOAA OISST dataset sst.oisst_high.198109-202504.nc, which is approximately 0.25° in spatial resolution.
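The two downsampling strategies used below behave differently: `isel` with a stride keeps every Nth grid point, while `coarsen(...).mean()` averages each N×N block. A quick comparison on a small synthetic field (shapes and names are illustrative):

```python
import numpy as np
import xarray as xr

# Tiny 8x8 field with known values 0..63
lat = np.arange(8, dtype=float)
lon = np.arange(8, dtype=float)
field = xr.DataArray(np.arange(64, dtype=float).reshape(8, 8),
                     coords={"lat": lat, "lon": lon}, dims=("lat", "lon"))

factor = 4
# Strided subsampling: keeps only points at indices 0 and 4 in each dim
sub = field.isel(lat=slice(None, None, factor), lon=slice(None, None, factor))
# Block averaging: mean over each 4x4 block
avg = field.coarsen(lat=factor, lon=factor, boundary="trim").mean()

print(sub.shape, avg.shape)               # both (2, 2)
print(float(sub[0, 0]), float(avg[0, 0]))  # 0.0 vs 13.5 (mean of first block)
```

Both reduce the grid by the same factor, but `isel` discards information between samples, whereas `coarsen` smooths it into each block's mean. For noisy fields like SST, block averaging is usually the safer choice.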

# === PARAMETERS ===
use_lowres = True
coarsen_factor = 4
method = "isel"  # options: "isel" or "coarsen"

# === Paths ===
data_path = os.path.join(data_dir, file_sst)
lowres_file = os.path.join(data_dir, file_sst.replace(".nc", f".low.{method}.nc"))

# === Load SST ===
if use_lowres and os.path.exists(lowres_file):
    print(f"📁 Found existing low-res SST: {lowres_file}")
    ds = xr.open_dataset(lowres_file)
else:
    print(f"⚙️ Creating low-resolution SST from: {data_path}")
    ds_full = xr.open_dataset(data_path)

    if method == "isel":
        ds_lowres = ds_full.isel(
            lat=slice(None, None, coarsen_factor),
            lon=slice(None, None, coarsen_factor)
        )
    elif method == "coarsen":
        ds_lowres = ds_full.coarsen(
            lat=coarsen_factor,
            lon=coarsen_factor,
            boundary='trim'
        ).mean()
    else:
        raise ValueError("`method` must be 'isel' or 'coarsen'.")

    ds_lowres.to_netcdf(lowres_file)
    print(f"✅ Saved low-res file to: {lowres_file}")
    ds = ds_lowres

# === Final variable: sst ===
sst = ds["sst"]
sst
📁 Found existing low-res SST: ../data/sst.oisst_high.198109-202504.low.isel.nc
<xarray.DataArray 'sst' (time: 524, lat: 180, lon: 360)> Size: 136MB
[33955200 values with dtype=float32]
Coordinates:
  * time     (time) datetime64[ns] 4kB 1981-09-01 1981-10-01 ... 2025-04-01
  * lat      (lat) float32 720B -89.88 -88.88 -87.88 ... 87.12 88.12 89.12
  * lon      (lon) float32 1kB 0.125 1.125 2.125 3.125 ... 357.1 358.1 359.1
Attributes:
    long_name:      Monthly Mean of Sea Surface Temperature
    units:          degC
    valid_range:    [-3. 45.]
    precision:      2.0
    dataset:        NOAA High-resolution Blended Analysis
    var_desc:       Sea Surface Temperature
    level_desc:     Surface
    statistic:      Monthly Mean
    parent_stat:    Individual Observations
    actual_range:   [-1.8  32.14]
    standard_name:  sea_surface_temperature
# === Plot the first monthly SST map
sst.isel(time=0).plot(
    cmap="coolwarm", 
    figsize=(10, 5)
)
plt.title(f"SST (Low-Res) — {str(sst.time.values[0])[:10]}")
plt.xlabel("Longitude")
plt.ylabel("Latitude")
plt.show()