Add support for native ERA5 data in GRIB format #2178

schlunma · 2023-08-23T06:58:01Z

Description

This PR allows ESMValCore to process native ERA5 data in GRIB format, which is for example available on Levante in the /pool/data/ERA5 directory.

Reading the data

The following settings are necessary in the user configuration file:

rootpath:
  ...
  native6:
    /pool/data/ERA5: DKRZ-ERA5-GRIB
  ...

I added an extra facets file which includes reasonable defaults for all supported variables. You can check it out here.

Thus, reading this data is as easy as

datasets:
  - {project: native6, dataset: ERA5, timerange: '2000/2001', short_name: tas, mip: Amon}
  - {project: native6, dataset: ERA5, timerange: '2000/2001', short_name: cl, mip: Amon, tres: 1H, frequency: 1hr}
  - {project: native6, dataset: ERA5, timerange: '2000/2001', short_name: ta, mip: Amon, type: fc, typeid: '12'}

Regridding

Native ERA5 data in GRIB format is on a reduced Gaussian grid (i.e., an unstructured grid). Thus, in 99% of the use cases, it is necessary to regrid this data, especially since no cell areas are available for the data (thus, we cannot even calculate global/regional statistics over the native data). This is done automatically by the CMORizer (as recommended by the ECMWF), but can be disabled in the recipe:

datasets:
  - {project: native6, dataset: ERA5, timerange: '2000/2001', short_name: tas, mip: Amon, automatic_regrid: false}

This PR depends on the following other PRs:

Closes #1991
Closes ESMValGroup/ESMValTool#3238

Link to documentation: https://esmvaltool--2178.org.readthedocs.build/projects/ESMValCore/en/2178/quickstart/find_data.html#supported-native-reanalysis-observational-datasets

Before you get started

☝ Create an issue to discuss what you are going to do

Checklist

It is the responsibility of the author to make sure the pull request is ready to review. The icons indicate whether the item will be subject to the 🛠 Technical or 🧪 Scientific review.

🧪 The new functionality is relevant and scientifically sound
🛠 This pull request has a descriptive title and labels
🛠 Code is written according to the code quality guidelines
🧪 and 🛠 Documentation is available
🛠 Unit tests have been added
🛠 Changes are backward compatible
🛠 Any changed dependencies have been added or removed correctly
🛠 The list of authors is up to date
🛠 All checks below this pull request were successful

To help with the number of pull requests:

🙏 We kindly ask you to review two other open pull requests in this repository

codecov · 2023-08-23T07:06:22Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 94.85%. Comparing base (a328578) to head (9af867a).
Report is 14 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2178      +/-   ##
==========================================
+ Coverage   94.66%   94.85%   +0.18%     
==========================================
  Files         251      251              
  Lines       14287    14371      +84     
==========================================
+ Hits        13525    13631     +106     
+ Misses        762      740      -22


schlunma · 2023-08-25T08:22:15Z

This is ready from my side, but there are two issues that need to be resolved before I mark this ready for review:

Cleaned and extended function that extracts datetimes from paths #2181
For some reason I can't get iris-grib to run on CircleCI; locally it works well. This is probably closely related to the problem reported by @remi-kazeroni here.

I tested this thoroughly with the following recipe: recipe_000.yml.txt

An example run is available on Levante here: /home/b/b309141/scratch/esmvaltool_output/recipe_000_20230825_080240

Note that with the default dask scheduler, this recipe ran into a timeout after 8 hours with 67/76 tasks finished. With the following dask configuration, I could run the same recipe on the same node (regular Levante compute node with 256 GiB of memory) in 5:27 min (!!) 🚀.

cluster:
  type: distributed.LocalCluster
  n_workers: 32
  threads_per_worker: 4
  memory_limit: 8 GiB
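As a quick sanity check of the configuration above, the following sketch multiplies out the totals it implies. It assumes that `memory_limit` applies per worker (as it does for `distributed.LocalCluster`); the variable names are only for illustration.

```python
# Values copied from the Dask configuration above.
n_workers = 32
threads_per_worker = 4
memory_limit_gib = 8  # assumed to be the limit per worker

# Totals implied by this configuration.
total_threads = n_workers * threads_per_worker
total_memory_gib = n_workers * memory_limit_gib

print(f"total threads: {total_threads}")
print(f"total worker memory: {total_memory_gib} GiB")
```

With these values the workers together may use up to 256 GiB, which matches the memory of the compute node mentioned above, so there is no headroom left for other processes on that node.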

@ESMValGroup/technical-lead-development-team


bettina-gier

Found a few typos and had a comment for futureproofing the automatic regridding in case we are adding more grib datasets (CAMS soon TM). Runs fine though and input looks reasonable!
Think we can merge this with 3 pull request reviews and all issues and dependencies solved?


Co-authored-by: Bettina Gier

schlunma · 2024-12-06T09:49:33Z

While testing this again, @bettina-gier and I got the following error, as already reported here: ZeroDivisionError: integer division or modulo by zero

Full traceback:

2024-12-06 09:38:18,767 UTC [4154330] ERROR   Program terminated abnormally, see stack trace below for more information:
multiprocessing.pool.RemoteTraceback: 
"""
Traceback (most recent call last):
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/multiprocessing/pool.py", line 125, in worker
    result = (True, func(*args, **kwds))
                    ^^^^^^^^^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/_task.py", line 895, in _run_task
    output_files = task.run()
                   ^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/_task.py", line 290, in run
    self.output_files = self._run(input_files)
                        ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/__init__.py", line 730, in _run
    product.apply(step, self.debug)
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/__init__.py", line 527, in apply
    self.cubes = preprocess(
                 ^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/__init__.py", line 430, in preprocess
    _run_preproc_function(
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/__init__.py", line 364, in _run_preproc_function
    return function(items, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/_shared.py", line 237, in wrapper
    result = func(data, *args, **kwargs)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/_time.py", line 826, in climate_statistics
    agg_kwargs = update_weights_kwargs(
                 ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/_shared.py", line 162, in update_weights_kwargs
    callback(cube, **callback_kwargs)
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/_time.py", line 870, in _add_time_weights_coord
    get_time_weights(cube),
    ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/b/b309141/tmp/ESMValCore/esmvalcore/preprocessor/_shared.py", line 377, in get_time_weights
    time_weights = time_weights.rechunk(time_chunks)
                   ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/core.py", line 2779, in rechunk
    return rechunk(self, chunks, threshold, block_size_limit, balance, method)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/rechunk.py", line 349, in rechunk
    chunks = normalize_chunks(
             ^^^^^^^^^^^^^^^^^
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/core.py", line 3151, in normalize_chunks
    chunks = _convert_int_chunk_to_tuple(shape, chunks)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/core.py", line 3177, in _convert_int_chunk_to_tuple
    return sum(
           ^^^^
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/core.py", line 3180, in <genexpr>
    blockdims_from_blockshape((s,), (c,))
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/core.py", line 1279, in blockdims_from_blockshape
    return tuple(
           ^^^^^^
  File "/work/bd0854/b309141/micromamba/envs/test_np1/lib/python3.12/site-packages/dask/array/core.py", line 1280, in <genexpr>
    ((bd,) * (d // bd) + ((d % bd,) if d % bd else ()) if d else (0,))
              ~~^^~~~
ZeroDivisionError: integer division or modulo by zero

The reason for this is that the intermediate rechunk between the preprocessor steps creates the following chunks after the regrid step / before the climate_statistics step:

chunks before rechunk
((36, 13), (720,), (1440,))
chunks after rechunk
((0, 36, 13), (720,), (1440,))

The only non-default preprocessors used here are regrid and climate_statistics. The leading zero then causes the error during the calculation of the time weights. The very funny thing, again, is that this depends on the timerange: the error appears for 3 days, but not for 10 days (see recipe). Also, as reported by Tina, this does not appear in older Dask versions (I am using 2024.12.0 for this test).
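To illustrate why a zero-size chunk is a problem, here is a minimal pure-Python sketch. The function `block_sizes` only mirrors the single expression shown in the last frame of the traceback; it is not Dask's actual code path, and the helper names as well as the values passed to it are assumptions chosen for illustration.

```python
def block_sizes(d: int, bd: int) -> tuple[int, ...]:
    """Split a dimension of length ``d`` into blocks of size ``bd``.

    Mirrors the expression from the last traceback frame (illustration only).
    """
    return (bd,) * (d // bd) + ((d % bd,) if d % bd else ()) if d else (0,)


def has_empty_chunk(chunks) -> bool:
    """Return True if any axis contains a chunk of size 0 (hypothetical helper)."""
    return any(size == 0 for axis in chunks for size in axis)


# Chunks copied from the comment above.
before = ((36, 13), (720,), (1440,))
after = ((0, 36, 13), (720,), (1440,))

# Both chunkings describe an array of the same shape ...
shape_before = tuple(sum(axis) for axis in before)
shape_after = tuple(sum(axis) for axis in after)
print(shape_before, shape_after)

# ... but only the second one contains an empty chunk.
print(has_empty_chunk(before), has_empty_chunk(after))

# A non-zero block size splits the time axis as expected.
print(block_sizes(49, 36))

# A block size of 0 reaches the integer division and fails.
try:
    block_sizes(49, 0)
except ZeroDivisionError as exc:
    print(f"ZeroDivisionError: {exc}")
```

A check like `has_empty_chunk` could be used while debugging to find the preprocessor step after which the empty chunk first appears.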

@bouweandela do you think this is related to this PR here, or is it another bug in Dask? I would really like to merge this here.

Recipe:
recipe_era5_grib.yml.txt

bouweandela · 2024-12-06T11:29:05Z

do you think this is related to this PR here, or is it another bug in Dask?

If this PR works fine with the older version of Dask, it's probably a Dask bug and you can just merge. Especially since we just call rechunk on a Dask array without any arguments, this looks very Dask internal. It would be really good to open issue(s) about our recent problems with Dask (I believe the size 0 chunks are the third problem you've found in the past month?) on the Dask repository to make sure that things get fixed and we can keep using recent versions of Dask.

schlunma · 2024-12-06T11:41:39Z

It would be really good to open issue(s) about our recent problems with Dask (I believe the size 0 chunks are the third problem you've found in the past month?) on the Dask repository to make sure that things get fixed and we can keep using recent versions of Dask.

The big problem here really is that it's super hard to isolate this. During the last weeks I spent days trying to come up with a minimal example of a Dask bug which does not use any external packages and couldn't do it. I really hope to finalize this at some point.

So if anyone has the resources to look into this one here, I would be super grateful.

schlunma · 2025-01-07T08:37:48Z

The size 0 chunk problem will be fixed in the next Dask release 🎉

schlunma added 7 commits August 18, 2023 13:56

First working prototype of ERA5 GRIB reader

f998ae3

Extended list of supported variables for ERA5 GRIB support

8c48834

Merge remote-tracking branch 'origin/main' into read_era5_grib

9740cd3

Added public function to check for unstructured grids

60488dc

Make regridding much faster

f9a4ab7

Add support for more variables and make regridding optional

fc7384a

Add doc

e0c4da3

schlunma added the observations label Aug 23, 2023

schlunma added this to the v2.10.0 milestone Aug 23, 2023

schlunma self-assigned this Aug 23, 2023

schlunma added 13 commits August 23, 2023 09:49

Added first tests

3db12bb

Added test for loading grib files

09aabcb

Added iris-grib to environment and setup.py

8c73373

Fixed environment

744c20b

Fixed eccodes dependency

0c8ce64

Next try to get environment working

39c6677

Temporarily remove GRIB loading test

b7a0c68

Fixed tests

e907ff1

Added missing tests

0b8fbfa

Fixed test

bef5b5e

Improved test coverage of ERA5 CMORizer

2693066

Increased test coverage of regrid module

e7e7285

Optimized doc

911ed28

schlunma and others added 2 commits August 25, 2023 13:44

More customizable automatic regriddind for ERA5 GRIB

7afa551

Merge branch 'main' into read_era5_grib

7d09738

schlunma modified the milestones: v2.10.0, v2.11.0 Sep 28, 2023

Merge remote-tracking branch 'origin/main' into read_era5_grib

d0ae8d2

Merge branch 'main' into read_era5_grib

7da3b26