Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Data]: GLAD Landcover Dataset 2020 #1134

Closed
abarenblitt opened this issue Nov 1, 2024 · 11 comments
Closed

[Data]: GLAD Landcover Dataset 2020 #1134

abarenblitt opened this issue Nov 1, 2024 · 11 comments
Assignees
Labels
MSFC MSFC related issues

Comments

@abarenblitt
Copy link

Dataset Name

Global Land Cover and Land Use 2000 and 2020

Dataset Description

Annual maps of land cover and land use (2020)
Global map with continuous measures of bare ground and tree height inside and outside of wetlands, seasonal water percent, and binary labels of built-up, permanent snow/ice, and cropland.

Requestor Name/Affiliation

Abigail Barenblitt/ University of Maryland

Platform/Method/Sensor

Landsat archive

URL or DOI to Dataset Description

https://doi.org/10.3389/frsen.2022.856903

URL to Download or Access the Data

https://storage.googleapis.com/earthenginepartners-hansen/GLCLU2000-2020/v2/download.html

Data License

Creative Commons Attribution 4.0 International License

Intended Science Use Case

Examining PA GEDI values by landcover tupe

Format of the Data

GeoTIFF

Approximate Size of the Data

No response

Date Needed By

No response

Additional Information

Currently only need 2020

@wildintellect
Copy link
Collaborator

The answer to this might not be to download anything, another group has made a single COG out of this dataset and hosted it publicly (we might want a copy later for repeatability).

Here's a notebook I put together showing how to extra points from the file using Terra in R, a single GEDI extracted tile from the GEDI_PA folder with 500,000 pts+ runs in less than a couple of minutes.
https://gist.github.com/wildintellect/87a9fc8fffa9c20e57a0f168845013f3

We'll still look into if we should officially pull a copy of the data into MAAP STAC. That would add a step of querying STAC for matching files, and then extracting across those.

@hrodmn
Copy link

hrodmn commented Nov 5, 2024

If we have the time to build a stactools package for this dataset it would be a valuable addition since there are several other GLAD datasets that could be useful in the future. If nothing else it could be a nice layer for visualizations in MAAP maps.

I spent a few minutes on a notebook that uses obstore for I/O and rio-cogeo for converting a plain tif to a COG.

@wildintellect do you still want to load this dataset into the MAAP STAC? It could be a useful exercise for me while getting ramped up on STAC ingestion.

@hrodmn
Copy link

hrodmn commented Nov 7, 2024

I rigged up a process for converting the raw data into COGs and uploading to the maap-ops-workspace shared directory. https://gist.github.com/hrodmn/2777fe84ca6af466791cdb9a66b62f66

They are not in the STAC yet but here is a notebook with the layers loaded in a leaflet map: https://notebooksharing.space/view/6af869282ffcfd9c097e364830ca528bd4320576712b553212627e83c92e2b68#displayOptions=hide-inputs

@hrodmn
Copy link

hrodmn commented Nov 13, 2024

PR for a stactools package is open here: stactools-packages/glad-glclu2020#2. @wildintellect or @abarenblitt are you interested in reviewing it before I generate the STAC metadata for this collection?

@wildintellect
Copy link
Collaborator

@freitagb there's a question about if this should go to the MAAP STAC or the VEDA STAC since both projects might be keenly interested in the COG version of this dataset.

@hrodmn
Copy link

hrodmn commented Nov 19, 2024

We are close to adding the GLAD LCLU dataset to the MAAP (or VEDA) STAC! Here is what I have done:

What's next:

  • move the COGs to s3://nasa-maap-data-store/file-staging/nasa-map/{collection_name}/
  • generate STAC metadata for the collections (glad-glclu2020-v2 and glad-glclu2020-v2-change) and upload to the STAC database
  • generate STAC metadata for the items and upload to the STAC database

I need some advice on a few things:

  1. We need to pick a catalog to add this dataset to - MAAP or VEDA? I'm not sure it really matters for our users but we do have to pick one!
  2. I need to leave breadcrumbs for my COG generating procedure since I did it in a notebook in the MAAP ADE instead of using some more robust workflow. Where should I leave those docs besides this ticket?
    • Alternatively, I could try running the process (with STAC metadata generation built-in) in the Airflow way. I haven't done this before but my goal is to become familiar with the best practices for adding data to a STAC and that method keeps coming up as the standard approach. If there is a tutorial or example of how we run that process I would love to see it!
  3. If we decide to just want to copy the COGs I will still need to run a process that generates STAC metadata and uploads it to the STAC database.

cc @jjfrench @ividito

@hrodmn
Copy link

hrodmn commented Nov 20, 2024

I have ingested the whole dataset into the MAAP test STAC, PR for the COG and STAC generation notebooks is here: MAAP-Project/maap-documentation-examples#67

If we want to put it in VEDA we can still do that, but for now it is in the MAAP test STAC.

Here is a notebook with map layers served from the STAC collection from titiler-pgstac: https://notebooksharing.space/view/4338e1efcfd0f8b2630b3832cda9c8ae5ef0926aa1f79bafcab10545e4f99474#displayOptions=hide-inputs

@wildintellect
Copy link
Collaborator

@hrodmn what's left to do before we pull it to the MAAP STAC ( officially not test )?

@hrodmn
Copy link

hrodmn commented Nov 21, 2024

Once I get the go ahead I will run the stac generation notebook but pointed at the dev stac ingestor instead of the test ingestor.

@hrodmn
Copy link

hrodmn commented Nov 26, 2024

I posted the glad-glclu2020-v2 and glad-glclu2020-change-v2 collections and items to the dev STAC database so they are ready to use!

https://stac.maap-project.org/collections/glad-glclu2020-v2
https://stac.maap-project.org/collections/glad-glclu2020-v2-change

@hrodmn hrodmn closed this as completed Nov 26, 2024
@hrodmn
Copy link

hrodmn commented Nov 26, 2024

Here is an example of how to use the STAC API to extract GLAD land cover values for a set of points in Python, following @wildintellect's example from above: https://gist.github.com/hrodmn/ecda7f64391bb77ed3c69528f9fb468a

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
MSFC MSFC related issues
Projects
None yet
Development

No branches or pull requests

4 participants