Define necessary data parameter descriptors to be used in STAC definitions for data #7

santilland · 2023-01-30T17:42:13Z

WIP: STAC definition document:
https://docs.google.com/spreadsheets/d/1Rzygo7mt-d5Sb1OtvjTh270sKg-SgFkH-uI8GI2l2-M/edit?usp=sharing

Before deciding on any STAC structure or specific implementation we want to find and define all parameters we require to make sure we can represent the data as we currently do in the dashboard as well as with additional information that is currently missing. As we see in our project as well as others, there is a divide in Catalog data (being more bare bones) and additional data being maintained separately on how to present the data.
We want to try and define all of the parameters that are handled so that we can agree on how they can or should be included in the STAC descriptions.

This first entry will be edited to maintain a list of all the collected inputs provided as comments.

name
description
theme(s): to which overall category does this indicator belong to, could also be considered a tag
story: reference to markdown file providing extensive information with media content
thumbnail: url to small image
preview: url to medium sized image
extent: area covered
country: when indicator specific to a country
city: when indicator specific to a city
siteName: when within a city different location
data list: list of time entries extracted through various interfaces to define for which dates data is available
data endpoint:
- object describing how to retrieve or visualize the indicator on the map
- typical WMS/WMTS/Vector Tiles/COG/rendering services
citing: a way to describe how to cite when using the source
author: some way of describing all contributing authors / institutes / ....
license: what license has the data
disclaimer: any disclaimer related to the data
data sources: maybe list of references to same object being described here
accuracy: description of accuracy
resolution: when applicable what resolution has the data
doi: for dataset
colorlegend: object describing list of colors and range to be used
satellite-mission, object describing if data is coming directly from satellite mission including:
- sensor description
- mission group
- ...

santilland · 2023-02-03T09:23:14Z

As the second entry we could try describing what levels of stac we use and where some of the information should be contained:

santilland · 2023-02-03T10:14:32Z

List of references

Catalogs:

Catalog visualization:

Additional data descriptors:

Example file: https://github.com/NASA-IMPACT/veda-config/blob/develop/datasets/bangladesh-landcover-2001-2020.data.mdx

Related issues:

need for citing specification (from Cite Data eodash#1907)
area covered (from Query data by geographic info eodash#1905)
From Find Indicator Author eodash#1897:
- indicator metadata contains provider/data owner information/identifier
- data API exposes this info in machine readable form
- story exposes this info in human readable form

Technologies:

https://github.com/ASFHyP3/asf-stac

Others:
Possible consideration to describe tabular data, e.g. geodb:
use summaries to describe table
and/or use table stac extension

OSC:

Consider use of OSC extension especially for themes

j08lue · 2023-02-03T11:14:56Z

Great! Can we actually turn this into a table, where we can have some columns like

Name
Description
Exists in VEDA STAC
Exists in Planetary Computer STAC
Requires STAC extension (name/new)
Value for EO Dashboard project - on a scale from 1 (low) to 10 (high value)
Value for VEDA project - on a scale from 1 (low) to 10 (high value)

After some iteration, we can put these properties into a 2x2 matrix to identify the ones that are easy to implement and high-value.

j08lue · 2023-02-03T11:17:06Z

Scientific Citation, for example, already has a stable STAC extension: https://github.com/stac-extensions/scientific - easy to implement

santilland · 2023-02-21T14:19:11Z

Hello @j08lue as the table is getting a bit more complex then what i think makes sense to manage in github comments i created following document:
https://docs.google.com/spreadsheets/d/1Rzygo7mt-d5Sb1OtvjTh270sKg-SgFkH-uI8GI2l2-M/edit?usp=sharing

Please feel free to request access permissions if you would like to provide inputs there. I think from our side we have a good starting point which we can reference for starting our initial implementation tests for catalog creation, i imagine while working in the creation of the catalog we will find what things works and which don't.
For now the only thing i am missing is a way to describe applied colormaps, i have not seen any extension for that.

santilland · 2023-05-17T09:58:43Z

Hello @j08lue , i would like to update on some thoughts we are also going to discuss internally. I think the more important discussion (apart of what metadata we save) is the hierarchy we want to use so that the dashboards can be nicely populated.
I have tried to brainstorm the structure a little here and would love to hear your feedback on how this hierarchy would fit the concepts you use for your dashboard:

santilland · 2023-07-11T11:49:15Z

Initial implementation of generation logic has been started as part of new repository eodash-catalog.
Decisions on properties to include and possible necessary hierarchy will be done in an iterative process while trying to integrate all necessary information for available collection into the STAC catalog

j08lue · 2023-07-11T12:56:37Z

@santilland, can you briefly describe what the purpose of the eodash-catalog is? Is this some kind of translating / intake service that makes various (STAC) metadata providers compatible with the EO Dashboard?

Will eodash-catalog implement the "Indicator" level that you proposed previously?

We have been discussing different approaches to federated STAC search and connection to dashboards / visualization frontends and are planning on having a dedicated working group on this in the next few months. I hope there will be traction on this subject from the VEDA side then.

While I am sure that a "glue service" or auxiliary data injector will continue to be needed to harmonize various sources, I still have the dream of keeping that as slim as possible and moving more dashboard / visualization related information into STAC instead. Seems like this has been discussed before in the STAC community (re rendering hints, etc)... Would you be interested in us pursuing that direction (more dashboard-friendly STAC), too?

santilland · 2023-07-11T14:43:13Z

@j08lue the idea is to move the description of (data) content away from the eodash client repository and to allow using the dashboard just by pointing it to a supported STAC catalog. This way the instantiation of the eodash client is greatly simplified and the data provided by the different instances can more easily be integrated into other clients.

Additionally it simplifies how user/expert contributed data can be integrated into the eodash client instances.
Initially it is a translating/intake service for including also external catalogs that do not have the required information, but with the hope that we can as much as possible directly integrate other sources, without the need of translation.
The step is not only for translation but also for configuration of what data should actually be included, so for example, the catalog generated can point to external collections, but we still need a dedicated catalog because we usually do not want to integrate entire external catalogs, only specific collections or we want to subset in very specific ways.

For now our approach to define visualizations is to use the web-map-links extension.
We did some test for data we include in our dashboard from the VEDA endpoint, for example for the yearly NO2, by adding the web-map-link to the items it already can be visualized by the standard stac browser:
https://radiantearth.github.io/stac-browser/#/external/eurodatacube.github.io/eodash-catalog/trilateral/catalog.json
You can navigate to individual items and get a visualization on the map.

We are very much interested in pursuing that direction with you, that is why i am also trying to describe our process here :)

santilland · 2023-07-11T14:56:05Z

As for the indicator level this is something that is still under discussion, i see three approaches:

It is a special configuration that can be done on the client (for example describing which collections should be grouped into indicators) - don't necessarily want this in the client configuration
or it is a special type of collection, which makes things a bit complicated as in stac we expect the same "granularity" when navigating a level, so if some special collections (indicators) references other collections then it would not be the same granularity for "plain" collections that then point to items
or we add a new "level" where normally each indicator points to one collection, but for more complex indicators references to multiple collections. In this case the catalog would then point to indicators and not directly to collections

This still leaves some headaches about on which level which information should exist. Does an indicator describe the data collections it references? Or do you need to crawl the referenced collections to gather the data that describes the indicator, and so on

santilland added the enhancement New feature or request label Jan 30, 2023

santilland mentioned this issue Jan 30, 2023

Create concept for an improved search functionality eurodatacube/eodash#1950

Open

silvester-pari mentioned this issue Feb 8, 2023

Make data access flow clearer eurodatacube/eodash#2005

Open

This was referenced Jul 11, 2023

Implementation of STAC extraction for Sentinelhub collection #9

Closed

Implementation of STAC extraction for NASA datasets #6

Closed

santilland transferred this issue from eurodatacube/eodash Jul 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define necessary data parameter descriptors to be used in STAC definitions for data #7

Define necessary data parameter descriptors to be used in STAC definitions for data #7

santilland commented Jan 30, 2023 •

edited

Loading

santilland commented Feb 3, 2023 •

edited

Loading

santilland commented Feb 3, 2023 •

edited

Loading

j08lue commented Feb 3, 2023

j08lue commented Feb 3, 2023

santilland commented Feb 21, 2023

santilland commented May 17, 2023

santilland commented Jul 11, 2023

j08lue commented Jul 11, 2023

santilland commented Jul 11, 2023

santilland commented Jul 11, 2023

Define necessary data parameter descriptors to be used in STAC definitions for data #7

Define necessary data parameter descriptors to be used in STAC definitions for data #7

Comments

santilland commented Jan 30, 2023 • edited Loading

santilland commented Feb 3, 2023 • edited Loading

santilland commented Feb 3, 2023 • edited Loading

j08lue commented Feb 3, 2023

j08lue commented Feb 3, 2023

santilland commented Feb 21, 2023

santilland commented May 17, 2023

santilland commented Jul 11, 2023

j08lue commented Jul 11, 2023

santilland commented Jul 11, 2023

santilland commented Jul 11, 2023

santilland commented Jan 30, 2023 •

edited

Loading

santilland commented Feb 3, 2023 •

edited

Loading

santilland commented Feb 3, 2023 •

edited

Loading