You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SpatioTemporal Asset Catalogs (STACs) are a way to organize geospatial datasets. STAC APIs let users query huge STAC Catalogs by date, time, and other metadata.
For example, the Microsoft Planetary Computer runs a STAC API that lets users search over catalogs containing all of Sentinel 2 imagery, all Landsat 8, etc. The following code uses the pystac_client library to query the Planetary Computer STAC API and returns metadata, and links to GeoTIFFs, for relevant Sentinel 2 scenes:
We'd like to build a STACAPIDataset object that essentially wraps catalog.search(...), creates a RasterDataset from the returned items, and otherwise behaves as a normal PyTorch dataset (signing assets as needed, etc.). A signature like STACAPIDataset(root="data/", api_endpoint, max_cache_size=None, **query_parameters_to_pystac_client) would be a good starting point here.
As a detailed note, it may be a good idea to cache accessed data in a local directory.
The text was updated successfully, but these errors were encountered:
All yours :) (I had you in mind writing this actually, it is a bit more interesting than the other dataset stuff!) -- feel free to message me if you want to discuss details
@adamjstewart -- this would involve taking on some dependencies (pystac_client, planetary-computer, maybe stackstac)
SpatioTemporal Asset Catalogs (STACs) are a way to organize geospatial datasets. STAC APIs let users query huge STAC Catalogs by date, time, and other metadata.
For example, the Microsoft Planetary Computer runs a STAC API that lets users search over catalogs containing all of Sentinel 2 imagery, all Landsat 8, etc. The following code uses the pystac_client library to query the Planetary Computer STAC API and returns metadata, and links to GeoTIFFs, for relevant Sentinel 2 scenes:
We'd like to build a
STACAPIDataset
object that essentially wrapscatalog.search(...)
, creates a RasterDataset from the returned items, and otherwise behaves as a normal PyTorch dataset (signing assets as needed, etc.). A signature likeSTACAPIDataset(root="data/", api_endpoint, max_cache_size=None, **query_parameters_to_pystac_client)
would be a good starting point here.As a detailed note, it may be a good idea to cache accessed data in a local directory.
The text was updated successfully, but these errors were encountered: