Add support for loading data from Amazon S3 #416
Comments
Some notes from the PyHC discussion on containerizing PyHC projects: the fsspec package can open files on S3 and return an object that behaves like an ordinary file handle. It also supports caching to the local filesystem. cdflib already supports opening files on S3. netcdf4 does not, but there is an s3netcdf4 package that does. It might be good for cdflib and netcdf4 to accept fsspec objects directly instead of filenames, which would abstract away the type of endpoint being used.
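For reference, a minimal sketch of the fsspec behavior described above: opening an S3 object as a file-like handle while keeping a local cached copy. The helper names (`chained_cache_url`, `open_s3_cached`), the bucket path in the usage note, and the choice of anonymous access are assumptions for illustration, not anything PySPEDAS or cdflib currently does. It relies on fsspec's URL chaining (`simplecache::`) and needs the s3fs package installed for `s3://` URLs.

```python
import tempfile
from typing import Optional


def chained_cache_url(url: str) -> str:
    """Prefix a URL with fsspec's 'simplecache::' so a local copy is kept."""
    if url.startswith("simplecache::"):
        return url
    return "simplecache::" + url


def open_s3_cached(url: str, cache_dir: Optional[str] = None, anon: bool = True):
    """Return an fsspec OpenFile for an s3:// URL, cached on local disk.

    Hypothetical helper: anon=True assumes a publicly readable bucket.
    """
    import fsspec  # third-party; s3:// support additionally requires s3fs

    if cache_dir is None:
        # Assumption: a throwaway cache directory is acceptable here.
        cache_dir = tempfile.mkdtemp(prefix="fsspec_cache_")
    return fsspec.open(
        chained_cache_url(url),
        mode="rb",
        s3={"anon": anon},                         # options for the S3 layer
        simplecache={"cache_storage": cache_dir},  # options for the cache layer
    )
```

Usage would look roughly like `with open_s3_cached("s3://example-bucket/path/to/file.cdf") as f: data = f.read()`, where the bucket and key are placeholders. The object yielded by the `with` block behaves like an ordinary binary file handle.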
Created a working branch (416-s3-awareness) to begin addressing this. Let us know if you'd like to discuss any updates or requests related to it.
Looking forward to seeing what you come up with! Feel free to bounce any ideas off me. Do you have a general approach in mind yet? I was thinking it would be nice if the CDF, NetCDF, and FITS readers could be modified to take open file descriptors instead of filenames. Then, if the PySPEDAS download code sees s3: URLs, it could open them itself (using an S3-aware package) and pass the handles to the other libraries.
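A rough sketch of the approach suggested above, in which the download layer inspects the URL scheme and hands an open handle to the reader. The function names `is_s3_url` and `open_for_reader` are hypothetical, and using fsspec (with s3fs) as the "S3-aware package" is an assumption; any library that returns a file-like object would fit the same pattern.

```python
from urllib.parse import urlparse


def is_s3_url(path: str) -> bool:
    """Return True when the path uses the s3:// scheme."""
    return urlparse(path).scheme.lower() == "s3"


def open_for_reader(path: str, anon: bool = True):
    """Return a binary file handle for either a local path or an s3:// URL.

    Hypothetical helper: a CDF/NetCDF/FITS reader that accepts file-like
    objects could be given the returned handle instead of a filename.
    """
    if is_s3_url(path):
        import fsspec  # third-party; s3:// support requires s3fs

        # Assumption: anon=True, i.e. the bucket allows anonymous reads.
        return fsspec.open(path, mode="rb", anon=anon).open()
    # Local files go through the built-in open().
    return open(path, "rb")
```

With this split, the readers would not need to know where the bytes come from. Only the download code decides how to open the source, and the caller is responsible for closing the returned handle.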
@jameswilburlewis, this update and its associated PR are not a full solution that closes this issue. It covers the majority of PySPEDAS cases that call […]. Also included here is a flow chart of the logic for the […]. It would be great to receive feedback on this initial PR to ensure that the remaining changes planned for the custom download methods match it in logic and functionality.
Heliocloud has a large amount of MMS data available via S3 buckets. At some point, we should update PyTplot and PySPEDAS to support loading data from S3.