Determine memory requirements for download #1
Comments
Replaces the pyDataverse `DataAccessApi.get_datafile` call with equivalent code that streams the response instead of fetching the entire file into memory. The `StreamingDataAccessApi` object can be removed once this pyDataverse issue is resolved: gdcc/pyDataverse#49. Fixes #1.
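For reference, here is a minimal sketch of what "streaming the response" means in practice. It is not the code from this change: it uses only the Python standard library, and the function name `stream_download`, the 1 MiB chunk size, and the assumption that the caller passes a full Data Access API URL are all illustrative. The `X-Dataverse-key` header is the usual way to pass a Dataverse API token.

```python
import urllib.request

CHUNK_SIZE = 1024 * 1024  # 1 MiB per read; an arbitrary illustrative value


def stream_download(url, dest_path, api_token=None, chunk_size=CHUNK_SIZE):
    """Copy the body at `url` to `dest_path` one chunk at a time.

    Hypothetical helper for illustration. Only `chunk_size` bytes of the
    body are held in memory at once, regardless of the file size.
    Returns the number of bytes written.
    """
    headers = {}
    if api_token:
        # Dataverse accepts the API token in this request header.
        headers["X-Dataverse-key"] = api_token
    request = urllib.request.Request(url, headers=headers)

    total = 0
    with urllib.request.urlopen(request) as response, open(dest_path, "wb") as out:
        while True:
            chunk = response.read(chunk_size)
            if not chunk:  # an empty read means the body is exhausted
                break
            out.write(chunk)
            total += len(chunk)
    return total
```

With this shape, peak memory is bounded by the chunk size rather than by the size of the file being downloaded, which is the property this issue asks for.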
I am looking into using libcurl to handle streaming uploading and downloading via pycurl.
Uses the `curl` command-line tool to stream files when uploading and downloading, since pyDataverse keeps the entire file in memory for both operations. Fixes #1. Upgrades the GitHub checkout action to v3 to fix the Node.js warning raised by v2.
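As a rough illustration of the `curl` approach for the download side, the sketch below shells out to `curl` from Python. It is an assumption about how such a wrapper could look, not the code in this change: the helper names `build_curl_download_command` and `curl_download` are hypothetical, and the flag selection is just one reasonable choice.

```python
import subprocess


def build_curl_download_command(url, dest_path, api_token=None):
    """Return the argv list for a streaming download with curl.

    Hypothetical helper for illustration. curl writes the body to
    `dest_path` as it arrives, so the file is never held in memory
    by the Python process.
    """
    command = [
        "curl",
        "--fail",        # exit non-zero on HTTP errors (4xx/5xx)
        "--silent",      # suppress the progress meter
        "--show-error",  # still print a message when something fails
        "--location",    # follow redirects
        "--output", dest_path,
    ]
    if api_token:
        # Dataverse accepts the API token in this request header.
        command += ["--header", f"X-Dataverse-key: {api_token}"]
    command.append(url)
    return command


def curl_download(url, dest_path, api_token=None):
    """Run curl and raise CalledProcessError if it exits non-zero."""
    subprocess.run(build_curl_download_command(url, dest_path, api_token), check=True)
```

One trade-off to keep in mind: a token passed on the command line can be visible to other users in the process list, so a real implementation may prefer to supply it another way.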
The current download logic may be loading the entire file into memory here:
dataverse-access/dv.py
Lines 73 to 74 in 7a67502
Determine if there is a streaming approach to reduce requirements.