t570 - Tasks to update existing GwETDs with proquest zipfile name, and to download only new ETDs from S3 #572
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes #570.
To test:
BEFORE switching to this branch
...so, while on the
main
branch, download a few ETDs from theproquest-etds
bucket, copy them to/opt/scholarspace/scholarspace-ingest
.proquest-etds-test2
S3 bucket.gwss:ingest_pq_etds
task. Then, do a bulkrax import. See https://github.com/gwu-libraries/scholarspace-hyrax/wiki/Bulkrax-imports for details. Make sure to clear out the/opt/scholarspace/scholarspace-ingest
folder as well as the/opt/scholarspace/scholarspace-hyrax/tmp/bulkrax_zip
folder afterwards.AFTER switching to this branch
.env
with the new credential values at the bottom ofexample.env
. There is a bucket calledproquest-etds-test2
(inus-east-1
) that you can use. You can either create an additional, new credential pair, or contact @kerchner for existing credentials.proquest-etds-test2
contains some ProQuest ETDs (zip files) that your environment does not already have. You can copy a few more ProQuest ETDs over fromproquest-etds
to accomplish this.gwss:populate_etd_proquest_zipfile
task. Edit one of the GwETDs and observe thatproquest_zipfile
is (correctly) populatedproquest-etds
S3 bucket toproquest-etds-test-2
.gwss:download_new_pq_zips
task. Observe that the "new" ETDs have been downloaded to/opt/scholarspace/scholarspace-ingest
gwss:ingest_pq_etds
task. Then, do a bulkrax import. See https://github.com/gwu-libraries/scholarspace-hyrax/wiki/Bulkrax-imports for details. Observe (via the work edit form) that the newly loaded ETDs are populated withproquest_zipfile
values.