KNME descriptions missing #919

Closed
ekemeyer opened this issue Sep 19, 2024 · 3 comments

@ekemeyer
Contributor

ekemeyer commented Sep 19, 2024

Miranda identified around 3.5K assets that lost their descriptions and other relevant data during the migration, mostly in the NM collection. Most of the descriptions are still in AAPB, so that PBCore will need to be pulled down and mapped to an update spreadsheet for ingest into AMS2.

Some descriptions are also missing from AAPB but are accurate in AMS2-Demo (see example here: https://ams2-demo.wgbh-mla.org/concern/asset_resources/cpb-aacip-191-00ns1rrc). The second step for this ticket will be to pull that data down and generate a second spreadsheet to update AMS2.

Some details:
These are the PBCore fields that are missing data, and their mappings to the ingester CSV columns.

fields = {
    'Asset.asset_types': ".//pbcore:pbcoreAssetType",
    'Asset.producing_organization': ".//pbcore:pbcoreCreator/pbcore:creator",
    'Asset.local_identifier': ".//pbcore:pbcoreIdentifier[@source='Local Identifier']",
    'Asset.aacip_identifier': ".//pbcore:pbcoreIdentifier[@source='http://americanarchiveinventory.org']",
    'Asset.genre': ".//pbcore:pbcoreGenre[@source='AAPB Format Genre']"
}

# Specific field mappings based on their type attributes
title_type_mapping = {
    'Series': 'Asset.series_title',
    'Program': 'Asset.program_title',
    'Episode': 'Asset.episode_title',
    'Promo': 'Asset.promo_title'
}

date_type_mapping = {
    'Broadcast': 'Asset.broadcast_date',
    'Created': 'Asset.created_date',
    'Copyright': 'Asset.copyright_date'
}

description_type_mapping = {
    'Series': 'Asset.series_description',
    'Program': 'Asset.program_description',
    'Episode': 'Asset.episode_description',
    'Promo': 'Asset.promo_description'
}

@ekemeyer ekemeyer self-assigned this Sep 19, 2024
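
For illustration, the mappings listed above could be applied to a single PBCore record roughly as follows. This is a minimal sketch, not the script actually used for this ticket: the namespace URI, the `titleType` / `dateType` / `descriptionType` attribute names, the function name `pbcore_to_row`, and the sample record are all assumptions, and only a subset of the mappings is repeated to keep it short. It also reports type values that have no column mapping, since those would otherwise be dropped silently.

```python
import xml.etree.ElementTree as ET

# Assumption: records use the PBCore 2.x namespace.
NS = {"pbcore": "http://www.pbcore.org/PBCore/PBCoreNamespace.html"}

# Subset of the untyped field mappings from the issue (CSV column -> XPath).
FIELDS = {
    "Asset.asset_types": ".//pbcore:pbcoreAssetType",
    "Asset.aacip_identifier":
        ".//pbcore:pbcoreIdentifier[@source='http://americanarchiveinventory.org']",
    "Asset.genre": ".//pbcore:pbcoreGenre[@source='AAPB Format Genre']",
}

# Typed mappings: (element name, type attribute, {type value -> CSV column}).
# The attribute names are assumptions based on the PBCore schema.
TYPED = [
    ("pbcoreTitle", "titleType", {
        "Series": "Asset.series_title",
        "Episode": "Asset.episode_title",
    }),
    ("pbcoreAssetDate", "dateType", {
        "Broadcast": "Asset.broadcast_date",
    }),
    ("pbcoreDescription", "descriptionType", {
        "Series": "Asset.series_description",
        "Episode": "Asset.episode_description",
    }),
]


def pbcore_to_row(xml_text):
    """Return (row, unmapped) for one PBCore document.

    row maps CSV column names to lists of values; unmapped lists the
    (element, type) pairs that had no column in the mappings.
    """
    root = ET.fromstring(xml_text)
    row, unmapped = {}, []

    def add(column, element):
        text = (element.text or "").strip()
        if text:
            row.setdefault(column, []).append(text)

    for column, xpath in FIELDS.items():
        for element in root.findall(xpath, NS):
            add(column, element)

    for tag, attr, mapping in TYPED:
        for element in root.findall(f".//pbcore:{tag}", NS):
            type_value = element.get(attr)
            column = mapping.get(type_value)
            if column:
                add(column, element)
            else:
                unmapped.append((tag, type_value))

    return row, unmapped


# Hypothetical sample record, for demonstration only.
SAMPLE = """<pbcoreDescriptionDocument xmlns="http://www.pbcore.org/PBCore/PBCoreNamespace.html">
  <pbcoreAssetType>Episode</pbcoreAssetType>
  <pbcoreAssetDate dateType="Broadcast">1985-01-01</pbcoreAssetDate>
  <pbcoreIdentifier source="http://americanarchiveinventory.org">cpb-aacip-000-example</pbcoreIdentifier>
  <pbcoreTitle titleType="Series">Example Series</pbcoreTitle>
  <pbcoreDescription descriptionType="Episode">An example description.</pbcoreDescription>
  <pbcoreDescription descriptionType="Raw Footage">Not covered by the mapping.</pbcoreDescription>
</pbcoreDescriptionDocument>"""

if __name__ == "__main__":
    row, unmapped = pbcore_to_row(SAMPLE)
    print(row)
    print("unmapped types:", unmapped)
```

Values are kept as lists because a record can carry several titles or descriptions of the same type; how multiple values should be written into one spreadsheet cell depends on the ingester and is left out here.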
@ekemeyer
Contributor Author

nm_URLS.txt

@ekemeyer
Contributor Author

ekemeyer commented Sep 30, 2024

  • Identify, pull down, and integrate descriptions that exist only in AMS2-Demo

  • Identify, pull down, and integrate Raw Footage descriptions in AAPB that were previously skipped

@ekemeyer
Contributor Author

ekemeyer commented Oct 2, 2024

This is complete after a couple of iterations. AMS2-Demo, AAPB, and AMS2 all had varying amounts of description metadata. Raw Footage descriptions were accidentally left out of the first round, so a second round of ingests was needed to include them. I can't confirm it is 100% accurate, but AMS2 should now have all the descriptions that previously existed in AAPB and/or AMS2-Demo.
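
One way the remaining uncertainty could be checked is to compare the three systems directly. The sketch below is illustrative only and was not part of this ticket: it assumes each system's descriptions have already been exported into a dict keyed by asset ID, and the function name `find_missing_descriptions` and the sample IDs are hypothetical.

```python
def find_missing_descriptions(aapb, ams2_demo, ams2):
    """Return sorted asset IDs that have a description in AAPB and/or
    AMS2-Demo but an empty or absent description in AMS2."""

    def has_text(source, asset_id):
        # Treat None, missing keys, and whitespace-only values as "no description".
        return bool((source.get(asset_id) or "").strip())

    missing = []
    for asset_id in sorted(set(aapb) | set(ams2_demo)):
        in_a_source = has_text(aapb, asset_id) or has_text(ams2_demo, asset_id)
        if in_a_source and not has_text(ams2, asset_id):
            missing.append(asset_id)
    return missing


if __name__ == "__main__":
    # Hypothetical exports: asset ID -> description text.
    aapb = {"id-1": "From AAPB", "id-2": "", "id-3": "Raw footage notes"}
    ams2_demo = {"id-2": "Only in demo", "id-4": "   "}
    ams2 = {"id-1": "From AAPB", "id-2": "Only in demo"}
    print(find_missing_descriptions(aapb, ams2_demo, ams2))
```

An empty result would indicate that every asset with a description in either source also has one in AMS2; it would not show whether the text itself matches.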

@ekemeyer ekemeyer closed this as completed Oct 2, 2024