Language Archive Cologne records reference metadata profile that no longer exists #17

Open
twagoo opened this issue Nov 26, 2024 · 4 comments

Comments

@twagoo
Member

twagoo commented Nov 26, 2024

A metadata issue has been identified by Twan on 26 November 2024:

  • Metadata provider: Language Archive Cologne

Description of the issue:

Records are harvested but transformation to CMDI 1.2 fails with the following error:

2024-11-25T19:01:08,983 INFO [Language Archive Cologne] ListHarvesting - retrieved cmd records from endpoint https://api.ka3.uni-koeln.de/oai/lac
2024-11-25T19:01:09,186 ERROR [Language Archive Cologne] TransformAction - Exception thrown by URIResolver; Line#: -1; Column#: -1
net.sf.saxon.trans.XPathException: Exception thrown by URIResolver

(..)

Caused by: net.sf.saxon.trans.XPathException: I/O error reported by XML parser processing https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.x/profiles/clarin.eu:cr1:p_1475136016193/xml: https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.x/profiles/clarin.eu:cr1:p_1475136016193/xml

(..)

Caused by: java.io.FileNotFoundException: https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.x/profiles/clarin.eu:cr1:p_1475136016193/xml

Full harvesting log: Language_Archive_Cologne.log.txt
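
For anyone triaging similar harvest logs, here is a minimal sketch that pulls the missing profile IDs out of the `FileNotFoundException` lines. The function name `missing_profiles` and the `sample` list are hypothetical helpers made up for illustration, not part of the harvester. The regular expression assumes that profile IDs always look like `clarin.eu:cr1:p_<digits>`, as in the log above.

```python
import re

# Assumption: profile IDs follow the shape seen in the log above,
# e.g. clarin.eu:cr1:p_1475136016193
PROFILE_ID = re.compile(r"clarin\.eu:cr1:p_\d+")


def missing_profiles(log_lines):
    """Return the sorted, de-duplicated profile IDs named in FileNotFoundException lines."""
    found = set()
    for line in log_lines:
        # Only the innermost cause tells us the profile could not be fetched
        if "java.io.FileNotFoundException" in line:
            found.update(PROFILE_ID.findall(line))
    return sorted(found)


# Two lines copied from the log excerpt above
sample = [
    "2024-11-25T19:01:09,186 ERROR [Language Archive Cologne] TransformAction - "
    "Exception thrown by URIResolver; Line#: -1; Column#: -1",
    "Caused by: java.io.FileNotFoundException: https://catalog.clarin.eu/ds/"
    "ComponentRegistry/rest/registry/1.x/profiles/clarin.eu:cr1:p_1475136016193/xml",
]

print(missing_profiles(sample))  # ['clarin.eu:cr1:p_1475136016193']
```

Run against the full harvesting log, this would list every profile that the transformation step failed to fetch, which makes it easier to tell whether one deleted profile or several are involved.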

@twagoo
Member Author

twagoo commented Nov 26, 2024

It seems that an older version of the BLAM-bundle-repository profile has been deleted from the Component Registry. The CMDI 1.1 record contains:

<Components>
   <BLAM-bundle-repository-v0.14>
      <BundleGeneralInfo>
         <BundleID IdentifierType="Handle">hdl:11341/0000-0000-0000-2713</BundleID>

BLAM-bundle-repository_v1.0 was published recently: https://catalog.clarin.eu/ds/ComponentRegistry#/?itemId=clarin.eu%3Acr1%3Ap_1721373444016&registrySpace=public

So it seems that the records need to be updated to make use of this profile.
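
To compare the old and new identifiers, a rough sketch like the following extracts the profile ID from the Component Registry link above and builds the matching REST URL. The helper names `profile_id_from_registry_link` and `profile_xml_url` are hypothetical. The URL template is copied from the failing request in the harvest log, and it is an assumption that the new profile is served under the same pattern.

```python
from urllib.parse import parse_qs, urlsplit

# Pattern taken from the failing request in the harvest log.
# Assumption: the new profile is exposed under the same path layout.
REST_TEMPLATE = (
    "https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.x"
    "/profiles/{profile_id}/xml"
)


def profile_id_from_registry_link(link):
    """Extract the itemId parameter from a Component Registry browser link."""
    # The parameters sit inside the URL fragment: "/?itemId=...&registrySpace=public"
    fragment = urlsplit(link).fragment
    query = fragment.partition("?")[2]
    # parse_qs also decodes percent-escapes such as %3A -> ":"
    return parse_qs(query)["itemId"][0]


def profile_xml_url(profile_id):
    """Build the REST URL that a transformation step would request for a profile."""
    return REST_TEMPLATE.format(profile_id=profile_id)


link = (
    "https://catalog.clarin.eu/ds/ComponentRegistry#/"
    "?itemId=clarin.eu%3Acr1%3Ap_1721373444016&registrySpace=public"
)
new_id = profile_id_from_registry_link(link)
print(new_id)                   # clarin.eu:cr1:p_1721373444016
print(profile_xml_url(new_id))
```

Note that swapping the ID alone is probably not enough: the record above uses a root component named `BLAM-bundle-repository-v0.14`, so the record content would likely need to match the new profile as well.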

@twagoo twagoo changed the title Issue during harvest for Language Archive Cologne Language Archive Cologne records reference metadata profile that no longer exists Nov 26, 2024
@twagoo
Member Author

twagoo commented Nov 26, 2024

Status update: Felix (@fxru) reported that this is indeed the result of a changed identifier for the profile they are using. They are aware of it, but it might take some time to fix. Keeping the issue open in the meantime.

@twagoo
Member Author

twagoo commented Nov 26, 2024

The logs confirm that the referenced profile was deleted on 13 November.
