Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

EUNIS datasource outdated (updated exactly 10 years ago) #81

Closed
abubelinha opened this issue Feb 9, 2022 · 9 comments
Closed

EUNIS datasource outdated (updated exactly 10 years ago) #81

abubelinha opened this issue Feb 9, 2022 · 9 comments

Comments

@abubelinha
Copy link

abubelinha commented Feb 9, 2022

Regarding the example in #80, the mentioned EUNIS datasource results are very outdated.

In the example provided, verifier shows both "currentRecordid" & "recordId="144750512", which does not match the taxon id 316085 shown in the EUNIS url ... where Isoetes longissimum is reported as a synonym of I. longissima

Is it possible to update that datasource? I think there is a pretty recent version in COL CheckListBank (already contains the above example reported as a synonym, and shows the same id as EUNIS).

This datasource is pretty interesting for European regional checklists.

Thanks a lot in advance !!

@dimus
Copy link
Member

dimus commented Feb 9, 2022

Looks like they do have a dump, at least in RDF from 2020, so I can use it to update EUNIS. I do have to deal with other things for a few months though. Usually I reserve time in December/January, May/June for data updates

@abubelinha
Copy link
Author

It's a shame I didn't realize before #62
I'll put it in my wishlist for May ;)

@dimus
Copy link
Member

dimus commented Feb 11, 2022

Do they have DwCA file at GBIF? if yes, I can stick it in fast

@abubelinha
Copy link
Author

Actually they do, but it might be the same old version:
https://www.gbif.org/dataset/1bd42c2b-b58a-4a01-816b-bec8c8977927
Publication date January 1, 2010

Perhaps you get it from there too, 2 years later?
It doesn't seem to have synonyms links between taxon ids.
https://www.gbif.org/species/101280862/verbatim

I actually discovered it here at gnames, and then searched for it at COL to pass you the links.
Why don't you use the download options at COL? (I think one of them is DwCA)

@dimus
Copy link
Member

dimus commented Feb 11, 2022

ah ok, yes, with all probability it is the same file that I used. OK, i'll add EUNIS it to the queue for May

@abubelinha
Copy link
Author

abubelinha commented Feb 12, 2022

It's up to you. But I am curious about the reason to not using COL EUNIS-DwCA download file.

Looks like it contains the aforementioned synonyms (extract of Taxon.tsv core data file):

acceptedNameUsageID scientificNameID taxonomicStatus taxonRank scientificName
194389 accepted species Isoetes longissima Bory
194389 316085 synonym species Isoetes longissimum Bory

Is this COL-DwCA format not gnames-suitable for any particular reason, compared to the one in GBIF? (missing important dwca-extension or whatever).
If so I would like to know ... as I might end up needing to use COL dwca downloads myself for matching gnames resolver taxon ids (#85)

EDIT: maybe with "if yes, I can stick it in fast" you meant you can use COL file, but fast=May (I assumed fast=before May)

@abubelinha
Copy link
Author

abubelinha commented Aug 25, 2022

ah ok, yes, with all probability it is the same file that I used. OK, i'll add EUNIS it to the queue for May

Just wondering ... was this forgotten, or is there any problem with the DwCA file?
Thanks

@dimus
Copy link
Member

dimus commented Aug 26, 2022

Thanks for reminding about this one @abubelinha, I did forget about it. I will get back to it sometime in the second half of September.

@dimus
Copy link
Member

dimus commented Sep 2, 2022

EUNIS is updated now https://verifier.globalnames.org/data_sources/158

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants