This repository has been archived by the owner on Apr 29, 2022. It is now read-only.

Add arXiv data to database #45

Closed
dasaderi opened this issue Sep 24, 2019 · 4 comments

Comments

@dasaderi
Member

No description provided.

@blahah
Contributor

blahah commented Sep 30, 2019

This data was initially added to the db and then removed because of problems with the arXiv metadata. Specifically, some author names and titles contain LaTeX markup that needs to be converted into the correct UTF-8 characters. I haven't yet found a good way to do this consistently.
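
A minimal sketch of the kind of conversion described above, assuming only a handful of common accent and symbol macros need handling. The function name `latex_to_unicode` and the macro tables are hypothetical and are not taken from this repository; real arXiv metadata contains many more macros, plus math, which this leaves untouched.

```python
import re
import unicodedata

# LaTeX accent commands mapped to Unicode combining characters (partial list, an assumption).
COMBINING = {
    "'": "\u0301",  # acute
    "`": "\u0300",  # grave
    "^": "\u0302",  # circumflex
    '"': "\u0308",  # diaeresis
    "~": "\u0303",  # tilde
    "c": "\u0327",  # cedilla
    "v": "\u030c",  # caron
    "H": "\u030b",  # double acute
}

# Standalone letter macros (partial list, an assumption).
SYMBOLS = {"ss": "ß", "ae": "æ", "AE": "Æ", "o": "ø", "O": "Ø", "l": "ł", "L": "Ł"}

# A base letter: a plain ASCII letter, or the dotless \i / \j macros.
_BASE = r"(\\[ij](?![A-Za-z])|[A-Za-z])"

# Matches forms such as \"o, \"{o}, \c{c}, \c c and \'{\i}.
ACCENT_RE = re.compile(
    r"""\\(['`^"~]|[cvH](?![A-Za-z]))\s*(?:\{""" + _BASE + r"\}|" + _BASE + ")"
)

# Matches \ss, \o, \L and friends, swallowing a trailing {} or whitespace as LaTeX does.
SYMBOL_RE = re.compile(r"\\(ss|ae|AE|o|O|l|L)(?![A-Za-z])(?:\{\}|\s+)?")

# Braces left around a single converted (non-ASCII) character, e.g. {é}.
WRAPPED_RE = re.compile(r"\{([^\x00-\x7f])\}")


def latex_to_unicode(text: str) -> str:
    """Convert a small subset of LaTeX accent markup to composed Unicode."""

    def accent(match: re.Match) -> str:
        base = (match.group(2) or match.group(3)).lstrip("\\")  # \i -> i, \j -> j
        # Compose base letter + combining mark into a single code point where one exists.
        return unicodedata.normalize("NFC", base + COMBINING[match.group(1)])

    text = ACCENT_RE.sub(accent, text)
    text = SYMBOL_RE.sub(lambda m: SYMBOLS[m.group(1)], text)
    return WRAPPED_RE.sub(r"\1", text)


print(latex_to_unicode(r'Schr\"odinger and Poincar{\'e}'))
```

Unrecognised macros such as `\cite{x}` pass through unchanged, so the remaining backslashes can be searched for afterwards to see which records still need attention.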

@dasaderi
Member Author

Update after call with WinGravity:

This is something we should do only if we want to fix the search issue (see issue #64). Currently, not all of the arXiv data is in the database because it's huge and Rik did not finish importing it (as far as I understand).

@blahah
Contributor

blahah commented Nov 23, 2019

The issue with arXiv was that the metadata contains LaTeX markup, and I didn't find a nice solution for rendering it well. I have just seen this potential solution: PREreview/getpreprints#6

@blahah
Contributor

blahah commented Mar 3, 2020

If this was solved, would you mind sharing the solution?
