Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hyphenization character should probably be ¬ #44

Open
PonteIneptique opened this issue Feb 2, 2022 · 2 comments
Open

Hyphenization character should probably be ¬ #44

PonteIneptique opened this issue Feb 2, 2022 · 2 comments

Comments

@PonteIneptique
Copy link
Contributor

At the end of each line with hyphenization, you used ', maybe it should be ¬

@PonteIneptique
Copy link
Contributor Author

This is just for your information, it's probably doable to change this automatically.

@malamatenia
Copy link
Contributor

You're right, I tried to follow a more true-to-the-manuscript approach but it can actually create conflict with other usages of '. With a small check in of the .txt file for the verification corpus, ' is also used on two occasions as part of abbreviations (qd' for quod Mazarine_Inc59, verification_corpus text_Inc59_21-27.txt line 163 and istd' for istud respectively at line 173).

This means that since we are sure that ' equals hyphenation at the end of the line in all instances, it should be replaced by ¬ to avoid redundant noise and inaccuracies should someone wants to analyze the punctuation and signs in the text (that would probably be me in the future so better take care of this now).

The same holds true for the question marks, transcribed unanimously as : which creates conflict with the : sign used as a separator between primary and secondary phrases (,).

Having said this, I will be making all the changes to update the .txt, .XML files, the csv table and will add a clearer explanation to the documentation report by the end of the week.

Thank you for this comment Professor 👍 !!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants