-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Word lists #38
Comments
Thank you for your careful proofreading. Duplicates like It's ok for I'm sorry, but I don't understand when you say:
For the unaccented words in the dedicated file, I think it will be necessary to proofread them one by one, and to suppress those which have no reason to be there. I will do this as soon as possible. |
Thank you for your comments and your indefatigable work. As far as I can see, the file |
Yes you're right. I did not add any documentation on this precise topic. These are homographs that have a different hyphenation depending on the place of the accent (this is well documented in the homograph section of the doc). As the model without accent can not agree for both cases, there is inevitably an error message if the accented words are put in the dedicated file. By splitting it into the file that gives the correct hyphenation, I wanted to avoid that it falls into oblivion. To do well, I would have to do the same for |
OK, I see. Four of the eight forms mentioned above seem to be derived from tran(s)-sero, but what is the origin of I found some more duplicates in the word list: castricius, nympheum, pœniceus, angulus, and Daphne. |
There are several compounds of iacio in the list, e.g. ab-icio with its medieval variant ab-jicio. I think it is not good to write ab-iicio and the like. As far as I know, the double i variant does not exist. |
There are many forms of the rare verb abstruo in the list. I suppose the radix is struo and not truo, so I suggest do hyphenate |
Those are passive forms of transeo. See here for more details if you want.
Ok, I will clean all soon.
I think so too. It would be
I agree to the fact that |
For this particular point, it may be useful to open a specific issue I think. |
Are you sure the forms mentioned there are correct? |
I can't say that I'm sure as if I had studied the question in depth, but I trust this source that is usually correct.Indeed, it gives only the existing forms usually, not those that an automatic lemmatizer could deduce from algorithms. Insofar as these are passive forms and a priori infrequent, I did not see fit to dig deeper into the question. I do not know what your grammar is, probably the authors have good reason to mention what you say. I wanted to be as complete as possible by including these passive forms that may be disputable, in case… But one solution could be to remove them. |
I proofread the list of unaccented words. Everything is fixed, including the corresponding patterns as needed. Note that I deleted 4 words that I did not find anywhere in the consulted dictionaries:
One word had two syllables and so an accent is useless: I left out words that begin with |
I have the impression that those passive forms have been created automatically and then corrected by hand for the third person singular only (ire without prefix only has an impersonal passive). It is good to be as comprehensive as possible, but we should be careful not to propagate non-existing word forms. Internet sources tend to be incomplete or faulty, even if they may give good results in other cases. I suggest to replace |
I agree. This solution is consistent with your source, and in addition removes homographs. |
How can this be an inflected form of abs-trudo? Where is the d gone? |
|
I found an article about abstruo in the Archiv für lateinische Lexikographie und Grammatik, a series that appeared in the late 19th century to prepare the Thesaurus linguae Latinae. |
Ok, so it would be good to clean the word list, I see that I included I reopen this issue as a memento. |
I am not sure, how those “ghost words” shall be treated. abstruo is certainly not the only one. For example, when I looked for the etymology of prodicius/proditius (both listed in Gaffiot) in ThLL, I did not find proditius and for prodicius it was stated, that it is a faulty conjecture of a certain editor in a certain text. Ghost words will never occur in modern text editions, but their hyphenation might be needed when old text editions or dictionaries are retypeset, as it was the case for the Gaffiot 2016. Some of them, who arose early enough, may even occur as original words in medieval texts, e.g. here for abstruere. So it might be safer to keep them. |
There is one duplicate in
wordlist-liturgical.txt
: eucharistia.As far as I can see, the entry
a-bi-ens
is wrong. This is a participle of ab-ire.There are eight accented forms in the file, all of them beginning with
trans
.There is also one duplicate in
wordlist-liturgical-accents.txt
: transpadáneus.There are some entries with three or more syllables not having an accent in the file:
abstergo
adieuntis
(strange form, does it exist?)altertra
astasti
astastis
compinguescerent
cumscribillo
discrepentia
displuvia
distrivsti
distrivstis
elanguescam
elanguescerent
epithalamus
eschato
euhias
exsanguescerent
horreus
impinguescerent
impinguescetis
inexstinguibilis
inexstinguibiliter
interstinguant
languescerent
languoris
linguacioris
linguosioris
longivus
obfidire
obiex
perieam
perieamus
perieant
perieas
perieatis
periee
(also strange)perieim
perieimus
perieint
perieis
perieit
perieitis
perieo
perieunt
periur
perunguerent
pinguescerent
præaudio
præiens
præobturans
præstruo
relanguescerent
respire
sanguinolentus
satisaccipere
semetipsum
sorbuis
substinguitur
superescit
superimpedens
suscribere
The text was updated successfully, but these errors were encountered: