diff --git a/README.md b/README.md
index 16a21bd..6afe433 100644
--- a/README.md
+++ b/README.md
@@ -1,4 +1,4 @@
-lexical taggers are for Sahidic Coptic
+lexical taggers for Sahidic Coptic
 ===========================================================
 
 Includes lemmatizer and tagger for language of origin.
diff --git a/language-tagger/lexical-tagger-August2014-release-notes.txt b/language-tagger/lexical-tagger-August2014-release-notes.txt
index f447174..22870c8 100644
--- a/language-tagger/lexical-tagger-August2014-release-notes.txt
+++ b/language-tagger/lexical-tagger-August2014-release-notes.txt
@@ -1,5 +1,10 @@
+v. 1.21:
+Words containing the mnt/ref/at/r morphs that were in the v. 1.1 release were re-added, for use by other projects that might not annotate on the morph level.
+
+v. 1.22
 Words containing the mnt/ref/at/r morphs are removed since we are now tagging for language of origin on the morpheme level
 
-vocabulary from Mark 12-16 and 1 Cor 1-5 added
+Vocabulary from Mark 12-16 and 1 Cor 1-5 added.
+
+Where relevant, iotas that are also epsilon-iotas in Coptic have been normalized to epsilon-iota (e.g., ⲁⲣⲭⲓ/arxi to ⲁⲣⲭⲉⲓ/arxei)
 
-Where relevant, iotas that are also epsilon-iotas in Coptic have been normalized to epsilon-iota (e.g., ⲁⲣⲭⲓ/arxi to ⲁⲣⲭⲉⲓ/arxei)
\ No newline at end of file
diff --git a/language-tagger/lexicon.txt b/language-tagger/lexicon.txt
index 30f13df..f0ca02b 100644
--- a/language-tagger/lexicon.txt
+++ b/language-tagger/lexicon.txt
@@ -150,6 +150,10 @@
 ⲙⲏ Greek
 ⲙⲏⲡⲟⲧⲉ Greek
 ⲙⲏⲧⲓ Greek
+ⲙⲛⲧϩⲁⲡⲗⲟⲩⲥ Greek
+ⲙⲛⲧⲁⲅⲁⲑⲟⲥ Greek
+ⲙⲛⲧⲣⲉϥϩⲉⲧⲃⲯⲩⲭⲏ Greek
+ⲙⲛⲧⲧⲉⲗⲓⲟⲥ Greek
 ⲙⲟⲛⲁⲭⲟⲥ Greek
 ⲙⲟⲛⲟⲛ Greek
 ⲙⲩⲥⲧⲏⲣⲓⲟⲛ Greek
@@ -201,6 +205,7 @@
 ⲡⲣⲟⲑⲉⲥⲓⲥ Greek
 ⲡⲣⲟⲥⲕⲁⲣⲧⲉⲣⲓⲁ Greek
 ⲡⲣⲟⲫⲏⲧⲏⲥ Greek
+ⲣⲭⲣⲉⲓⲁ Greek
 ⲥⲁⲛⲇⲁⲗⲓⲟⲛ Greek
 ⲥⲁⲣⲕⲓⲕⲟⲛ Greek
 ⲥⲁⲣⲝ Greek