You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Source text
-----------
You need a copy of the text you want to extract from as plain text.
Source English word list
------------------------
The expected format is a list in lowercase, each line a substantive word.
Filename should be wordsEn.txt or modified in eliminate-common-nouns script.
Such file is available at http://www-01.sil.org/linguistics/wordlists/english/
Usage
-----
./extract-proper-nouns source.txt > nouns.txt
To sort them and eliminate duplicates:
./extract-proper-nouns source.txt | sort | uniq > nouns.txt
To discard known English words:
./eliminate-common-nouns nouns.txt
I guess there are two things to solve:
offer a nice message if the file hasn't been found to explain how to generate it
clarify README to indicate the download of a list of common nouns is mandatory
file not found at line 24" wordsEn.txt "
The text was updated successfully, but these errors were encountered: