Skip to content

Latest commit

 

History

History
23 lines (18 loc) · 795 Bytes

README.md

File metadata and controls

23 lines (18 loc) · 795 Bytes

WikiCount

Parses text from Wikipedia articles and provides most common words used.

Input: URL to an English Wikipedia article.

Output: Most common words used in that article, descending order.

preview

Download

⬇️ Version 1.1

Future Plans

  1. Provide functionality for non-English articles
  2. Implement word-cloud creation functionality for most common words
  3. Allow exportation of results into .csv or .xls file
  4. Show title of article for articles that were generated randomly
    1. Implement quiz-type game (match word list to article name)
  5. Provide support for non-Wikipedia URLs

Libraries

  • JSoup (parses HTML)
  • Apache Commons Lang (3.x)
  • JavaFX 2