make this an importable lib #14

schlitzered · 2020-07-16T08:16:48Z

hi, i find the tool pretty use full, and it would be nice if you could make this a lib, with a stable interface, that can be imported into other projects.

for this i would suggest that the logic to choose "corpus" should move into find_acronyms

mshemuni · 2022-11-17T19:46:55Z

I'd say it is.
Just looking at the code one can see the acronym can be used as:

import nltk
from acronym.acronym import find_acronyms

ac.acronym.find_acronyms("Hello World", nltk.corpus.gutenberg, min_length=2)

Output:


Collecting word corpus
Identifying matching acronyms
Process Complete
        long_version  score
acronym
HOWL     HellO WorLd     18
HEW      HEllo World     15
HOOD     HellO wOrlD     15
HOW      HellO World     15
HELD     HEllo worLD     13
HERD     HEllo woRlD     13
HOLD     HellO worLD     13
HOD      HellO worlD     10
HOO      HellO wOrld     10
HER      HEllo woRld      8
HOR      HellO woRld      8
HO       Hello wOrld      5

see:

acronym/acronym/acronym.py

Line 102 in 584c844

def find_acronyms(s, corpus, min_length=5, max_length=7):

One can change corpus

nltk.corpus.words
nltk.corpus.brown
nltk.corpus.gutenberg

Do not forget to change max and min length. In my example 5 was too long and the output was empty DataFrame.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

make this an importable lib #14

make this an importable lib #14

schlitzered commented Jul 16, 2020 •

edited

Loading

mshemuni commented Nov 17, 2022 •

edited

Loading

make this an importable lib #14

make this an importable lib #14

Comments

schlitzered commented Jul 16, 2020 • edited Loading

mshemuni commented Nov 17, 2022 • edited Loading

schlitzered commented Jul 16, 2020 •

edited

Loading

mshemuni commented Nov 17, 2022 •

edited

Loading