You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The code on your github can find proper names but i want this:
For Example;
"Abraham Lincoln Hotel is very beautiful place and i want to go there with
Barbara Palvin. Also there are stores like Adidas ,Nike , Reebok."
The Output should be:
['Abraham Lincoln Hotel'] is very beautiful place and i want to go there with ['Barbara Palvin']. ['Also'] there are stores like ['Adidas'], ['Nike'], ['Reebok'].
As you mentioned words like "Also" isn't a problem for me because i have a lot of dataset to compare these proper names.
First of all i can't use the script because i think there is no model like PunktWordTokenizer anymore in latest version. But i tought your output is like an array full of proper names which are not chunked right?
Also i dont know what to download in ntlk.download() for PunktWordTokenizer
The code on your github can find proper names but i want this:
For Example;
"Abraham Lincoln Hotel is very beautiful place and i want to go there with
Barbara Palvin. Also there are stores like Adidas ,Nike , Reebok."
The Output should be:
['Abraham Lincoln Hotel'] is very beautiful place and i want to go there with ['Barbara Palvin']. ['Also'] there are stores like ['Adidas'], ['Nike'], ['Reebok'].
As you mentioned words like "Also" isn't a problem for me because i have a lot of dataset to compare these proper names.
See Also:
http://stackoverflow.com/questions/36688176/python-group-sequential-array-members
The text was updated successfully, but these errors were encountered: