Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset Collector #1, Egorova Anastasia - 19FPL2 #47

Open
wants to merge 18 commits into
base: main
Choose a base branch
from

Conversation

egorova-ai19-2
Copy link

No description provided.

@egorova-ai19-2 egorova-ai19-2 added the Review Required You are ready for next iteration of review label Mar 12, 2021
@dmitry-uraev dmitry-uraev added 🕷️Crawler accepted and removed Review Required You are ready for next iteration of review labels Mar 15, 2021
@egorova-ai19-2 egorova-ai19-2 added the Review Required You are ready for next iteration of review label Mar 31, 2021
Copy link
Contributor

@dmitry-uraev dmitry-uraev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good! I think you are done here. But you may fix optional pylint addition and get some reward)

pipeline.py Outdated
return "MorphologicalToken instance here"
return f'{self.normalized_form}<{self.mystem_tags}>({self.pymorphy_tags})'

def public_method(self):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You try to fix lint by using unnecessary stubs. Actually, it would be better to change lint params. In that case lint check would not pay any attention to "to few public methods" case. And you would not have to change given interface. If you want, you may fix this one. Tip: pylint settings are in .pylintrc files:)

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for the tip) I've fixed this error.

pass
self.original_word = original_word
self.normalized_form = normalized_form
self.mystem_tags = ''
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why str? may be list? or other data type

pipeline.py Outdated

def get_articles(self):
"""
Returns storage params
"""
return self._storage

def public_method(self):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same as above

@@ -2,7 +2,13 @@
Pipeline for text processing implementation
"""

import os
from pathlib import Path, PurePath
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice

pipeline.py Outdated

return tokens

def public_method(self):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same

@dmitry-uraev dmitry-uraev removed the Review Required You are ready for next iteration of review label Apr 2, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants