Next round of adjustments #2

Draft
wants to merge 32 commits into
base: master
Conversation

@ulf1 ulf1 commented Nov 14, 2020

No description provided.

@ulf1 ulf1 requested a review from rknaebel November 14, 2020 19:34
setup.py Outdated
@@ -30,4 +30,6 @@
'console_scripts': [
'ims-nparser = imsnpars.cli:main',
],
})
},
python_requires=">=3.7"
Author

Does this really have to be 3.7? Does 3.6 work as well?

I believe this came from the original repository and was simply carried over. If you run the tests against older versions, we are happy to change it.

Author

@ulf1 ulf1 Nov 15, 2020

So far, both the fork and the original repo mention version 3.7 only in README.md.
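
For illustration, a minimal sketch of what the `python_requires=">=3.7"` constraint would mean for a 3.6 interpreter. It assumes only simple `>=X.Y` specifiers; the helper names `parse_min_version` and `satisfies` are hypothetical and not part of this repository. pip evaluates full PEP 440 specifiers, which this sketch does not attempt.

```python
import sys


def parse_min_version(spec):
    """Parse a simple '>=X.Y' specifier into a tuple such as (3, 7).

    Assumption: only the '>=' operator is supported in this sketch.
    """
    spec = spec.strip()
    if not spec.startswith(">="):
        raise ValueError("only '>=X.Y' specifiers are handled here")
    return tuple(int(part) for part in spec[2:].strip().split("."))


def satisfies(spec, version_info=sys.version_info):
    """Return True if the given interpreter version meets the specifier."""
    minimum = parse_min_version(spec)
    # Compare only as many components as the specifier names.
    return tuple(version_info[:len(minimum)]) >= minimum


if __name__ == "__main__":
    # Reports whether the running interpreter meets the current constraint.
    print(satisfies(">=3.7"))
```

Relaxing the constraint to `>=3.6` would only be justified once the test suite has actually passed on 3.6, as discussed in the comments above.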

README.md Outdated
### Download training data and serialized model

```sh
dvc pull
Author

dvc pull => ERROR: failed to pull data from the cloud - config file error: no remote specified. Create a default remote with dvc remote add -d <remote name> <remote url>

Author

Where is the data stored? Is it public? If a password or token is required, README.md must state who the contact person is.

Which data do you mean exactly? The parser is generally independent of the data; the data only needs to be in CoNLL format. For the current models we used the Hamburg Dependency Treebank (HDT): https://corpora.uni-hamburg.de/hzsk/de/islandora/object/treebank:hdt

Author

Why is there a .dvc folder? For example, https://github.com/zentrum-lexikographie/IMSnPars/tree/master/data/hdt contains nothing but DVC references to conllu files, which should actually be stored on a remote server.

setup.py Outdated
'conllu>=3.1.1',
'Click>=7.1.2',
'psutil>=5.7.2',
'ray>=0.8.7',
Author

Is there a major difference (breaking change) between ray 0.* and ray 1.*?

@gremid knows more about that.

@rknaebel rknaebel marked this pull request as draft November 15, 2020 17:18
pip install -r requirements-dev.txt --use-feature=2020-resolver
```

### Download training data and serialized model
Author

DVC requires an SSH account.

  • Option 1: public www folder on odo
  • Option 2: persistent ID (data publication)
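
The `dvc pull` error reported earlier in this review ("no remote specified") suggests that `.dvc/config` has no default remote. Below is a minimal sketch for checking this, assuming DVC's INI-style config layout with a `remote` key in the `[core]` section. The function name `default_remote`, the remote name `storage`, and the URL are hypothetical placeholders, not values from this repository.

```python
import configparser


def default_remote(config_text):
    """Return the default remote name from DVC config text, or None.

    Assumption: the default remote is stored as `remote = <name>`
    in the `[core]` section of `.dvc/config`.
    """
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(config_text)
    return parser.get("core", "remote", fallback=None)


# Hypothetical config without a default remote (the failing case).
WITHOUT_REMOTE = "[core]\n    analytics = false\n"

# Hypothetical config after a default remote has been added.
WITH_REMOTE = (
    "[core]\n"
    "    remote = storage\n"
    "['remote \"storage\"']\n"
    "    url = https://example.org/public/dvc\n"
)

if __name__ == "__main__":
    print(default_remote(WITHOUT_REMOTE))
    print(default_remote(WITH_REMOTE))
```

Whichever option is chosen, README.md could then document the resulting remote URL and, if access is restricted, the contact person.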

2 participants