Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Chinese Wordnet doc translation #70

Open
wants to merge 2 commits into
base: master
Choose a base branch
from

Conversation

yoyo-go
Copy link
Contributor

@yoyo-go yoyo-go commented Dec 20, 2020

Empty framework created for doc_zh.py

I have downloaded the Chinese Wordnet, and read the Building the Chinese Open Wordnet (COW): Starting from Core Synsets and the Cross-lingual Portability of Semantic Relations: Bootstrapping Chinese WordNet with English WordNet Relations papers, but I still have a few questions before translating:

  1. The Chinese Wordnet may not have all the relations in PWN, where can I figure out which relations to include and which to delete?
  2. Are there any glossary terms of EN-CH so I can refer on the name of the relations or the definitions? Or I just translate the terms by myself?
  3. How can I find the linked English synset when dealing with the Chinese examples? Since the Chinese Wordnet only have lexical entries like this:
<LexicalEntry id="cmnwn-lex21360">
      <Lemma writtenForm="基金" partOfSpeech="n"/>
      <Sense id="cmnwn-lex21360--13356985-n" synset="cmnwn-13356985-n"></Sense>
      <Sense id="cmnwn-lex21360--13358549-n" synset="cmnwn-13358549-n"></Sense>
</LexicalEntry>

If I would like to find an example like in Chinese, word A is a hypernym of word B, how should I achieve that?

Empty framework for doc_zh.py
@yoyo-go yoyo-go requested a review from fcbond December 20, 2020 15:04
@yoyo-go yoyo-go changed the title Empty framework created Chinese Wordnet doc translation Dec 20, 2020
@fcbond
Copy link
Member

fcbond commented Dec 27, 2020

Hi,

Empty framework created for doc_zh.py

I have downloaded the Chinese Wordnet, and read the Building the Chinese Open Wordnet (COW): Starting from Core Synsets and the Cross-lingual Portability of Semantic Relations: Bootstrapping Chinese WordNet with English WordNet Relations papers, but I still have a few questions before translating:

  1. The Chinese Wordnet may not have all the relations in PWN, where can I figure out which relations to include and which to delete?

Because Chinese speakers may use other wordnets, ideally all the relations should be defined. But maybe start with the constitutive ones.

  1. Are there any glossary terms of EN-CH so I can refer on the name of the relations or the definitions? Or I just translate the terms by myself?

I don't know of any good glossary. Maybe take a look at a Chinese lexical semantic textbook, or papers about one of the Chinese wordnets?

  1. How can I find the linked English synset when dealing with the Chinese examples? Since the Chinese Wordnet only have lexical entries like this:

Once you have loaded the lexicon in the wn module, you can use translate:

>>> wn.synsets('word', lexicon='ewn')[0].translate(lexicon='cmnwn:1.3+omw')
[Word('cmnwn-lex65514')]

to get the lemmas:

wn.synsets('word', lexicon='ewn')[0].translate(lexicon='cmnwn:1.3+omw')
['表达']

or the ILI:

wn.synsets('word', lexicon='ewn')[0].translate(lexicon='cmnwn:1.3+omw')[0].ili
'i26490'
<LexicalEntry id="cmnwn-lex21360">
      <Lemma writtenForm="基金" partOfSpeech="n"/>
      <Sense id="cmnwn-lex21360--13356985-n" synset="cmnwn-13356985-n"></Sense>
      <Sense id="cmnwn-lex21360--13358549-n" synset="cmnwn-13358549-n"></Sense>
</LexicalEntry>

If I would like to find an example like in Chinese, word A is a hypernym of word B, how should I achieve that?

This prints word, hypernym, word's ili and hypernym's ili.

>>> for ss in wn.synsets(lexicon='cmnwn:1.3+omw'):
...   for h in ss.hypernyms():
...     for sl in ss.lemmas():
...       for hl in h.lemmas():
...         print(sl, hl, ss.ili, h.ili, sep='\t')

All the relations existed in English docs has been translated, and I have tried my best to find examples in cmnwn.
@yoyo-go
Copy link
Contributor Author

yoyo-go commented Feb 6, 2021

Hi @fcbond,

I have finished the Chinese docs translation, but I am not sure about the translation of relation names, I find it hard to make them both meaningful and easy to understand, So just take it as a first version.

I will help to maintain it if there are new updates in relations or better translation of the names in the future.

The commit has a conflict, do I need to make a new pull request on that?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants