For more information about the ACQDIV project, see: http://www.acqdiv.uzh.ch/. The research leading to these results has received funding from the European Union’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 615988 (PI Sabine Stoll).
- Generate the bigrams
Run the script bigrams.py to generate the bigram frequencies. Note that this may take a long time.
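As a rough illustration of what this step produces, the sketch below counts bigrams and round-trips them through pickle. It is not the code in bigrams.py: the function name `count_bigrams`, the toy utterances, and the choice to count only within utterance boundaries are assumptions made for the example.

```python
import pickle
from collections import Counter


def count_bigrams(utterances):
    """Count adjacent token pairs inside each utterance.

    Assumption: pairs are not counted across utterance boundaries.
    """
    counts = Counter()
    for tokens in utterances:
        # zip pairs each token with its right-hand neighbour
        counts.update(zip(tokens, tokens[1:]))
    return counts


# Toy input (hypothetical); the real data come from the ACQDIV database.
utterances = [["you", "want", "it"], ["you", "want", "that"]]
bigram_counts = count_bigrams(utterances)

# Pickle the counts so a later step can reload them without recomputing.
pickled = pickle.dumps(bigram_counts)
restored = pickle.loads(pickled)
```

Pickling matters here because the counting pass is the slow part; later steps only need to reload the stored counts.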
- Generate the trigram counts for each analysis
Do this for each analysis (words, morphemes, dyads, etc.). This generates the counts used for the frequency calculation of the frames and for operationalization.
Call: python3 utils.py
Make sure to set the words or morphemes in the script!
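The relation between trigram counts and frame frequencies can be sketched as follows. This assumes the usual frequent-frames definition, in which a frame is the pair of outer items around one intervening slot (A_x_B). The function names and toy data are hypothetical and do not reflect the actual contents of utils.py.

```python
from collections import Counter


def count_trigrams(utterances):
    """Count three-token windows inside each utterance."""
    counts = Counter()
    for tokens in utterances:
        counts.update(zip(tokens, tokens[1:], tokens[2:]))
    return counts


def frame_counts(trigram_counts):
    """Collapse trigram counts into frame counts.

    Assumption: a frame is the (left, right) pair of a trigram,
    with the middle item treated as the slot filler.
    """
    frames = Counter()
    for (left, _middle, right), n in trigram_counts.items():
        frames[(left, right)] += n
    return frames


# Toy input (hypothetical); swap in words or morphemes as needed.
utterances = [
    ["you", "want", "it"],
    ["you", "like", "it"],
    ["do", "you", "want", "it"],
]
trigrams = count_trigrams(utterances)
frames = frame_counts(trigrams)
```

In this toy data the frame `("you", "it")` is filled by both "want" and "like", which is the kind of grouping the frames analysis builds on.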
- For global recall (DB) get the word types:
Call: sh get_types.sh
- Run the frames processing and PR analysis
This step requires that the bigrams are calculated and pickled first. Then run the frames analysis with frames.py. The script outputs a TSV file to load into R.
Make sure to set words or morphemes (pos or glosses) in the script!
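For orientation, a tab-separated file of the kind R can read might be written as in the sketch below. The column names (`frame`, `target`, `count`), the `you_X_it` label format, and the function `write_frames_tsv` are illustrative assumptions; the actual columns written by frames.py may differ.

```python
import csv
import io


def write_frames_tsv(rows, handle):
    """Write (frame, target, count) rows as tab-separated text.

    Assumption: the column layout is hypothetical, chosen for the example.
    """
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["frame", "target", "count"])
    for frame, target, count in rows:
        writer.writerow([frame, target, count])


# Toy rows (hypothetical); an in-memory buffer stands in for the output file.
rows = [("you_X_it", "want", 2), ("you_X_it", "like", 1)]
buffer = io.StringIO()
write_frames_tsv(rows, buffer)
tsv_text = buffer.getvalue()
```

A file in this shape can be loaded in R with `read.delim`, which expects a header row and tab separators by default.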
- Statistical analysis with R scripts
The frames-functions.R file contains various functions for processing and plotting the frequent frames analyses. The words, morphemes and dyads analyses are in words.R, morpheme-gloss-pos.R and dyads.R.