Frequent frames analysis pipeline

Details

For more information about the ACQDIV project, see: http://www.acqdiv.uzh.ch/. The research leading to these results has received funding from the European Union’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no 615988 (PI Sabine Stoll).

Pipeline

Generate the bigrams

Run the script bigrams.py to generate the bigram frequencies. Note that this may take a long time.
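As a rough illustration of what a bigram frequency count involves, here is a minimal sketch. It is not the code in bigrams.py: the function name `count_bigrams` and the toy utterances are hypothetical, and it assumes that each utterance is already tokenized into a list of strings and that bigrams do not cross utterance boundaries.

```python
from collections import Counter


def count_bigrams(utterances):
    """Count adjacent token pairs within each utterance (hypothetical helper)."""
    counts = Counter()
    for tokens in utterances:
        # zip pairs each token with its right-hand neighbour;
        # utterances shorter than two tokens contribute nothing.
        counts.update(zip(tokens, tokens[1:]))
    return counts


# Toy input, not taken from the ACQDIV data.
utterances = [["you", "want", "it"], ["you", "want", "that"]]
bigram_counts = count_bigrams(utterances)
```

On a full corpus this loop visits every token once, so most of the run time is likely to come from reading the data rather than from the counting itself.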

Generate the trigram counts for each analysis

Generate the trigram counts separately for each analysis (words, morphemes, dyads, etc.). These counts are used to calculate the frame frequencies and for operationalization.

Call: python3 utils.py

Make sure to set the analysis level (words or morphemes) in the script before running it!
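The link between trigram counts and frames can be sketched as follows, assuming the usual definition of a frame as two items that co-occur with exactly one item between them. This is not the code in utils.py: `count_frames` and the toy utterances are hypothetical, and the same idea would apply to morphemes in place of words.

```python
from collections import Counter, defaultdict


def count_frames(utterances):
    """Count A_x_B frames and their fillers from trigrams (hypothetical helper)."""
    frame_counts = Counter()        # (A, B) -> number of trigrams with that frame
    fillers = defaultdict(Counter)  # (A, B) -> counts of the middle item x
    for tokens in utterances:
        # Slide a window of three adjacent tokens over the utterance.
        for first, middle, last in zip(tokens, tokens[1:], tokens[2:]):
            frame_counts[(first, last)] += 1
            fillers[(first, last)][middle] += 1
    return frame_counts, fillers


# Toy input, not taken from the ACQDIV data.
utterances = [
    ["you", "want", "it"],
    ["you", "see", "it"],
    ["you", "want", "that"],
]
frame_counts, fillers = count_frames(utterances)
```

Sorting `frame_counts` by value then gives the most frequent frames, and `fillers` records which items each frame contains.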

For global recall (DB), get the word types:

Call: sh get_types.sh

Run the frames processing and PR analysis

This step requires that the bigrams have already been calculated and pickled. Then run the frames analysis with frames.py, which outputs a TSV file that can be loaded into R.
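The hand-off described here (pickled counts in, TSV out) can be sketched with the standard library alone. This is an illustration under stated assumptions, not the code in frames.py: the column names are hypothetical, and in-memory buffers stand in for the real pickle and TSV files.

```python
import csv
import io
import pickle
from collections import Counter

# Toy counts standing in for the output of the bigram step.
bigram_counts = Counter({("you", "want"): 2, ("want", "it"): 1})

# Pickle round trip: serialize the counts, then restore them.
payload = pickle.dumps(bigram_counts)
restored = pickle.loads(payload)

# Write one tab-separated row per bigram, with a header row.
buffer = io.StringIO()
writer = csv.writer(buffer, delimiter="\t", lineterminator="\n")
writer.writerow(["first", "second", "count"])  # hypothetical column names
for (first, second), count in sorted(restored.items()):
    writer.writerow([first, second, count])
tsv_text = buffer.getvalue()
```

A file written this way can be read in R with `read.delim`, which expects tab-separated input with a header row by default.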

Make sure to set the analysis level (words or morphemes, and for morphemes POS or glosses) in the script!
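If "PR" is read as precision and recall, one common way to score frames is pairwise: every pair of filler tokens is compared by frame and by category. The sketch below follows that reading. It is an assumption about the analysis, not a description of frames.py, and the function name, input layout and toy labels are all hypothetical.

```python
from itertools import combinations


def pairwise_precision_recall(tokens):
    """Pairwise precision and recall over (frame, category) pairs.

    hit:         same frame, same category
    false alarm: same frame, different category
    miss:        different frame, same category
    """
    hits = false_alarms = misses = 0
    for (frame1, cat1), (frame2, cat2) in combinations(tokens, 2):
        same_frame = frame1 == frame2
        same_category = cat1 == cat2
        if same_frame and same_category:
            hits += 1
        elif same_frame:
            false_alarms += 1
        elif same_category:
            misses += 1
    # Guard against empty denominators when no pair qualifies.
    precision = hits / (hits + false_alarms) if hits + false_alarms else 0.0
    recall = hits / (hits + misses) if hits + misses else 0.0
    return precision, recall


# Toy input: one entry per filler token, as (frame, category).
tokens = [
    (("you", "it"), "V"),
    (("you", "it"), "V"),
    (("you", "it"), "N"),
    (("the", "is"), "N"),
]
precision, recall = pairwise_precision_recall(tokens)
```

Comparing all pairs is quadratic in the number of tokens, so on large corpora the same quantities would normally be computed from per-frame and per-category counts instead.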

Statistical analysis with R scripts

The frames-functions.R file contains functions for processing and plotting the frequent frames analyses. The words, morphemes and dyads analyses are in words.R, morpheme-gloss-pos.R and dyads.R, respectively.
