Code and data accompanying the study "Bangime: Secret Language..."

The code provided here is not in the best state with respect to readability, also because some ideas that have been formulated first in 2017 have now been superceded by new software packages. This being said, we emphasize, however, that it does not mean that the code does not run, and we have confirmed on 20th of November 2021 that the code runs with a fresh virtual environment for all scripts apart from the script C_makemaps.py, which was using the now deprecated basemap library. For now, we decided not to update the code for the map plotting routines, since they are not an essential aspect with respect to the replicability of the study. Given that the data has been made available in the form of a CLDF package, geolocations can be easily plotted with the help of the tools accompanying CLDF, like cldfviz or cldfbench.

Requirements

To install all requirements, use a fresh virtual environment, and just type:

$ pip install -r requirements.txt

Workflow

Drawing the language map (requires basemap)

Note that this part of the code will be difficult to replicate, since basemap has been deprecated, as mentioned above, so you better ignore this part of the code.

Draws the geographic map in the study (the map was manually modified to adjust readability).

$ python C_makemaps.py

Coverage statistics

Extracts the sublist of 300 concepts and 22 languages.

$ python C_coverage.py

Check overlap with other concept lists

We use the pyconcepticon API for this purpose along with the most recent verson of the Concepticon.

$ pip install pyconcepticon
$ git clone https://github.com/concepticon/concepticon-data
$ for i in "Blust-2008-210" "Gregersen-1976-217" "Matisoff-1978-200" "Swadesh-1955-100" "Swadesh-1952-200" "Tadmor-2009-100"; do echo $i `concepticon --repos=concepticon-data --repos-versino=v2.5.1 intersection Hantgan-2021-300.tsv $i | wc -l`  ; done

This yields as output:

Blust-2008-210 125
Gregersen-1976-217 122
Matisoff-1978-200 118
Swadesh-1955-100 72
Swadesh-1952-200 116
Tadmor-2009-100 69

Cognate detection and heatmaps (requires matplotlib)

Extracts cognate sets for two different approaches and compares shared pairwise similarities.

$ python C_lexstat.py

Barcharts

Creates barcharts of shared vocabulary.

$ python C_barcharts.py sca

For the lexstat analysis, write:

$ python C_barcharts.py

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.gitignore		.gitignore
C_barcharts.py		C_barcharts.py
C_coverage.py		C_coverage.py
C_lexstat.py		C_lexstat.py
C_makemaps.py		C_makemaps.py
C_stats.py		C_stats.py
D_base_list.tsv		D_base_list.tsv
D_languages.tsv		D_languages.tsv
D_subset-100-22.tsv		D_subset-100-22.tsv
D_subset-100-22.tsv-cognates.tsv		D_subset-100-22.tsv-cognates.tsv
D_subset-100-22.tsv.bin.tsv		D_subset-100-22.tsv.bin.tsv
D_subset-100-22.tsv.bin.tsv-cognates.tsv		D_subset-100-22.tsv.bin.tsv-cognates.tsv
D_subset-100-23.tsv		D_subset-100-23.tsv
D_subset-100-28.tsv		D_subset-100-28.tsv
D_subset-200-22.tsv		D_subset-200-22.tsv
D_subset-200-22.tsv-cognates.tsv		D_subset-200-22.tsv-cognates.tsv
D_subset-200-22.tsv.bin.tsv		D_subset-200-22.tsv.bin.tsv
D_subset-200-22.tsv.bin.tsv-cognates.tsv		D_subset-200-22.tsv.bin.tsv-cognates.tsv
D_subset-200-23.tsv		D_subset-200-23.tsv
D_subset-200-24.tsv		D_subset-200-24.tsv
D_subset-300-22.tsv		D_subset-300-22.tsv
D_subset-300-22.tsv-cognates.tsv		D_subset-300-22.tsv-cognates.tsv
D_subset-300-22.tsv.bin.tsv		D_subset-300-22.tsv.bin.tsv
D_subset-300-22.tsv.bin.tsv-cognates.tsv		D_subset-300-22.tsv.bin.tsv-cognates.tsv
Hantgan-2021-300.tsv		Hantgan-2021-300.tsv
O_atlantic-lexstatid.pdf		O_atlantic-lexstatid.pdf
O_atlantic-scaid.pdf		O_atlantic-scaid.pdf
O_bangime-lexstatid.pdf		O_bangime-lexstatid.pdf
O_bangime-scaid.pdf		O_bangime-scaid.pdf
O_combined_100.matrix		O_combined_100.matrix
O_combined_100.pdf		O_combined_100.pdf
O_combined_200.matrix		O_combined_200.matrix
O_combined_200.pdf		O_combined_200.pdf
O_combined_300.matrix		O_combined_300.matrix
O_combined_300.pdf		O_combined_300.pdf
O_dogon-lexstatid.pdf		O_dogon-lexstatid.pdf
O_dogon-scaid.pdf		O_dogon-scaid.pdf
O_language_map.pdf		O_language_map.pdf
O_lexstat_100.matrix		O_lexstat_100.matrix
O_lexstat_100.pdf		O_lexstat_100.pdf
O_lexstat_200.matrix		O_lexstat_200.matrix
O_lexstat_200.pdf		O_lexstat_200.pdf
O_lexstat_300.matrix		O_lexstat_300.matrix
O_lexstat_300.pdf		O_lexstat_300.pdf
O_mande-lexstatid.pdf		O_mande-lexstatid.pdf
O_mande-scaid.pdf		O_mande-scaid.pdf
O_patterns-lexstatid.tsv		O_patterns-lexstatid.tsv
O_patterns-scaid.tsv		O_patterns-scaid.tsv
O_sca_100.matrix		O_sca_100.matrix
O_sca_100.pdf		O_sca_100.pdf
O_sca_200.matrix		O_sca_200.matrix
O_sca_200.pdf		O_sca_200.pdf
O_sca_300.matrix		O_sca_300.matrix
O_sca_300.pdf		O_sca_300.pdf
O_songhai-lexstatid.pdf		O_songhai-lexstatid.pdf
O_songhai-scaid.pdf		O_songhai-scaid.pdf
README.md		README.md
dogon-300-lexstat.dst		dogon-300-lexstat.dst
dogon-300-sca.dst		dogon-300-sca.dst
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Code and data accompanying the study "Bangime: Secret Language..."

Requirements

Workflow

Drawing the language map (requires basemap)

Coverage statistics

Check overlap with other concept lists

Cognate detection and heatmaps (requires matplotlib)

Barcharts

About

Releases 7

Packages

Contributors 6

Languages

lingpy/language-island-paper

Folders and files

Latest commit

History

Repository files navigation

Code and data accompanying the study "Bangime: Secret Language..."

Requirements

Workflow

Drawing the language map (requires basemap)

Coverage statistics

Check overlap with other concept lists

Cognate detection and heatmaps (requires matplotlib)

Barcharts

About

Resources

Stars

Watchers

Forks

Releases 7

Packages 0

Contributors 6

Languages

Packages