Commit

recreated dataset
xrotwang committed Jun 13, 2024
1 parent 6e61514 commit 48419fb
Showing 60 changed files with 15,522 additions and 6,324 deletions.
2 changes: 1 addition & 1 deletion .gitignore
@@ -100,4 +100,4 @@ ENV/
# mypy
.mypy_cache/
.idea/

raw/atlas/japan
4 changes: 3 additions & 1 deletion NOTES.md
@@ -87,4 +87,6 @@ Pama-Nyungan|104
[Speaker area shapes](https://github.com/cldf/cldf/tree/master/components/languages#speaker-area) are
provided as GeoJSON features, thus are available programmatically, e.g. using `pycldf`. But the GeoJSON
files for [language](cldf/languages.geojson)- and [family](cldf/families.geojson)-level areas can also
be inspected using GIS tools such as https://geojson.io
be inspected using GIS tools such as https://geojson.io

![](etc/erd.svg)
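
The GeoJSON files mentioned in this hunk can also be read with nothing but the Python standard library. The following is a minimal sketch, assuming each file is a GeoJSON `FeatureCollection`; the function name `summarize_features` and the sample feature (including its `title` property) are hypothetical and not taken from the dataset.

```python
import json


def summarize_features(geojson_text):
    """Return (geometry type, properties) for each feature in a FeatureCollection."""
    collection = json.loads(geojson_text)
    if collection.get("type") != "FeatureCollection":
        raise ValueError("expected a GeoJSON FeatureCollection")
    return [
        # GeoJSON allows a null geometry and null properties, so guard against both.
        ((feature.get("geometry") or {}).get("type"), feature.get("properties") or {})
        for feature in collection["features"]
    ]


# Hypothetical sample; real input would be the text of cldf/languages.geojson.
SAMPLE = json.dumps({
    "type": "FeatureCollection",
    "features": [
        {
            "type": "Feature",
            "properties": {"title": "Example"},
            "geometry": {
                "type": "Polygon",
                "coordinates": [[[0, 0], [1, 0], [1, 1], [0, 0]]],
            },
        }
    ],
})

for geometry_type, properties in summarize_features(SAMPLE):
    print(geometry_type, properties)
```

For anything beyond quick inspection, `pycldf` (as the text suggests) is probably the better entry point, since it resolves the links between languages and their speaker-area media.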
3 changes: 3 additions & 0 deletions README.md
@@ -106,6 +106,9 @@ provided as GeoJSON features, thus are available programmatically, e.g. using `p
files for [language](cldf/languages.geojson)- and [family](cldf/families.geojson)-level areas can also
be inspected using GIS tools such as https://geojson.io

![](etc/erd.svg)


## CLDF Datasets

The following CLDF datasets are available in [cldf](cldf):
3 changes: 2 additions & 1 deletion RELEASING.md
@@ -1,6 +1,7 @@
# Releasing

```shell
cldfbench download cldfbench_languageatlasofthepacificarea.py
cldfbench makecldf cldfbench_languageatlasofthepacificarea.py --glottolog-version v5.0
```

@@ -17,7 +18,7 @@ cldfbench cldfreadme cldfbench_languageatlasofthepacificarea.py
```

```shell
cldferd --format compact.svg cldf > erd.svg
cldferd --format compact.svg cldf > etc/erd.svg
```

```shell
13 changes: 10 additions & 3 deletions cldf/Generic-metadata.json
@@ -15,7 +15,7 @@
{
"rdf:about": "https://github.com/cldf-datasets/languageatlasofthepacificarea",
"rdf:type": "prov:Entity",
"dc:created": "v1.0-57-g1a5e2fb",
"dc:created": "v1.0-58-g6e61514",
"dc:title": "Repository"
},
{
@@ -45,7 +45,7 @@
"tableSchema": {
"columns": [
{
"dc:description": "We use the 1-based index of the first shape with corresponding LANGUAGE property in the original shapefile as identifier.",
"dc:description": "We use the 1-based index of the first shape with matching metadata in the original shapefile as identifier.",
"datatype": {
"base": "string",
"format": "[a-zA-Z0-9_\\-]+"
@@ -88,19 +88,21 @@
"name": "Rights"
},
{
"dc:description": "Contributions can be related to various kinds of media. ECAI shape contributions are linked to GeoJSON files that store the geo data; Atlas leaf contributions are linked to the corresponding scans and geo-data derived from these.",
"propertyUrl": "http://cldf.clld.org/v1.0/terms.rdf#mediaReference",
"separator": " ",
"name": "Media_IDs"
},
{
"dc:description": "There are two types of contributions: Individual shapes from ECAI's geo-registered dataset and individual leaves of the Atlas.",
"datatype": {
"base": "string",
"format": "leaf|shape"
},
"name": "Type"
},
{
"dc:description": "Shape metadata from ECAI's GIS dataset.",
"dc:description": "Shape metadata from ECAI's GIS dataset and Glottocodes of the Glottolog languoids to which the shape was matched.",
"datatype": "json",
"name": "Properties"
}
@@ -248,6 +250,11 @@
"name": "Speaker_Area"
},
{
"dc:description": "",
"datatype": {
"base": "string",
"format": "language|family"
},
"name": "Glottolog_Languoid_Level"
},
{
18 changes: 10 additions & 8 deletions cldf/README.md
@@ -12,7 +12,7 @@ property | value
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF Generic](http://cldf.clld.org/v1.0/terms.rdf#Generic)
[dc:license](http://purl.org/dc/terms/license) | https://creativecommons.org/licenses/by-nc/4.0/
[dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/cldf-datasets/languageatlasofthepacificarea
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/cldf-datasets/languageatlasofthepacificarea/tree/0e776e3">cldf-datasets/languageatlasofthepacificarea 0e776e3</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v5.0">Glottolog v5.0</a></li></ol>
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/cldf-datasets/languageatlasofthepacificarea/tree/6e61514">cldf-datasets/languageatlasofthepacificarea v1.0-58-g6e61514</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v5.0">Glottolog v5.0</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>python</strong>: 3.10.12</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[rdf:ID](http://www.w3.org/1999/02/22-rdf-syntax-ns#ID) | languageatlasofthepacificarea
[rdf:type](http://www.w3.org/1999/02/22-rdf-syntax-ns#type) | http://www.w3.org/ns/dcat#Distribution
@@ -25,28 +25,30 @@ We list the individual shapes from the source dataset as contributions in order
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF ContributionTable](http://cldf.clld.org/v1.0/terms.rdf#ContributionTable)
[dc:extent](http://purl.org/dc/terms/extent) | 3125
[dc:extent](http://purl.org/dc/terms/extent) | 3128


### Columns

Name/Property | Datatype | Description
--- | --- | ---
[ID](http://cldf.clld.org/v1.0/terms.rdf#id) | `string`<br>Regex: `[a-zA-Z0-9_\-]+` | We use the 1-based index of the first shape with corresponding LANGUAGE property in the original shapefile as identifier.<br>Primary key
[ID](http://cldf.clld.org/v1.0/terms.rdf#id) | `string`<br>Regex: `[a-zA-Z0-9_\-]+` | We use the 1-based index of the first shape with matching metadata in the original shapefile as identifier.<br>Primary key
[Name](http://cldf.clld.org/v1.0/terms.rdf#name) | `string` |
[Description](http://cldf.clld.org/v1.0/terms.rdf#description) | `string` |
[Contributor](http://cldf.clld.org/v1.0/terms.rdf#contributor) | `string` |
[Citation](http://cldf.clld.org/v1.0/terms.rdf#citation) | `string` |
[Source](http://cldf.clld.org/v1.0/terms.rdf#source) | list of `string` (separated by `;`) | References [sources.bib::BibTeX-key](./sources.bib)
[Media_IDs](http://cldf.clld.org/v1.0/terms.rdf#mediaReference) | list of `string` (separated by ` `) | References [media.csv::ID](#table-mediacsv)
`Type` | `string`<br>Valid choices:<br> `leaf` `shape` |
[Rights](http://purl.org/dc/terms/rights) | `string` |
[Media_IDs](http://cldf.clld.org/v1.0/terms.rdf#mediaReference) | list of `string` (separated by ` `) | Contributions can be related to various kinds of media. ECAI shape contributions are linked to GeoJSON files that store the geo data; Atlas leaf contributions are linked to the corresponding scans and geo-data derived from these.<br>References [media.csv::ID](#table-mediacsv)
`Type` | `string`<br>Valid choices:<br> `leaf` `shape` | There are two types of contributions: Individual shapes from ECAI's geo-registered dataset and individual leaves of the Atlas.
`Properties` | `json` | Shape metadata from ECAI's GIS dataset and Glottocodes of the Glottolog languoids to which the shape was matched.
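
The constraints documented for `ID` and `Type` in this table can be checked row by row. The following is a minimal sketch, assuming the regex is meant to match the whole value and that rows arrive as plain dicts (for example from `csv.DictReader`); the helper `check_contribution` and the sample rows are hypothetical.

```python
import re

# Pattern and valid choices copied from the column descriptions above.
ID_PATTERN = re.compile(r"[a-zA-Z0-9_\-]+")
CONTRIBUTION_TYPES = {"leaf", "shape"}


def check_contribution(row):
    """Return a list of problems found in one contributions.csv row (a dict)."""
    problems = []
    # fullmatch: assume the whole value must conform, not just a substring.
    if not ID_PATTERN.fullmatch(row.get("ID", "")):
        problems.append("ID does not match the documented pattern")
    if row.get("Type") not in CONTRIBUTION_TYPES:
        problems.append("Type must be 'leaf' or 'shape'")
    return problems


# Hypothetical rows for illustration only.
print(check_contribution({"ID": "L001", "Type": "leaf"}))
print(check_contribution({"ID": "bad id", "Type": "map"}))
```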

## <a name="table-mediacsv"></a>Table [media.csv](./media.csv)

property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF MediaTable](http://cldf.clld.org/v1.0/terms.rdf#MediaTable)
[dc:extent](http://purl.org/dc/terms/extent) | 307
[dc:extent](http://purl.org/dc/terms/extent) | 357


### Columns
@@ -65,7 +67,7 @@ Name/Property | Datatype | Description
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF LanguageTable](http://cldf.clld.org/v1.0/terms.rdf#LanguageTable)
[dc:extent](http://purl.org/dc/terms/extent) | 1869
[dc:extent](http://purl.org/dc/terms/extent) | 1865


### Columns
@@ -80,7 +82,7 @@ Name/Property | Datatype | Description
[Glottocode](http://cldf.clld.org/v1.0/terms.rdf#glottocode) | `string`<br>Regex: `[a-z0-9]{4}[1-9][0-9]{3}` |
[ISO639P3code](http://cldf.clld.org/v1.0/terms.rdf#iso639P3code) | `string`<br>Regex: `[a-z]{3}` |
[Speaker_Area](http://cldf.clld.org/v1.0/terms.rdf#speakerArea) | `string` | References [media.csv::ID](#table-mediacsv)
`Glottolog_Languoid_Level` | `string` |
`Glottolog_Languoid_Level` | `string`<br>Valid choices:<br> `language` `family` |
`Family` | `string` |
[Contribution_IDs](http://cldf.clld.org/v1.0/terms.rdf#contributionReference) | list of `string` (separated by ` `) | List of identifiers of shapes in the original shapefile that were aggregated to create the shape referenced by Speaker_Area and of Atlas leaves mapping a georeferenced area intersecting with this languoid's area.<br>References [contributions.csv::ID](#table-contributionscsv)
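
Since `Contribution_IDs` is a space-separated list, it has to be split before it can be resolved against `contributions.csv`. The following is a minimal sketch using only the standard library; it assumes the table has an `ID` column (the CLDF default, not visible in this hunk), and the function name and the sample rows are hypothetical.

```python
import csv
import io


def contribution_ids_by_language(languages_csv_text):
    """Map each language ID to its list of contribution IDs."""
    reader = csv.DictReader(io.StringIO(languages_csv_text))
    # str.split() without arguments splits on whitespace and yields [] for an empty cell.
    return {row["ID"]: row["Contribution_IDs"].split() for row in reader}


# Hypothetical sample; real input would be the text of cldf/languages.csv.
SAMPLE = (
    "ID,Name,Glottolog_Languoid_Level,Contribution_IDs\n"
    "lang1,Example A,language,12 L001\n"
    "fam1,Example B,family,\n"
)

print(contribution_ids_by_language(SAMPLE))
```

`pycldf` reads the separator from the metadata and would do this splitting automatically; the sketch only shows what the list-valued column looks like at the CSV level.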

3 changes: 1 addition & 2 deletions cldf/atlas/L001/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmp57zql4ou/web.tif"
"id": "0"
},
"type": "Feature"
}
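
The change repeated across the `bounds.geojson` files removes a `title` property that held a temporary file path. The following is a minimal sketch of how such a property could be dropped from a GeoJSON feature; it is not necessarily how the dataset's own code does it, and the helper `drop_property` is hypothetical. The property values are the ones shown in the hunk above, while the geometry is left out of the sketch.

```python
import json


def drop_property(feature, key="title"):
    """Return a copy of a GeoJSON feature without the given property."""
    properties = {
        name: value
        for name, value in (feature.get("properties") or {}).items()
        if name != key
    }
    # Build a new dict so the input feature is left unmodified.
    return {**feature, "properties": properties}


feature = {
    "type": "Feature",
    "properties": {
        "filename": "web.tif",
        "id": "0",
        "title": "/tmp/tmp57zql4ou/web.tif",
    },
    "geometry": None,  # geometry omitted in this sketch
}

cleaned = drop_property(feature)
print(json.dumps(cleaned["properties"], sort_keys=True))
```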
3 changes: 1 addition & 2 deletions cldf/atlas/L002/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpqwfdv3ym/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L003/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpz0e0htz7/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L004/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpznm1dukc/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L005/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpeu40deb2/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L006/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpddlwqt_w/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L007/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpj989z516/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L008/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmp61lmvm1c/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L009/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmp8ia6ebc3/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L010/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpblh41pbo/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L011/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpot3pt2u6/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L012/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmphfykj3o5/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L013/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmp1in2yez9/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L014/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpbbes6ndm/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L015/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmplk8rinnk/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L016a/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpwaf3_avb/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L016b/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpzoceuvne/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L017/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpvsbm38g8/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L018/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmp6criai69/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L019a/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpaodkdsx2/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L019b/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmp4ce48pjt/web.tif"
"id": "0"
},
"type": "Feature"
}
3 changes: 1 addition & 2 deletions cldf/atlas/L020/bounds.geojson
@@ -34,8 +34,7 @@
},
"properties": {
"filename": "web.tif",
"id": "0",
"title": "/tmp/tmpl_8m7nfd/web.tif"
"id": "0"
},
"type": "Feature"
}