proper country table and more semantic source IDs
xrotwang committed Jul 3, 2023
1 parent 93ddc32 commit 3cc417f
Showing 10 changed files with 73,627 additions and 73,343 deletions.
24 changes: 20 additions & 4 deletions cldf/README.md
@@ -13,7 +13,7 @@ property | value
[dc:identifier](http://purl.org/dc/terms/identifier) | https://endangeredlanguages.com
[dc:license](http://purl.org/dc/terms/license) | https://creativecommons.org/licenses/by/4.0/
[dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/cldf-datasets/elcat
-[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/cldf-datasets/elcat/tree/fceed2a">cldf-datasets/elcat v2023-1-gfceed2a</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v4.7">Glottolog v4.7</a></li></ol>
+[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/cldf-datasets/elcat/tree/v2023.2">cldf-datasets/elcat v2023.2</a></li><li><a href="https://github.com/glottolog/glottolog/tree/05491b967d">Glottolog v4.7-39-g05491b967d</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>python</strong>: 3.10.6</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[rdf:ID](http://www.w3.org/1999/02/22-rdf-syntax-ns#ID) | elcat
[rdf:type](http://www.w3.org/1999/02/22-rdf-syntax-ns#type) | http://www.w3.org/ns/dcat#Distribution
@@ -24,7 +24,7 @@ property | value
property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF ValueTable](http://cldf.clld.org/v1.0/terms.rdf#ValueTable)
-[dc:extent](http://purl.org/dc/terms/extent) | 64826
+[dc:extent](http://purl.org/dc/terms/extent) | 64833


### Columns
@@ -60,10 +60,10 @@ Name/Property | Datatype | Description
[Glottocode](http://cldf.clld.org/v1.0/terms.rdf#glottocode) | `string` |
[ISO639P3code](http://cldf.clld.org/v1.0/terms.rdf#iso639P3code) | `string` |
[Comment](http://cldf.clld.org/v1.0/terms.rdf#comment) | `string` |
-`Countries` | list of `string` (separated by ` `) | Countries a language is spoken in given by ISO 3166-1 alpha-2 code
+`Countries` | list of `string` (separated by ` `) | Countries a language is spoken in given by ISO 3166-1 alpha-2 code<br>References [countries.csv::alpha_2](#table-countriescsv)
`ELCatMacroareas` | list of `string` (separated by `; `) |
`classification` | `string` | Top-level genealogical unit the language belongs to.
-`endangerment` | `string` | ElCat's aggregated endangerment assessment.
+`endangerment` | `string` | ElCat's aggregated endangerment assessment. Note that in a few cases this endangerment assessment does **not** match the assessment in the preferred source as given for parameter LEI in the ValueTable.
`code_authorities` | list of `string` (separated by `; `) | Other language catalogs which have assigned codes to the language.
`codes` | list of `string` (separated by `; `) | Codes assigned to the language by other language catalogs.
`alt_names` | list of `string` (separated by `; `) | Alternative names used for the language.
@@ -102,3 +102,19 @@ Name/Property | Datatype | Description
[Name](http://cldf.clld.org/v1.0/terms.rdf#name) | `string` |
[Description](http://cldf.clld.org/v1.0/terms.rdf#description) | `string` |

## <a name="table-countriescsv"></a>Table [countries.csv](./countries.csv)

property | value
--- | ---
[dc:extent](http://purl.org/dc/terms/extent) | 172


### Columns

Name/Property | Datatype | Description
--- | --- | ---
[alpha_2](http://cldf.clld.org/v1.0/terms.rdf#id) | `string` | Primary key
`alpha_3` | `string` |
[name](http://cldf.clld.org/v1.0/terms.rdf#name) | `string` |
`official_name` | `string` |

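As a rough illustration of how the new `Countries` reference could be used, the sketch below splits a space-separated `Countries` value and looks each code up in a mapping keyed by `alpha_2`. The inline sample rows are copied from `countries.csv` in this commit; the helper names `load_countries` and `resolve_countries` are illustrative assumptions and not part of the dataset tooling.

```python
import csv
import io

# Sample rows copied from countries.csv in this commit; a real script would
# open cldf/countries.csv instead of this inline string.
COUNTRIES_CSV = """alpha_2,alpha_3,name,official_name
AU,AUS,Australia,
BO,BOL,"Bolivia, Plurinational State of",Plurinational State of Bolivia
PE,PER,Peru,Republic of Peru
"""


def load_countries(text):
    """Return a dict mapping alpha_2 codes to the full CSV row."""
    return {row["alpha_2"]: row for row in csv.DictReader(io.StringIO(text))}


def resolve_countries(value, countries):
    """Split a space-separated Countries value and return the country names.

    Hypothetical helper: raises KeyError for a code missing from the table,
    which is the situation the foreign key is meant to rule out.
    """
    return [countries[code]["name"] for code in value.split()]


countries = load_countries(COUNTRIES_CSV)
names = resolve_countries("BO PE", countries)
print(names)
```

Keying the lookup on `alpha_2` mirrors the primary key declared for the table, so each code in a `Countries` list resolves to at most one row.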
50 changes: 45 additions & 5 deletions cldf/StructureDataset-metadata.json
@@ -16,13 +16,13 @@
{
"rdf:about": "https://github.com/cldf-datasets/elcat",
"rdf:type": "prov:Entity",
-"dc:created": "v2023-1-gfceed2a",
+"dc:created": "v2023.2",
"dc:title": "Repository"
},
{
"rdf:about": "https://github.com/glottolog/glottolog",
"rdf:type": "prov:Entity",
-"dc:created": "v4.7",
+"dc:created": "v4.7-39-g05491b967d",
"dc:title": "Glottolog"
}
],
@@ -41,7 +41,7 @@
"tables": [
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#ValueTable",
-"dc:extent": 64826,
+"dc:extent": 64833,
"tableSchema": {
"columns": [
{
@@ -160,7 +160,7 @@
},
"propertyUrl": "http://cldf.clld.org/v1.0/terms.rdf#id",
"required": true,
-"valueUrl": "https://www.endangeredlanguages.com/lang/{ID}",
+"valueUrl": "https://endangeredlanguages.com/lang/{ID}",
"name": "ID"
},
{
@@ -232,7 +232,7 @@
"name": "classification"
},
{
-"dc:description": "ElCat's aggregated endangerment assessment.",
+"dc:description": "ElCat's aggregated endangerment assessment. Note that in a few cases this endangerment assessment does **not** match the assessment in the preferred source as given for parameter LEI in the ValueTable.",
"datatype": {
"base": "string",
"format": "at risk|awakening|critically endangered|dormant|endangered|endangerment|severely endangered|threatened|vulnerable"
@@ -259,6 +259,19 @@
"name": "alt_names"
}
],
"foreignKeys": [
{
"columnReference": [
"Countries"
],
"reference": {
"resource": "countries.csv",
"columnReference": [
"alpha_2"
]
}
}
],
"primaryKey": [
"ID"
]
@@ -357,6 +370,33 @@
]
},
"url": "codes.csv"
},
{
"dc:extent": 172,
"tableSchema": {
"columns": [
{
"propertyUrl": "http://cldf.clld.org/v1.0/terms.rdf#id",
"name": "alpha_2"
},
{
"datatype": "string",
"name": "alpha_3"
},
{
"propertyUrl": "http://cldf.clld.org/v1.0/terms.rdf#name",
"name": "name"
},
{
"datatype": "string",
"name": "official_name"
}
],
"primaryKey": [
"alpha_2"
]
},
"url": "countries.csv"
}
]
}
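
Two of the metadata properties touched above lend themselves to a quick check: the `format` pattern on `endangerment` and the `valueUrl` template on `ID`. The following is a minimal sketch, assuming the `format` string is meant to match the whole cell value as a regular expression and that plain `str.format` is an adequate stand-in for URI template expansion. The function names and the ID `"1234"` are made up for illustration.

```python
import re

# Values copied from the metadata above.
ENDANGERMENT_FORMAT = (
    "at risk|awakening|critically endangered|dormant|endangered|"
    "endangerment|severely endangered|threatened|vulnerable"
)
VALUE_URL = "https://endangeredlanguages.com/lang/{ID}"


def is_valid_endangerment(value):
    """Check a value against the format, treated as a whole-string regex."""
    return re.fullmatch(ENDANGERMENT_FORMAT, value) is not None


def language_url(language_id):
    """Expand the valueUrl template for a single ID.

    str.format stands in for full URI template expansion here, an
    assumption that only holds for simple IDs without reserved characters.
    """
    return VALUE_URL.format(ID=language_id)


print(is_valid_endangerment("severely endangered"))
print(is_valid_endangerment("extinct"))
print(language_url("1234"))  # "1234" is a made-up ID for illustration
```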
173 changes: 173 additions & 0 deletions cldf/countries.csv
@@ -0,0 +1,173 @@
alpha_2,alpha_3,name,official_name
AF,AFG,Afghanistan,Islamic Republic of Afghanistan
AL,ALB,Albania,Republic of Albania
DZ,DZA,Algeria,People's Democratic Republic of Algeria
AO,AGO,Angola,Republic of Angola
AR,ARG,Argentina,Argentine Republic
AU,AUS,Australia,
AT,AUT,Austria,Republic of Austria
AZ,AZE,Azerbaijan,Republic of Azerbaijan
BD,BGD,Bangladesh,People's Republic of Bangladesh
BY,BLR,Belarus,Republic of Belarus
BE,BEL,Belgium,Kingdom of Belgium
BZ,BLZ,Belize,
BJ,BEN,Benin,Republic of Benin
BT,BTN,Bhutan,Kingdom of Bhutan
BO,BOL,"Bolivia, Plurinational State of",Plurinational State of Bolivia
BA,BIH,Bosnia and Herzegovina,Republic of Bosnia and Herzegovina
BW,BWA,Botswana,Republic of Botswana
BR,BRA,Brazil,Federative Republic of Brazil
BN,BRN,Brunei Darussalam,
BG,BGR,Bulgaria,Republic of Bulgaria
BF,BFA,Burkina Faso,
KH,KHM,Cambodia,Kingdom of Cambodia
CM,CMR,Cameroon,Republic of Cameroon
CA,CAN,Canada,
CF,CAF,Central African Republic,
TD,TCD,Chad,Republic of Chad
CL,CHL,Chile,Republic of Chile
CN,CHN,China,People's Republic of China
CO,COL,Colombia,Republic of Colombia
KM,COM,Comoros,Union of the Comoros
CG,COG,Congo,Republic of the Congo
CD,COD,"Congo, The Democratic Republic of the",
CK,COK,Cook Islands,
CR,CRI,Costa Rica,Republic of Costa Rica
HR,HRV,Croatia,Republic of Croatia
CY,CYP,Cyprus,Republic of Cyprus
CZ,CZE,Czechia,Czech Republic
CI,CIV,Côte d'Ivoire,Republic of Côte d'Ivoire
DK,DNK,Denmark,Kingdom of Denmark
EC,ECU,Ecuador,Republic of Ecuador
EG,EGY,Egypt,Arab Republic of Egypt
SV,SLV,El Salvador,Republic of El Salvador
GQ,GNQ,Equatorial Guinea,Republic of Equatorial Guinea
ER,ERI,Eritrea,the State of Eritrea
EE,EST,Estonia,Republic of Estonia
ET,ETH,Ethiopia,Federal Democratic Republic of Ethiopia
FJ,FJI,Fiji,Republic of Fiji
FI,FIN,Finland,Republic of Finland
FR,FRA,France,French Republic
GF,GUF,French Guiana,
PF,PYF,French Polynesia,
GA,GAB,Gabon,Gabonese Republic
GM,GMB,Gambia,Republic of the Gambia
GE,GEO,Georgia,
DE,DEU,Germany,Federal Republic of Germany
GH,GHA,Ghana,Republic of Ghana
GR,GRC,Greece,Hellenic Republic
GU,GUM,Guam,
GT,GTM,Guatemala,Republic of Guatemala
GN,GIN,Guinea,Republic of Guinea
GW,GNB,Guinea-Bissau,Republic of Guinea-Bissau
GY,GUY,Guyana,Republic of Guyana
HN,HND,Honduras,Republic of Honduras
HU,HUN,Hungary,Hungary
IN,IND,India,Republic of India
ID,IDN,Indonesia,Republic of Indonesia
IR,IRN,"Iran, Islamic Republic of",Islamic Republic of Iran
IQ,IRQ,Iraq,Republic of Iraq
IE,IRL,Ireland,
IL,ISR,Israel,State of Israel
IT,ITA,Italy,Italian Republic
JM,JAM,Jamaica,
JP,JPN,Japan,
KZ,KAZ,Kazakhstan,Republic of Kazakhstan
KE,KEN,Kenya,Republic of Kenya
KI,KIR,Kiribati,Republic of Kiribati
KR,KOR,"Korea, Republic of",
KW,KWT,Kuwait,State of Kuwait
KG,KGZ,Kyrgyzstan,Kyrgyz Republic
LA,LAO,Lao People's Democratic Republic,
LV,LVA,Latvia,Republic of Latvia
LB,LBN,Lebanon,Lebanese Republic
LS,LSO,Lesotho,Kingdom of Lesotho
LR,LBR,Liberia,Republic of Liberia
LY,LBY,Libya,Libya
LI,LIE,Liechtenstein,Principality of Liechtenstein
LT,LTU,Lithuania,Republic of Lithuania
LU,LUX,Luxembourg,Grand Duchy of Luxembourg
MG,MDG,Madagascar,Republic of Madagascar
MY,MYS,Malaysia,
ML,MLI,Mali,Republic of Mali
MR,MRT,Mauritania,Islamic Republic of Mauritania
MX,MEX,Mexico,United Mexican States
FM,FSM,"Micronesia, Federated States of",Federated States of Micronesia
MD,MDA,"Moldova, Republic of",Republic of Moldova
MN,MNG,Mongolia,
ME,MNE,Montenegro,Montenegro
MA,MAR,Morocco,Kingdom of Morocco
MZ,MOZ,Mozambique,Republic of Mozambique
MM,MMR,Myanmar,Republic of Myanmar
NA,NAM,Namibia,Republic of Namibia
NR,NRU,Nauru,Republic of Nauru
NP,NPL,Nepal,Federal Democratic Republic of Nepal
NL,NLD,Netherlands,Kingdom of the Netherlands
NC,NCL,New Caledonia,
NZ,NZL,New Zealand,
NI,NIC,Nicaragua,Republic of Nicaragua
NG,NGA,Nigeria,Federal Republic of Nigeria
NU,NIU,Niue,Niue
NF,NFK,Norfolk Island,
MK,MKD,North Macedonia,Republic of North Macedonia
MP,MNP,Northern Mariana Islands,Commonwealth of the Northern Mariana Islands
NO,NOR,Norway,Kingdom of Norway
OM,OMN,Oman,Sultanate of Oman
PK,PAK,Pakistan,Islamic Republic of Pakistan
PW,PLW,Palau,Republic of Palau
PA,PAN,Panama,Republic of Panama
PG,PNG,Papua New Guinea,Independent State of Papua New Guinea
PY,PRY,Paraguay,Republic of Paraguay
PE,PER,Peru,Republic of Peru
PH,PHL,Philippines,Republic of the Philippines
PN,PCN,Pitcairn,
PL,POL,Poland,Republic of Poland
PT,PRT,Portugal,Portuguese Republic
PR,PRI,Puerto Rico,
RO,ROU,Romania,
RU,RUS,Russian Federation,
RE,REU,Réunion,
SN,SEN,Senegal,Republic of Senegal
RS,SRB,Serbia,Republic of Serbia
SL,SLE,Sierra Leone,Republic of Sierra Leone
SG,SGP,Singapore,Republic of Singapore
SK,SVK,Slovakia,Slovak Republic
SI,SVN,Slovenia,Republic of Slovenia
SB,SLB,Solomon Islands,
SO,SOM,Somalia,Federal Republic of Somalia
ZA,ZAF,South Africa,Republic of South Africa
SS,SSD,South Sudan,Republic of South Sudan
ES,ESP,Spain,Kingdom of Spain
LK,LKA,Sri Lanka,Democratic Socialist Republic of Sri Lanka
SD,SDN,Sudan,Republic of the Sudan
SR,SUR,Suriname,Republic of Suriname
SE,SWE,Sweden,Kingdom of Sweden
CH,CHE,Switzerland,Swiss Confederation
SY,SYR,Syrian Arab Republic,
TW,TWN,"Taiwan, Province of China","Taiwan, Province of China"
TJ,TJK,Tajikistan,Republic of Tajikistan
TZ,TZA,"Tanzania, United Republic of",United Republic of Tanzania
TH,THA,Thailand,Kingdom of Thailand
TL,TLS,Timor-Leste,Democratic Republic of Timor-Leste
TG,TGO,Togo,Togolese Republic
TK,TKL,Tokelau,
TO,TON,Tonga,Kingdom of Tonga
TT,TTO,Trinidad and Tobago,Republic of Trinidad and Tobago
TN,TUN,Tunisia,Republic of Tunisia
TR,TUR,Turkey,Republic of Turkey
TM,TKM,Turkmenistan,
TV,TUV,Tuvalu,
UG,UGA,Uganda,Republic of Uganda
UA,UKR,Ukraine,
AE,ARE,United Arab Emirates,
GB,GBR,United Kingdom,United Kingdom of Great Britain and Northern Ireland
US,USA,United States,United States of America
UY,URY,Uruguay,Eastern Republic of Uruguay
UZ,UZB,Uzbekistan,Republic of Uzbekistan
VU,VUT,Vanuatu,Republic of Vanuatu
VE,VEN,"Venezuela, Bolivarian Republic of",Bolivarian Republic of Venezuela
VN,VNM,Viet Nam,Socialist Republic of Viet Nam
EH,ESH,Western Sahara,
YE,YEM,Yemen,Republic of Yemen
ZM,ZMB,Zambia,Republic of Zambia
ZW,ZWE,Zimbabwe,Republic of Zimbabwe
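
Since `official_name` is empty for many rows and several `name` values contain quoted commas, a consumer of this table may want a primary-key check and a label fallback. The sketch below shows one way to do that with the standard `csv` module on three rows copied from the file above; the `duplicates` and `labels` names are illustrative only.

```python
import csv
import io
from collections import Counter

# Three rows copied from countries.csv; a real script would read the file.
SAMPLE = """alpha_2,alpha_3,name,official_name
CA,CAN,Canada,
CD,COD,"Congo, The Democratic Republic of the",
TW,TWN,"Taiwan, Province of China","Taiwan, Province of China"
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE)))

# Primary key check: every alpha_2 value should occur exactly once.
counts = Counter(row["alpha_2"] for row in rows)
duplicates = [code for code, n in counts.items() if n > 1]

# official_name is empty for many rows, so fall back to name when labelling.
labels = {row["alpha_2"]: row["official_name"] or row["name"] for row in rows}

print(duplicates)
print(labels["CD"])
```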