Skip to content

Commit

Permalink
Add files via upload
Browse files Browse the repository at this point in the history
Added 1 PORTULAN corpus
  • Loading branch information
jakoble authored Nov 7, 2024
1 parent 8b90d3a commit 10ba608
Showing 1 changed file with 17 additions and 0 deletions.
17 changes: 17 additions & 0 deletions corpora/reference-corpora/cintil-corpus-internacional.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,17 @@
{
"Name": "CINTIL-Corpus Internacional do Português",
"URL": "https://hdl.handle.net/21.11129/0000-000B-D33B-5",
"Family": "Manually annotated corpora",
"Description": "This is a linguistically annotated corpus of both written and spoken Portuguese, whose annotations were manually verified.\nThe written texts consists of fictional, newspaper, and technical discourse (689,124 tokens) while the spoken texts correspond to both informal and formal speech (502,622 tokens).\nThe corpus is available from PORTULAN.",
"Language": ["por"],
"Licence": "ELRA END USER",
"Size": ["1 million tokens"],
"Annotation": ["tokenised", "PoS-tagged", "lemmatised"],
"Infrastructure": "CLARIN",
"Group": ["PoS MSD tagging"],
"Access": {
"Concordancer": "http://cintil.ul.pt/",
"Download": "https://hdl.handle.net/21.11129/0000-000B-D33B-5"
},
"Publication": ""
}

0 comments on commit 10ba608

Please sign in to comment.