Skip to content

Latest commit

 

History

History
21 lines (14 loc) · 828 Bytes

README.md

File metadata and controls

21 lines (14 loc) · 828 Bytes

Referenzkorpus Mittelhochdeutsch

Source and license

Main page of the project

License:

Das Referenzkorpus Mittelhochdeutsch ist lizenziert unter einer Creative Commons Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International Lizenz.

No change is made on the corpus. This code is intended to parse the corpus.

Corpus retrieval

  1. Go to https://www.linguistics.rub.de/rem/access/index.html.
  2. Click on "CORA-XML AKS .TAR.XZ" or "CORA-XML ALS .ZIP"
  3. Click on "Herunterladen".
  4. Uncompress the dowloaded file.
  5. You have a folder, named rem-corraled-20161222 (2019-09-18) with a list of XML files which are annotated texts.

Code

The available code will parse individual XML files.