Reference Corpus of Middle High German (1050–1350)

Accessing the corpus

Version 2.0

The Reference Corpus of Middle High German can be accessed via the corpus query tool ANNIS. The changes in version 2.0 are detailed in the NEWS file.

We also provide the complete corpus in multiple formats: TEI-compatible XML, Tabular JSON, which contains all of the available annotations, GraphML for use with a local ANNIS 4 instance, and CorA-XML, as before. These can be downloaded from our Zenodo repository via the links provided here:

The detailed text overview contains links to PDF versions of the texts’s transcriptions (without annotations). These can also be downloaded here as a single archive, with a total uncompressed size of around 77 MB.

Citation

If you use this corpus in your work, please cite it as follows:

Roussel, Adam; Klein, Thomas; Dipper, Stefanie; Wegera, Klaus-Peter; Wich-Reif, Claudia (2024). Referenzkorpus Mittelhochdeutsch (1050–1350), Version 2.0, https://www.linguistics.ruhr-uni-bochum.de/rem/. ISLRN 937-948-254-174-0.

Version 1.0

The previous version of the Reference Corpus of Middle High German can be accessed here:

Alternatively, we provide compressed archives with the individual texts in CorA-XML file format; when extracted, the total size of the files is around 1.1 GB.

Citation

If you use this corpus in your work, please cite it as follows:

Klein, Thomas; Wegera, Klaus-Peter; Dipper, Stefanie; Wich-Reif, Claudia (2016). Referenzkorpus Mittelhochdeutsch (1050–1350), Version 1.0, https://www.linguistics.ruhr-uni-bochum.de/rem/. ISLRN 332-536-136-099-5.

License

The Reference Corpus of Middle High German is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Creative Commons License