Accessing the corpus
Version 2.0
The Reference Corpus of Middle High German can be accessed via the corpus query tool ANNIS. The changes in version 2.0 are detailed in the NEWS file.
We also provide the complete corpus in multiple formats: TEI-compatible XML, Tabular JSON, which contains all of the available annotations, GraphML for use with a local ANNIS 4 instance, and CorA-XML, as before. These can be downloaded from our Zenodo repository via the links provided here:
The detailed text overview contains links to PDF versions of the texts’s transcriptions (without annotations). These can also be downloaded here as a single archive, with a total uncompressed size of around 77 MB.
Citation
If you use this corpus in your work, please cite it as follows:
Roussel, Adam; Klein, Thomas; Dipper, Stefanie; Wegera, Klaus-Peter; Wich-Reif, Claudia (2024). Referenzkorpus Mittelhochdeutsch (1050–1350), Version 2.0, https://www.linguistics.ruhr-uni-bochum.de/rem/. ISLRN 937-948-254-174-0.
Version 1.0
The previous version of the Reference Corpus of Middle High German can be accessed here:
Alternatively, we provide compressed archives with the individual texts in CorA-XML file format; when extracted, the total size of the files is around 1.1 GB.
Citation
If you use this corpus in your work, please cite it as follows:
Klein, Thomas; Wegera, Klaus-Peter; Dipper, Stefanie; Wich-Reif, Claudia (2016). Referenzkorpus Mittelhochdeutsch (1050–1350), Version 1.0, https://www.linguistics.ruhr-uni-bochum.de/rem/. ISLRN 332-536-136-099-5.
License
The Reference Corpus of Middle High German is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.