Releases: langdoc/four-battles-corpus
Releases · langdoc/four-battles-corpus
Parallel corpus for Erzya, Hill Mari, Permian Komi, Zyrian Komi and Udmurt
This versions contains all languages aligned this far, but they are not in an uniform structure, and all annotations added in different studies have not been merged. At the moment Markdown files contain tagging marked with bold and cursive in the markdown, and the CoNLL-U files contain additional syntactic tagging, especially for subject and object.
Parallel corpus for Komi-Zyrian, Komi-Permyak, Udmurt and Hill Mari
Data used in Jeremy Bradley's, Alexandra Kellner's and Niko Partanen's paper Variation in word order in Permic and Mari varieties: a corpus-based investigation at symposium "Language contacts of the nations of Volga-Ural region", Cheboksary, 21–24.5.2018.