From Diachronic Treebank to Dictionary Resource: the Varangian Rus Project

By November 23, 2016,
Page 335-339
Author Hanne Martine Eckhoff, Aleksandrs Berdičevskis
Title From Diachronic Treebank to Dictionary Resource: the Varangian Rus Project
Abstract In this paper we present the Varangian Rus’ dictionary resource, which is based on the Tromsø Old Russian and OCS Treebank (TOROT), a diachronic treebank of Russian containing a balanced selection of 11th–17th century Old East Slavic and Middle Russian texts. The treebank is lemmatised and has rich morphological and syntactic annotation. With simple glossing of the word meanings found in the treebank, we are able to generate a dictionary resource with rich grammatical information. The dictionary work also enables us to improve the lemmatisation of the treebank considerably, as well as annotation at other levels. The dictionary resource will be published as a searchable online tool towards the end of 2016 and is intended for students and scholars alike.
Session Lexicography and Corpus Linguistics
Keywords treebank; historical dictionary; Russian
BibTex
@InProceedings{ELX2016-035,
author={Hanne Martine Eckhoff, Aleksandrs Berdičevskis},
title={From Diachronic Treebank to Dictionary Resource: the Varangian Rus Project},
pages={335-339},
booktitle={Proceedings of the 17th EURALEX International Congress},
year={2016},
month={sep},
date={6-10},
address={Tbilisi, Georgia},
editor={Tinatin Margalitadze, George Meladze},
publisher={Ivane Javakhishvili Tbilisi University Press},
isbn={978-9941-13-542-2},
}
Download