Abstract |
This paper presents a simple method for drafting bilingual dictionary content using existing lexical and NLP resources for Basque. The method consists of five steps, three of semi-automatic drafting and two of semi-automatic and manual post-editing: (1) building a corpus-based frequency lemma list; (2) drafting the syntactical entities belonging to a lemma-sign; (3) drafting the word senses belonging to those syntactical entities; (4) semi-automatic detection of gaps in the syntactical entities; and (5) manual detection of word sense gaps. The method relies on existing resources for Basque and on the multilingual cross-references present in WordNet. Its application serves two goals: (1) drafting a series of bilingual dictionaries with Basque, and (2) contributing to the updating and enrichment of the two Basque NLP resources used for the drafting, EDBL and EusWN. |
BibTeX |
@InProceedings{ELX2016-099,
author={David Lindemann and Iñaki San Vicente},
title={Bilingual Dictionary Drafting: Connecting Basque Word Senses to Multilingual Equivalents},
pages={898--905},
booktitle={Proceedings of the 17th EURALEX International Congress},
year={2016},
month=sep,
date={6-10},
address={Tbilisi, Georgia},
editor={Tinatin Margalitadze and George Meladze},
publisher={Ivane Javakhishvili Tbilisi University Press},
isbn={978-9941-13-542-2},
} |