Orthographic Variation in Lexical Databases

By November 17, 2016,
Page 167-172
Author Maarten Janssen
Title Orthographic Variation in Lexical Databases
Abstract Traditionally, orthographic variants have been modelled as different ways of spelling the same word - described at the level of the lexeme. But when inflection is taken into account, this runs into a problem: different citation forms have different inflectional paradigms - and orthographic variation does not merely affect the citation form, but the entire paradigm. The MorDebe database therefore models orthographic variation as a relation between distinct, yet still token-identical lexemes. This paper discusses the advantage of that approach, and the full set of practical problems that arose during the structural treatment of orthographic variation in the MorDebe database.
Session 3. COMPUTATIONAL LEXICOGRAPHY AND LEXICOLOGY
Keywords
BibTeX
@InProceedings{ELX06-022,
author = {Maarten Janssen},
title = {Orthographic Variation in Lexical Databases},
pages = {167--172},
booktitle = {Proceedings of the 12th EURALEX International Congress},
year = {2006},
month = sep,
date = {6-9},
address = {Torino, Italy},
editor = {Elisa Corino and Carla Marello and Cristina Onesti},
publisher = {Edizioni dell'Orso},
isbn = {88-7694-918-6},
}