Using local rules for disambiguation of homographs in Hungarian corpora

Page239-248
AuthorJudith Pais, Júlia Pajzs
TitleUsing local rules for disambiguation of homographs in Hungarian corpora
AbstractThe historical corpus of Hungarian contains about 20 million running words at the moment. To be able to retrieve the occurrences of the lexemes, a morphological analyser programme was developed which is able to segment the running words and identifies the lexeme and the suffixes. Over 30% of the running words can have more then one correct analysis. Therefore we are aiming to develop methods for automatic disambiguation of the analysed text. This paper desrcibes an attempt for disambiguation by using local rules.
SessionPART 2 - Computational Lexicology and Lexicography
Keywordscorpus analysis, disambiguation, lexeme retrieval
BibTex
@InProceedings{ELX98_1-029,
author = {Judith Pais, Júlia Pajzs},
title = {Using local rules for disambiguation of homographs in Hungarian corpora},
pages = {239-248},
booktitle = {Proceedings of the 8th EURALEX International Congress},
year = {1998},
month = {aug},
date = {4-8},
address = {Liège, Belgium},
editor = {Thierry Fontenelle, Philippe Hiligsmann, Archibald Michiels, André Moulin, Siegfried Theissen},
publisher = {Euralex},
isbn = {2-87233-091-7},
}
Download