Using local rules for disambiguation of homographs in Hungarian corpora

By November 17, 2016,
Page 239-248
Author Judith Pais, Júlia Pajzs
Title Using local rules for disambiguation of homographs in Hungarian corpora
Abstract The historical corpus of Hungarian contains about 20 million running words at the moment. To be able to retrieve the occurrences of the lexemes, a morphological analyser programme was developed which is able to segment the running words and identifies the lexeme and the suffixes. Over 30% of the running words can have more then one correct analysis. Therefore we are aiming to develop methods for automatic disambiguation of the analysed text. This paper desrcibes an attempt for disambiguation by using local rules.
Session PART 2 - Computational Lexicology and Lexicography
Keywords corpus analysis, disambiguation, lexeme retrieval
BibTex
@InProceedings{ELX98_1-029,
author = {Judith Pais, Júlia Pajzs},
title = {Using local rules for disambiguation of homographs in Hungarian corpora},
pages = {239-248},
booktitle = {Proceedings of the 8th EURALEX International Congress},
year = {1998},
month = {aug},
date = {4-8},
address = {Liège, Belgium},
editor = {Thierry Fontenelle, Philippe Hiligsmann, Archibald Michiels, André Moulin, Siegfried Theissen},
publisher = {Euralex},
isbn = {2-87233-091-7},
}
Download