Using local rules for disambiguation of homographs in Hungarian corpora

By admynNovember 17, 2016Euralex 1998 Part 1, Publications

Page	239-248
Author	Judith Pais, Júlia Pajzs
Title	Using local rules for disambiguation of homographs in Hungarian corpora
Abstract	The historical corpus of Hungarian contains about 20 million running words at the moment. To be able to retrieve the occurrences of the lexemes, a morphological analyser programme was developed which is able to segment the running words and identifies the lexeme and the suffixes. Over 30% of the running words can have more then one correct analysis. Therefore we are aiming to develop methods for automatic disambiguation of the analysed text. This paper desrcibes an attempt for disambiguation by using local rules.
Session	PART 2 - Computational Lexicology and Lexicography
Keywords	corpus analysis, disambiguation, lexeme retrieval
BibTex	@InProceedings{ELX98_1-029, author = {Judith Pais, Júlia Pajzs}, title = {Using local rules for disambiguation of homographs in Hungarian corpora}, pages = {239-248}, booktitle = {Proceedings of the 8th EURALEX International Congress}, year = {1998}, month = {aug}, date = {4-8}, address = {Liège, Belgium}, editor = {Thierry Fontenelle, Philippe Hiligsmann, Archibald Michiels, André Moulin, Siegfried Theissen}, publisher = {Euralex}, isbn = {2-87233-091-7}, }
Download