Methods for quality assurance in semi-automatic lexicon acquisition from corpora

Pages: 119-127
Author: Judith Eckle-Kohler
Title: Methods for quality assurance in semi-automatic lexicon acquisition from corpora
Abstract: This paper presents linguistics-based methods and engineering methods for quality assurance in semi-automatic acquisition of broad coverage lexicons from corpora. Automated linguistic tests are used to acquire candidates for particular subcategorization frames automatically; the regular use of metrics in the acquisition process contributes to a controlled development of these tests. The proposed methods are illustrated by the acquisition of a particular class of verbs taking daß-clauses in German, showing how the precision of the automatically acquired data can be maximized with only a slight decrease in recall.
Session: PART 2 - Computational Lexicology and Lexicography
Keywords: Quality assurance in lexicon acquisition, semi-automatic, corpus-based lexicon acquisition
BibTeX
@InProceedings{ELX98_1-017,
author = {Judith Eckle-Kohler},
title = {Methods for quality assurance in semi-automatic lexicon acquisition from corpora},
pages = {119--127},
booktitle = {Proceedings of the 8th EURALEX International Congress},
year = {1998},
month = aug,
date = {4-8},
address = {Liège, Belgium},
editor = {Thierry Fontenelle and Philippe Hiligsmann and Archibald Michiels and André Moulin and Siegfried Theissen},
publisher = {Euralex},
isbn = {2-87233-091-7},
}