Beyond subcategorization acquisition – Multi-parameter extraction from German Text Corpora

By November 17, 2016,
Page 171-175
Author Kristina Spranger
Title Beyond subcategorization acquisition – Multi-parameter extraction from German Text Corpora
Abstract In this paper, we describe a subcategorization acquisition system for German. The envisaged machine-readable lexicon is useful for both NLP tools and lexicographers. The system focuses on subcategorization extraction without being limited to this task. It also provides distributional information, selectionaI preferences and hints for the detection of idioms and of support-verb-constractions and other collocations. Moreover, each lexical entry is presented together with its usage contexts provided in the form of corpus examples and each subcategorization frame is presented together with its relative frequency. Thus, much additional data are given to support the lexicographer in his selection task. Furthermore, we do not only extract pairs of valency carrier and valency filler(s), but we are able to extract an almost arbitrary number of different lexicographically relevant parameters: we provide the lexicographer (and NLP tools) with quite detailed information concerning the extracted structures, such as, for example, the determiner used in the noun phrase of a verb+object collocation(definite/indefinite/possessive/null).
Session Computational Lexicography and Lexicology
Keywords
BibTex
@InProceedings{ELX04-018,
author = {Kristina Spranger},
title = {Beyond subcategorization acquisition - Multi-parameter extraction from German Text Corpora },
pages = {171-175},
booktitle = {Proceedings of the 11th EURALEX International Congress},
year = {2004},
month = {july},
date = {6-10},
address = {Lorient, France},
editor = {Geoffrey Williams and Sandra Vessier},
publisher = {UniversiteĢ de Bretagne-Sud, FaculteĢ des lettres et des sciences humaines},
isbn = {29-52245-70-3},
}
Download