Beyond subcategorization acquisition – Multi-parameter extraction from German Text Corpora

Kristina Spranger

By admyn | November 17, 2016 | Euralex 2004, Publications

Page	171-175
Author	Kristina Spranger
Title	Beyond subcategorization acquisition – Multi-parameter extraction from German Text Corpora
Abstract	In this paper, we describe a subcategorization acquisition system for German. The envisaged machine-readable lexicon is useful for both NLP tools and lexicographers. The system focuses on subcategorization extraction without being limited to this task. It also provides distributional information, selectional preferences, and hints for the detection of idioms, support-verb constructions, and other collocations. Moreover, each lexical entry is presented together with its usage contexts, provided in the form of corpus examples, and each subcategorization frame is presented together with its relative frequency. Thus, much additional data are given to support the lexicographer in the selection task. Furthermore, we extract not only pairs of valency carrier and valency filler(s), but also an almost arbitrary number of different lexicographically relevant parameters: we provide the lexicographer (and NLP tools) with quite detailed information concerning the extracted structures, such as the type of determiner used in the noun phrase of a verb+object collocation (definite/indefinite/possessive/null).
Session	Computational Lexicography and Lexicology
Keywords
BibTeX	@InProceedings{ELX04-018, author = {Kristina Spranger}, title = {Beyond subcategorization acquisition - Multi-parameter extraction from German Text Corpora}, pages = {171-175}, booktitle = {Proceedings of the 11th EURALEX International Congress}, year = {2004}, month = {july}, date = {6-10}, address = {Lorient, France}, editor = {Geoffrey Williams and Sandra Vessier}, publisher = {Université de Bretagne-Sud, Faculté des lettres et des sciences humaines}, isbn = {29-52245-70-3}, }