Abstract |
In this paper, we describe a subcategorization acquisition system for German. The envisaged machine-readable lexicon is useful for both NLP tools and lexicographers. The system focuses on subcategorization extraction without being limited to this task. It also provides distributional information, selectionaI preferences and hints for the detection of idioms and of support-verb-constractions and other collocations. Moreover, each lexical entry is presented together with its usage contexts provided in the form of corpus examples and each subcategorization frame is presented together with its relative frequency. Thus, much additional data are given to support the lexicographer in his selection task. Furthermore, we do not only extract pairs of valency carrier and valency filler(s), but we are able to extract an almost arbitrary number of different lexicographically relevant parameters: we provide the lexicographer (and NLP tools) with quite detailed information concerning the extracted structures, such as, for example, the determiner used in the noun phrase of a verb+object collocation(definite/indefinite/possessive/null). |
BibTex |
@InProceedings{ELX04-018, author = {Kristina Spranger}, title = {Beyond subcategorization acquisition - Multi-parameter extraction from German Text Corpora }, pages = {171-175}, booktitle = {Proceedings of the 11th EURALEX International Congress}, year = {2004}, month = {july}, date = {6-10}, address = {Lorient, France}, editor = {Geoffrey Williams and Sandra Vessier}, publisher = {UniversiteĢ de Bretagne-Sud, FaculteĢ des lettres et des sciences humaines}, isbn = {29-52245-70-3}, } |