Morphosyntactic structure of terms in Basque for automatic terminology extraction

By November 17, 2016,
Pages 373-381
Author Ruben Urizar, N. Ezeiza, I. Alegria
Title Morphosyntactic structure of terms in Basque for automatic terminology extraction
Abstract This paper describes the morphosyntactic patterns of technical terms in Basque and presents an architecture for a term-extraction tool. As Basque is a highly inflected agglutinative language, part-of-speech information is not enough to define term patterns. Morphological and syntactic information is essential to considerably reduce the number of structures. For example, a noun, an adverbial, a postpositive adjectival, the nominal form of a verb and even a determiner in the genitive case may work as a prepositive adjective; however, they all share the same syntactic function. Therefore, for the term-extraction tool to perform properly, the texts must first be morphosyntactically analysed and disambiguated. A shallow syntactic parser will then identify the previously described patterns.
Session PART 8 - Extraction of terminologically relevant multiword expressions
BibTeX
@InProceedings{ELX00-044,
author = {Ruben Urizar and N. Ezeiza and I. Alegria},
title = {Morphosyntactic structure of terms in Basque for automatic terminology extraction},
pages = {373--381},
booktitle = {Proceedings of the 9th EURALEX International Congress},
year = {2000},
month = aug,
date = {8-12},
address = {Stuttgart, Germany},
editor = {Ulrich Heid and Stefan Evert and Egbert Lehmann and Christian Rohrer},
publisher = {Institut für Maschinelle Sprachverarbeitung},
isbn = {3-00-006574-1},
}