Abstract |
This paper describes the morphosyntactic patterns of technical terms in Basque and presents an architecture for a term-extracting tool. As Basque is a highly inflected agglutinative language, part-of-speech information is not enough to define term patterns. The use of morphological and syntactic information is essential to reduce considerably the number of structures. For example, a noun, an adverbial, a postpositive adjectival, the nominal form of a verb and even a determiner in the genitive case may work as a prepositive adjective; however, they all share the same syntactic function. Therefore, for the termextracting tool to perform properly, the texts must be morphosyntactically analysed and disambiguated. Then a shallow syntactic parser will identify the previously described patterns. |
BibTex |
@InProceedings{ELX00-044, author = {Ruben Urizar, N. Ezeiza, I. Alegria}, title = {Morphosyntactic structure of terms in Basque for automatic terminology extraction}, pages = {373-381}, booktitle = {Proceedings of the 9th EURALEX International Congress}, year = {2000}, month = {aug}, date = {8-12}, address = {Stuttgart, Germany}, editor = {Ulrich Heid, Stefan Evert, Egbert Lehmann, Christian Rohrer}, publisher = {Institut für Maschinelle Sprachverarbeitung}, isbn = {3-00-006574-1}, } |