Building a Portuguese Oenological Dictionary: from Corpus to Terminology via Co-occurrence Networks

Page 351-361
Author William Martinez, Sílvia Barbosa
Title Building a Portuguese Oenological Dictionary: from Corpus to Terminology via Co-occurrence Networks
Abstract This paper focuses on the elaboration of a dictionary of terms in the Portuguese language which describe the wine-tasting experience. We present a corpus-based analysis aimed at designing an electronic dictionary: on the basis of a compilation of approximately 21,000 wine descriptions downloaded from a dozen Portuguese websites, we estimated both by frequency analysis and lexicographical study which terms were recurrent, relevant and representative of the “hard to put into words” occupation that is oenology. From the results thus obtained, a list was made of words that describe the sensory analysis in its three main aspects: visual, olfactive and gustatory. An exhaustive co-occurrence analysis then identified those terms which contribute most to structuring the text by way of their tendency to attract other words against statistical odds. When displayed in a co-occurrence network, these anchors emerge from the mesh as the foundational lexicon for wine tasting, and can be evaluated as prime candidates for a distributional thesaurus.
Session TERMINOLOGY, TERMINOGRAPHY AND SPECIALISED LEXICOGRAPHY
Keywords collocations, co-occurrences, word network, corpus linguistics, oenology, terminology
BibTex
@InProceedings{ELX2018-029,
author={William Martinez, Sílvia Barbosa},
title={Building a Portuguese Oenological Dictionary: from Corpus to Terminology via Co-occurrence Networks},
pages={351-361},
booktitle={Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts},
year={2018},
month={jul},
date={17-21},
address={Ljubljana, Slovenia},
editor={Jaka Čibej, Vojko Gorjanc, Iztok Kosem, Simon Krek},
publisher={Ljubljana University Press, Faculty of Arts},
isbn={978-961-06-0097-8}, }
Download