Building a Portuguese Oenological Dictionary: from Corpus to Terminology via Co-occurrence Networks

Pages: 351-361
Authors: William Martinez, Sílvia Barbosa
Title: Building a Portuguese Oenological Dictionary: from Corpus to Terminology via Co-occurrence Networks
Abstract: This paper focuses on the elaboration of a dictionary of terms in the Portuguese language which describe the wine-tasting experience. We present a corpus-based analysis aimed at designing an electronic dictionary: on the basis of a compilation of approximately 21,000 wine descriptions downloaded from a dozen Portuguese websites, we estimated both by frequency analysis and lexicographical study which terms were recurrent, relevant and representative of the “hard to put into words” occupation that is oenology. From the results thus obtained, a list was made of words that describe the sensory analysis in its three main aspects: visual, olfactive and gustatory. An exhaustive co-occurrence analysis then identified those terms which contribute most to structuring the text by way of their tendency to attract other words against statistical odds. When displayed in a co-occurrence network, these anchors emerge from the mesh as the foundational lexicon for wine tasting, and can be evaluated as prime candidates for a distributional thesaurus.
Session: TERMINOLOGY, TERMINOGRAPHY AND SPECIALISED LEXICOGRAPHY
Keywords: collocations, co-occurrences, word network, corpus linguistics, oenology, terminology
BibTeX:
@InProceedings{ELX2018-029,
author={William Martinez and Sílvia Barbosa},
title={Building a Portuguese Oenological Dictionary: from Corpus to Terminology via Co-occurrence Networks},
pages={351--361},
booktitle={Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts},
year={2018},
month=jul,
date={17-21},
address={Ljubljana, Slovenia},
editor={Jaka Čibej and Vojko Gorjanc and Iztok Kosem and Simon Krek},
publisher={Ljubljana University Press, Faculty of Arts},
isbn={978-961-06-0097-8}
}