The CPLP Corpus: A Pluricentric Corpus for the Common Portuguese Spelling Dictionary (VOC)

Page835-840
AuthorMaarten Janssen, Tanara Zingano Kuhn, José Pedro Ferreira, Margarita Correia
TitleThe CPLP Corpus: A Pluricentric Corpus for the Common Portuguese Spelling Dictionary (VOC)
AbstractThe Pluricentric Corpus of the Portuguese Language (CPLP Corpus) aims to provide comparable corpora for the national varieties of the countries where Portuguese is an official language, making it possible to undertake corpus-based comparisons among the varieties of these countries. It is intended as a publicly available corpus for comparative linguistics and language resource development, but furthermore constitutes one of the pillars of the Vocabulário Ortográfico Comum da Língua Portuguesa (VOC), the official spelling dictionary for Portuguese. The headword list in VOC is partly derived from lexicographic tradition, which is to date based almost exclusively on the European and Brazilian varieties, and partly made up of words retrieved from the CPLP corpus, many of them included for the first time in official language resources for Portuguese. This double inclusion route aims at presenting an integral (i.e., non-contrastive) and increasingly balanced perspective on all the varieties. This paper describes the general design of the corpus, the challenges faced in its development, as well as the way it was used in the compilation of VOC.
SessionPoster Presentations
Keywordscorpus, pluricentric languages, Portuguese, spelling dictionaries
BibTex
@InProceedings{ELX2018-069,
author={Maarten Janssen, Tanara Zingano Kuhn, José Pedro Ferreira, Margarita Correia},
title={The CPLP Corpus: A Pluricentric Corpus for the Common Portuguese Spelling Dictionary (VOC)},
pages={835-840},
booktitle={Proceedings of the XVIII EURALEX International Congress: Lexicography in Global Contexts},
year={2018},
month={jul},
date={17-21},
address={Ljubljana, Slovenia},
editor={Jaka Čibej, Vojko Gorjanc, Iztok Kosem, Simon Krek},
publisher={Ljubljana University Press, Faculty of Arts},
isbn={978-961-06-0097-8}, }
Download