Reuse of Lexicographic Data for a Multipurpose Pronunciation Database and Phonetic Transcription Generator for Regional Variants of Portuguese

Simone Ashby; José Pedro Ferreira

Reuse of Lexicographic Data for a Multipurpose Pronunciation Database and Phonetic Transcription Generator for Regional Variants of Portuguese

By admynNovember 17, 2016Euralex 2010, Publications

Page	241-244
Author	Simone Ashby, José Pedro Ferreira
Title	Reuse of Lexicographic Data for a Multipurpose Pronunciation Database and Phonetic Transcription Generator for Regional Variants of Portuguese
Abstract	Among the benefits of a flexible and modular lexical database are: the facility of building new modules from existing ones, the reuse of lexicographic data to both enhance the user experience and achieve NLP aims, the time saved in accomplishing these objectives, and the economy that comes from minimizing redundancy (van der Eijk, Bloksma, and van der Kraan 1992). LUPo, or the Portuguese Unisyn Lexicon, is one of the first speech-dedicated applications to take full advantage of a collection of lexical resources as the basis for a text-to-speech system. Consisting of a pronunciation lexicon and rule system for generating accent-specific phonetic transcriptions for Portuguese, LUPo circumvents the cost of producing high-quality phonetic transcriptions by hand, while attracting a wider pan Lusophone audience to the lexical database in which it resides, and providing the research community with a vast resource of Portuguese accent data for evaluating speech applications and testing theories.
Session	Computational Lexicography and Lexicology
Keywords
BibTex	@InProceedings{ELX10-007, author = {Simone Ashby, José Pedro Ferreira}, title = {Reuse of Lexicographic Data for a Multipurpose Pronunciation Database and Phonetic Transcription Generator for Regional Variants of Portuguese}, pages = {241-244}, booktitle = {Proceedings of the 14th EURALEX International Congress}, year = {2010}, month = {jul}, date = {6-10}, address = {Leeuwarden/Ljouwert, The Netherlands}, editor = {Anne Dykstra and Tanneke Schoonheim}, publisher = {Fryske Akademy}, isbn = {978-90-6273-850-3}, }
Download

Reuse of Lexicographic Data for a Multipurpose Pronunciation Database and Phonetic Transcription Generator for Regional Variants of Portuguese

Contact data

EURALEX address

EURALEX is supported by

Quick message