Elexbi, a Basic Tool for Bilingual Term Extraction from Spanish-Basque Parallel Corpora

By November 17, 2016,
Page 159-165
Author A. Gurrutxaga, X. Saralegi, S. Ugartetxea, Iñaki Alegria
Title Elexbi, a Basic Tool for Bilingual Term Extraction from Spanish-Basque Parallel Corpora
Abstract We present the work done by Elhuyar Foundation in the field of bilingual terminology extraction. The aim of this work is to develop some techniques for the automatic extraction of pairs of equivalent terms from Spanish-Basque translation memories, and to implement those techniques in a prototype. Our approach is based on a monolingual extraction of term candidates in each language, then the creation of candidate bigrams from both segments of the same translation unit, and, finally, the selection of the most likely pair of candidates, mainly by the use of statistical information (association measures) and cognates. In the first step, we use linguistic techniques for the extraction of term candidates. The result of our work is ELexBI, a prototype tool that can extract equivalent terms from Spanish-Basque translation memories. This work wants to be a contribution to corpus-based bilingual lexicography and terminology in Basque.
Session 3. COMPUTATIONAL LEXICOGRAPHY AND LEXICOLOGY
Keywords
BibTex
@InProceedings{ELX06-021,
author = {A. Gurrutxaga, X. Saralegi, S. Ugartetxea, Iñaki Alegria },
title = {Elexbi, a Basic Tool for Bilingual Term Extraction from Spanish-Basque Parallel Corpora },
pages = {159-165},
booktitle = {Proceedings of the 12th EURALEX International Congress},
year = {2006},
month = {sep},
date = {6-9},
address = {Torino, Italy},
editor = {Elisa Corino, Carla Marello, Cristina Onesti},
publisher = {Edizioni dell'Orso},
isbn = {88-7694-918-6},
}
Download