Term candidate extraction for terminography and CAT: an overview of TTC

By November 17, 2016,
Page585-594
AuthorUlrich Heid, Anita Gojun
TitleTerm candidate extraction for terminography and CAT: an overview of TTC
AbstractIn this paper, we present a tool chain for terminology extraction and term alignment which is under development in the EU-project TTC.1 The tool components comprise the crawling of domain-specific text from the internet, in different languages, linguistic pre-processing of the corpus collected in this way, and the extraction of term candidates. Extracted term candidates of two languages are aligned into pairs of source and target term equivalents. This output can be used both in interactive translation setups (e.g. computer-aided translation) and in machine translation.
SessionTerminology, LSP and lexicography
KeywordsTerminology extraction, computer-assisted translation, term alignment.
BibTex
@InProceedings{ELX12-047,
author = {Ulrich Heid and Anita Gojun},
title = {Term candidate extraction for terminography and CAT: an overview of TTC},
pages = {585--594},
booktitle = {Proceedings of the 15th EURALEX International Congress},
year = {2012},
month = {aug},
date = {7-11},
address = {Oslo,Norway},
editor = {Ruth Vatvedt Fjeld and Julie Matilde Torjusen},
publisher = {Department of Linguistics and Scandinavian Studies, University of Oslo},
isbn = {978-82-303-2228-4},
}
Download