Abstract |
One of the problems facing the user of a bilingual dictionary is producing multiword expressions and phrases in the target language when the explicit phrasal translation does not appear in the dictionary. Defining collocations as the preferred choice of words for expressing the desired concept, the DECIDE project has been exploring how collocational information from mono- and bilingual dictionaries and raw text corpora can be discovered, extracted and stored online. During the project, we have developed tools for identifying potential collocations from raw text; for marking up English, French and German text for use in an interactive corpus query tool; for accessing lexical and grammatical patterns over such a corpus via this corpus query tool; for accessing collocations derived from online bilingual dictionaries; and for documenting such collocations using available text corpora. Finally, we have produced a common interface to these textual, corpus and dictionary tools, and used this interface to create a multilingual lexicon of the collocational choices of support verbs for nominalizations of speech act verbs. This paper presents an overview of this European Union sponsored project, its objectives, its methodology, and its results. |
BibTex |
@InProceedings{ELX96_1-014, author = {Gregory Grefenstette, Ulrich Heid, Bruno Maximilian Schulze, Thierry Fontenelle, Claire Gera}, title = {The DECIDE Project: Multilingual Collocation Extraction}, pages = {101-106}, booktitle = {Proceedings of the 7th EURALEX International Congress}, year = {1996}, month = {aug}, date = {13-18}, address = {Göteborg, Sweden}, editor = {Martin Gellerstam, Jerker Järborg, Sven-Göran Malmgren, Kerstin Norén, Lena Rogström, Catalina Röjder Papmehl}, publisher = {Novum Grafiska AB}, isbn = {91-87850-14-1}, } |