Abstract |
The paper discusses problems related to the compilation of the first Lithuanian dictionary of collocations. The problems encountered are both theoretical and empirical in nature. Theoretically, it is important to differentiate between collocations consisting of nodes and their collocates on one hand and collocational strings, i.e. clearcut real-text chains of words, on the other. The latter notion is applied as the basis of the method for the extraction of collocations from the corpus of the Lithuanian language. Empirically, the statistical output, i.e. collocations extracted from the corpus of 100 million running words, is described from the point of view of their grammatical form, lexical autonomy and boundaries, and the steps taken to transform the list of statistical collocational strings into the list of linguistically acceptable collocations or phrases are outlined. |
BibTex |
@InProceedings{ELX04-080, author = {Rûta Marcinkevičiené}, title = {Dictionary of Lithuanian phrases }, pages = {739-748}, booktitle = {Proceedings of the 11th EURALEX International Congress}, year = {2004}, month = {july}, date = {6-10}, address = {Lorient, France}, editor = {Geoffrey Williams and Sandra Vessier}, publisher = {Université de Bretagne-Sud, Faculté des lettres et des sciences humaines}, isbn = {29-52245-70-3}, } |