Abstract |
With the log files of online dictionaries, in which all submitted user queries are stored, dictionary authors have for the first time in the history of dictionary building direct access to the users' requests, in this article, we show 1) how to use the log file to evaluate the current contents of an online dictionary and 2) how to choose the most promising corpus type for enlarging it according to the users' needs. We use the example of a German- Slovenian online dictionary for this work. As a result of the first evaluation, we detect that the dictionary does not fulfill the users' needs in coverage colloquial and vulgar language, as well as in words and expressions used in everyday life. The result of the second evaluation confirms the importance of this part of the vocabulary, in an overall comparison of queries and corpora, a fiction corpus, which by its nature contains also colloquial language, yields a better result than a newspaper corpus and a non-fiction corpus. |
BibTex |
@InProceedings{ELX04-028, author = {Primož Jakopin, Birte Lönneker}, title = {Query-driven dictionary enhancement }, pages = {273-284}, booktitle = {Proceedings of the 11th EURALEX International Congress}, year = {2004}, month = {july}, date = {6-10}, address = {Lorient, France}, editor = {Geoffrey Williams and Sandra Vessier}, publisher = {Université de Bretagne-Sud, Faculté des lettres et des sciences humaines}, isbn = {29-52245-70-3}, } |