Corpus Frequency and Lexicographical Relevancy – Czech Words with a Morfem Micro- (in Hundred Million Corpus of Czech Language – SYN2000)

By November 17, 2016,
Page 209-220
Author Michal Šulc
Title Corpus Frequency and Lexicographical Relevancy – Czech Words with a Morfem Micro- (in Hundred Million Corpus of Czech Language – SYN2000)
Abstract Using the example of a Czech morpheme "micro", the paper shows several possibilities for how to decisively use language features to incorporate the item into a draft of dictionary-entry list. Quantity of occurrences, quantity of lemmas, frames of word-formation, parts of speech and especially quantity of lemmas inside frequency ranks are discussed. The result of the paper is that we can use a graph showing the quantity of lemmas inside frequency ranks as linguistic evidence and make the boundary for the draft entry list accordingly.
Session Computational Lexicography and Lexicology
Keywords
BibTex
@InProceedings{ELX02-020,
author = {Michal Šulc},
title = {Corpus Frequency and Lexicographical Relevancy - Czech Words with a Morfem Micro- (in Hundred Million Corpus of Czech Language - SYN2000)},
pages = {209-220},
booktitle = {Proceedings of the 10th EURALEX International Congress},
year = {2002},
month = {aug},
date = {13-17},
address = {København, Denmark},
editor = {Anna Braasch and Claus Povlsen},
publisher = {Center for Sprogteknologi},
isbn = {87-90708-09-1},
}
Download