Abstract

COMLEX Syntax is a large on-line syntactic dictionary of English (38,000 head words), developed at New York University under the auspices of the Linguistic Data Consortium (LDC). It was envisioned primarily as a tool to aid computers in parsing natural language. To this end, it contains an exceptionally detailed set of syntactic features and complements for the major parts of speech (nouns: 9 features and 9 complements; adjectives: 7 features and 14 complements; verbs: 5 features and 92 complements). For 750 common verbs, the lexicon also contains references to citations (tags) in a large corpus (100 MB), which is also available from the LDC. These citations can be used to gather frequency-of-occurrence statistics for the complements of these verbs, and they have also served as a quality check on the dictionary. COMLEX Syntax Version 2.2 (the tagged version) is now available to members of the LDC for research and commercial purposes with minimal legal encumbrances.
