collostructions
Collostructions are statistical associations between lexical items and syntactic constructions in corpus linguistics and construction grammar. The term describes how often a given word tends to occur in a particular syntactic construction compared with what would be expected if words were distributed independently of constructions. Researchers extract instances of a construction from a corpus, tally the occurrence of each lexical item in the construction's slot(s), and compare observed counts to expected counts under a null model, using statistics such as log-likelihood ratio, Pearson's chi-square, or Fisher's exact test. The result is a collostruction strength, often expressed as a measure of excitatory (more frequent than expected) or inhibitory (less frequent than expected) association. A positive strength indicates that the item preferentially co-occurs with the construction; a negative strength suggests avoidance.
Collostruction analysis supports construction grammar by providing empirical distributional evidence for the association between words and
Limitations include sensitivity to corpus size, sampling bias, polysemy, and the alignment of a word form to