Stylometry - Infinite Lexicon

Stylometry

Stylometry is the quantitative analysis of writing style used to make inferences about authorship, provenance, or stylistic development. It treats text as a data object and seeks measurable, repeatable features that can differentiate authors or track changes over time. Commonly analyzed features include word frequencies, function word usage, character and word n-grams, punctuation patterns, syntax, and lexical richness.
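As a rough illustration of how a few of these features can be measured, the sketch below computes function-word relative frequencies, character n-gram counts, and a type-token ratio (one simple measure of lexical richness). The function names, the tokenization rule, and the ten-word function-word list are assumptions chosen for illustration; actual studies typically use longer, curated word lists and more careful tokenization.

```python
import re
from collections import Counter

# Hypothetical, deliberately short function-word list (an assumption for this sketch).
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in", "that", "it", "is", "was"]


def tokenize(text):
    """Lowercase the text and split it into word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", text.lower())


def function_word_profile(text):
    """Relative frequency of each function word, in FUNCTION_WORDS order."""
    tokens = tokenize(text)
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty input
    return [counts[w] / total for w in FUNCTION_WORDS]


def char_ngrams(text, n=3):
    """Count overlapping character n-grams after collapsing whitespace."""
    normalized = re.sub(r"\s+", " ", text.lower())
    return Counter(normalized[i:i + n] for i in range(len(normalized) - n + 1))


def type_token_ratio(text):
    """Distinct word types divided by total tokens."""
    tokens = tokenize(text)
    return len(set(tokens)) / len(tokens) if tokens else 0.0


sample = "The cat sat on the mat, and the dog sat by the door."
profile = function_word_profile(sample)
trigrams = char_ngrams(sample, n=3)
richness = type_token_ratio(sample)
```

The type-token ratio is sensitive to text length, so comparisons are usually made between samples of similar size.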

Methods typically involve feature extraction followed by statistical or machine learning modeling. Supervised approaches train classifiers on texts of known authorship and apply them to texts whose authorship is in question.
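The two-step pipeline of feature extraction followed by a supervised model can be sketched with a nearest-centroid classifier over function-word frequencies. This is only one simple choice of model, picked here for brevity; the author labels, training sentences, word list, and function names are all invented for illustration, and a real study would need far more text per author.

```python
import re
from collections import Counter

# Hypothetical function-word list (an assumption for this sketch).
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in", "that", "it", "is", "was"]


def profile(text):
    """Feature extraction: relative frequency of each function word."""
    tokens = re.findall(r"[a-z']+", text.lower())
    counts = Counter(tokens)
    total = len(tokens) or 1
    return [counts[w] / total for w in FUNCTION_WORDS]


def train_centroids(labeled_texts):
    """Training: average the feature vectors of each author's texts."""
    grouped = {}
    for author, text in labeled_texts:
        grouped.setdefault(author, []).append(profile(text))
    return {
        author: [sum(column) / len(vectors) for column in zip(*vectors)]
        for author, vectors in grouped.items()
    }


def predict(centroids, text):
    """Prediction: return the author whose centroid is closest (Manhattan distance)."""
    vector = profile(text)

    def distance(centroid):
        return sum(abs(x - y) for x, y in zip(vector, centroid))

    return min(centroids, key=lambda author: distance(centroids[author]))


# Invented toy data; the labels "author_a" and "author_b" are placeholders.
training = [
    ("author_a", "The ship was in the harbour and the crew was on the deck."),
    ("author_a", "The storm was over and the sea was calm in the morning."),
    ("author_b", "It is a truth that it is hard to go to a place of rest."),
    ("author_b", "It is said that a friend is a gift to a weary traveller."),
]
centroids = train_centroids(training)
guess = predict(centroids, "The captain was in the cabin and the mate was at the wheel.")
```

With samples this small the result says nothing about real authorship; the point is only the shape of the workflow, in which the same feature function is applied to both training texts and the questioned text.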

Applications span authorship attribution of disputed texts (forensic linguistics), plagiarism detection, literary analysis, and historical document authentication.

Limitations include the influence of topic, genre, translation, and period on style, as well as data scarcity. Cross-validation helps estimate how well a model's results hold up on unseen texts (generalization), and documenting corpora, features, and settings supports reproducibility.