Perplexity

Perplexity is a statistical measure used in information theory and natural language processing to evaluate how well a probability model predicts a sample. In the context of language modeling, it assesses how surprised a model is by a test set of words.

Formally, for a test sequence of N words W = w_1, w_2, ..., w_N, and a model that assigns probabilities P(w_i | w_1, ..., w_{i-1}) to each word, the perplexity is defined as

PP(W) = P(W)^{-1/N} = exp( -(1/N) * sum_{i=1}^{N} log P(w_i | w_1, ..., w_{i-1}) )

The base of the logarithm sets the units of the underlying average log-loss: with natural logarithms the outer function is exp, while with base-2 logarithms (bits) it is 2 raised to that average; the resulting perplexity is the same either way. A lower perplexity indicates that the model assigns higher average probability to the observed sequence and thus predicts it more accurately.

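As a concrete illustration, here is a minimal Python sketch of that formula. The function name and the per-token probabilities are hypothetical, chosen only so the arithmetic comes out to a round number; a real evaluation would take the conditional probabilities from a trained model.

import math

def perplexity(token_probs):
    # token_probs[i] is the model's conditional probability P(w_i | w_1, ..., w_{i-1}).
    n = len(token_probs)
    # Average negative log-probability in nats, then exponentiate (matches the formula above).
    avg_neg_log_prob = -sum(math.log(p) for p in token_probs) / n
    return math.exp(avg_neg_log_prob)

# Hypothetical probabilities a model might assign to a 4-word test sequence.
probs = [0.25, 0.125, 0.0625, 0.125]
print(perplexity(probs))  # ~8.0: on average the model is as uncertain as a uniform choice among 8 words
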
Perplexity is closely related to cross-entropy and entropy. Specifically, perplexity equals exp(H), where H is the cross-entropy between the true distribution and the model measured in natural units; with base-2 logarithms it is 2^{H_2}, where H_2 is the same cross-entropy measured in bits. Perplexity is not itself a probability, but a transformed measure of predictive uncertainty.

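To see that equivalence numerically, the short sketch below reuses the hypothetical probabilities from the earlier example: it estimates the cross-entropy once in nats and once in bits, and exponentiating with exp or with 2 yields the same perplexity.

import math

# Hypothetical per-token probabilities, as in the earlier sketch.
probs = [0.25, 0.125, 0.0625, 0.125]

h_nats = -sum(math.log(p) for p in probs) / len(probs)   # cross-entropy estimate in nats
h_bits = -sum(math.log2(p) for p in probs) / len(probs)  # the same estimate in bits

print(math.exp(h_nats))  # ~8.0
print(2 ** h_bits)       # ~8.0, since H_bits = H_nats / ln 2 and 2**(H_nats / ln 2) == exp(H_nats)
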
Practical considerations include its dependence on vocabulary size and smoothing methods. Large vocabularies can inflate perplexity even for strong models, and perplexity can be misleading if the test set is not representative of the intended domain. It remains a standard benchmark for comparing language models and guiding development, though it should be interpreted alongside qualitative assessments of text generation.