Fishervectors
Fishervectors are a powerful image representation technique used in computer vision and machine learning. They extend the concept of bag-of-visual-words (BoVW) models by incorporating gradient information from local feature descriptors. Instead of simply counting occurrences of visual words, Fishervectors represent images as points in a high-dimensional space defined by the gradients of a generative model, typically a Gaussian Mixture Model (GMM).
The process begins by extracting local features, such as SIFT (Scale-Invariant Feature Transform) or SURF (Speeded
The resulting Fishervector is a dense, high-dimensional descriptor that encodes richer information than a simple BoVW