PSSMs

Position-Specific Scoring Matrices (PSSMs) are matrices used in sequence analysis to quantify the likelihood of each residue at each position in a sequence profile. They encode position-specific substitution preferences derived from a multiple sequence alignment of related sequences, allowing a single matrix to capture conservation patterns across the region of interest. PSSMs are widely used to detect distant homologs and to score sequences against a protein family or motif.

Construction typically starts with a curated multiple sequence alignment of related sequences. For each position i,

Applications include scoring a query sequence by summing the scores for each position, enabling the detection

Types and relation to other concepts: PSSMs are commonly constructed for proteins (20 amino acids) but can

a

=

/

).

a

a

a

a

representations

implementations,

overrepresented

inter-positional