PSSMs
Position-Specific Scoring Matrices (PSSMs) are matrices used in sequence analysis to quantify the likelihood of each residue at each position in a sequence profile. They encode position-specific substitution preferences derived from a multiple sequence alignment of related sequences, allowing a single matrix to capture conservation patterns across the region of interest. PSSMs are widely used to detect distant homologs and to score sequences against a protein family or motif.
Construction typically starts with a curated multiple sequence alignment of related sequences. For each position i,
Applications include scoring a query sequence by summing the scores for each position, enabling the detection
Types and relation to other concepts: PSSMs are commonly constructed for proteins (20 amino acids) but can