percentidentity
Percent identity is a metric used in sequence analysis to quantify how similar two sequences are at the residue level after an alignment. It is commonly defined as the percentage of positions in the aligned region that contain identical symbols in the two sequences.
Calculation and interpretation can vary slightly by tool. Typically, percent identity is calculated as the number
Percent identity is used to infer relatedness and homology, to cluster sequences, and to assess the quality
Limitations include dependence on the alignment method, region selection, and sequence length. Short alignments tend to