GlyXYMuster
GlyXYMuster is a bioinformatics toolkit for identifying and characterizing Gly-X-Y repeats in protein sequences, with a focus on collagen-like proteins that adopt a triple-helical structure. The Gly-X-Y motif is a repeating tripeptide pattern in which glycine occupies every third position and X and Y are variable residues; in collagens, X is frequently proline and Y is frequently hydroxyproline. The project provides an integrated workflow for detecting these patterns, annotating them within protein sequences, and summarizing their distribution across datasets.
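As an illustration of the kind of scan described above, the following minimal sketch finds uninterrupted runs of Gly-X-Y tripeptides in a one-letter amino acid sequence using a regular expression. It is not GlyXYMuster's own code or API: the names `GxyRun` and `find_gxy_runs` and the `min_repeats` threshold are hypothetical, chosen only for this example. It also assumes that any uppercase letter may stand for X or Y, which accommodates the non-standard code O sometimes used for hydroxyproline.

```python
import re
from typing import List, NamedTuple


class GxyRun(NamedTuple):
    """One uninterrupted run of Gly-X-Y tripeptides (hypothetical record type)."""
    start: int    # 1-based position of the first glycine in the run
    end: int      # 1-based position of the last residue in the run
    repeats: int  # number of consecutive Gly-X-Y tripeptides


def find_gxy_runs(sequence: str, min_repeats: int = 3) -> List[GxyRun]:
    """Return runs of at least `min_repeats` consecutive G-X-Y tripeptides.

    Assumption: the sequence uses one-letter codes, and X and Y may be
    any letter (including G itself).
    """
    seq = sequence.upper()
    # (?:G[A-Z]{2}) matches one tripeptide; {n,} requires n or more in a row.
    pattern = re.compile(r"(?:G[A-Z]{2}){%d,}" % min_repeats)
    runs = []
    for match in pattern.finditer(seq):
        length = match.end() - match.start()
        runs.append(GxyRun(match.start() + 1, match.end(), length // 3))
    return runs


if __name__ == "__main__":
    # Toy sequence: three tripeptides (GPP, GAP, GEK) flanked by other residues.
    for run in find_gxy_runs("MKTGPPGAPGEKAAA"):
        print(run.start, run.end, run.repeats)  # prints: 4 12 3
```

Because the regular expression scans left to right and matches greedily, each run is reported in the reading frame of the first glycine at which a qualifying run begins. A tool aimed at real collagen sequences would likely also need to tolerate short interruptions in the repeat, which this sketch does not attempt.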
Key capabilities include sequence scanning for Gly-X-Y tripeptides, reporting the number, start and end positions, and
Implementation and availability: GlyXYMuster is implemented in Python with optional R components for advanced visualization. It