proteogenomic
Proteogenomics is an interdisciplinary field that integrates proteomics and genomics to improve genome annotation and to study protein expression. It combines large-scale mass spectrometry–based proteomics with genomic and transcriptomic data to provide direct evidence of protein products and to refine gene models. The central concept is to use peptide evidence identified by proteomics to validate, correct, or extend predicted protein-coding regions, and to discover novel coding regions not predicted by current annotations, such as alternative open reading frames, non-canonical translation, and alternative splicing events.
Workflow typically involves generating a customized protein sequence database derived from genomic sequences, transcript sequences, or
Applications include applications in human health and disease research, such as cancer proteogenomics to identify tumor-specific
Challenges include the large search space from expanded databases increasing the potential for false positives, reliance