Proteogenomics
Proteogenomics is an interdisciplinary field that integrates proteomics with genomics and transcriptomics to improve genome annotation and to discover novel protein-coding events. In bottom-up proteomics, proteins are digested into peptides, which are identified by tandem mass spectrometry and can then be mapped back to a genome or transcriptome. By combining these data, researchers can validate predicted genes, refine exon–intron boundaries, and discover translation events not captured by conventional annotation, such as alternative start sites, alternative splicing, or translation of regions previously annotated as noncoding.
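The idea of mapping an identified peptide back to a genome can be illustrated with a minimal Python sketch. It assumes the standard genetic code, a short DNA string held in memory, and exact string matching; the function names (reverse_complement, translate, map_peptide) are hypothetical and chosen only for this illustration. It is not a description of any particular proteogenomics tool, and it would not locate peptides that span splice junctions, which require transcript-level or splice-aware databases.

```python
# Minimal sketch: locate a peptide in the six-frame translation of a DNA string.
# Assumes the standard genetic code and exact matching (illustrative only).

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def translate(seq):
    """Translate complete codons; unknown codons (e.g. with N) become 'X'."""
    return "".join(CODON_TABLE.get(seq[i:i + 3], "X")
                   for i in range(0, len(seq) - 2, 3))


def map_peptide(peptide, dna):
    """Return (strand, frame, start, end) hits for a peptide.

    Coordinates are 0-based, half-open, and always given on the forward
    strand, so dna[start:end] (reverse-complemented for '-') encodes the
    peptide.
    """
    dna = dna.upper()
    n = len(dna)
    hits = []
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for frame in range(3):
            protein = translate(seq[frame:])
            idx = protein.find(peptide)
            while idx != -1:
                s = frame + 3 * idx           # start on the scanned strand
                e = s + 3 * len(peptide)      # end on the scanned strand
                if strand == "-":
                    s, e = n - e, n - s       # convert to forward coordinates
                hits.append((strand, frame, s, e))
                idx = protein.find(peptide, idx + 1)
    return hits


if __name__ == "__main__":
    print(map_peptide("MAK", "ATGGCCAAGTAA"))   # [('+', 0, 0, 9)]
    print(map_peptide("MAK", "TTACTTGGCCAT"))   # [('-', 0, 3, 12)]
```

In practice the search runs in the other direction: spectra are matched against a protein database derived from the genome or transcriptome, and only the identified peptides are mapped back to coordinates. The sketch covers only that final mapping step.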
Proteogenomic workflows typically start with genomic and transcriptomic data to generate a protein sequence database, often
Applications include improved genome annotation in non-model organisms, cancer research for identifying tumor-specific neoantigens or personalized
Challenges include managing large, sample-specific databases, distinguishing true translation from noise, biases in proteomics data, and