gencode
GENCODE is a community-driven project that provides comprehensive annotation of gene features for the human and mouse reference genomes. The primary goal is to identify and curate all protein-coding genes, non-coding RNA genes, and pseudogenes, and to define the full set of transcripts and splice isoforms for each locus. Annotation includes gene structures, transcript coordinates, exon-intron boundaries, and metadata such as biotype and evidence.
The project combines automated computational annotation with manual curation. The manual component is carried out by
Data products from GENCODE include GTF/GFF3 annotation files, transcript and gene identifiers (such as ENSG/ENST IDs
GENCODE is closely linked to the ENCODE project and forms a foundational resource for gene-level annotation