sORFs
Small open reading frames (sORFs) are short stretches of nucleotides that can be translated into small peptides, typically encoding fewer than 100 amino acids. They are found throughout genomes, including in annotated protein-coding genes, untranslated regions (5' UTRs and 3' UTRs), long noncoding RNAs, and intergenic regions. sORFs are widespread in bacteria, archaea, and eukaryotes, and in organellar genomes.
This class was historically neglected due to small size and biases against short coding sequences in gene
Contexts: uORFs in 5' UTRs can regulate downstream translation; overlapping sORFs may coincide with annotated CDSs;
Functionality: Some sORFs have validated roles in development, stress responses, and cellular signaling; others remain uncharacterized.
Annotation and resources: Specialized databases such as sORFs.org, OpenProt, and the sORF database curate predicted and