Lymfasta
Lymfasta is a fictional open-access dataset designed for teaching and benchmarking in bioinformatics. The name combines lymphatic biology with the FASTA sequence format, reflecting its focus on sequences associated with the lymphatic system. The collection includes nucleotide and amino acid sequences drawn from vertebrate markers and genes relevant to lymphatic tissues such as lymph nodes, spleen, and thymus. Each entry carries metadata fields for accession, species, tissue, gene symbol, and functional annotation, and sequences are provided in FASTA format as well as machine-readable JSON or CSV files.
Lymfasta is used to illustrate common data-handling tasks, including sequence alignment, motif discovery, primer design, and
The project originated in 2010 as a teaching resource and is maintained by a community of curators