Scanpy

Scanpy is an open-source Python toolkit for the analysis of single-cell transcriptomics data. It is designed to handle large-scale scRNA-seq experiments and is centered on the AnnData data structure, which stores the expression matrix (cells by genes) along with per-cell and per-gene metadata and analysis results.

The library provides an end-to-end, modular workflow for common tasks in single-cell analysis. Typical preprocessing steps include data transformation, such as normalization and log transformation. Downstream steps include building a nearest-neighbor graph of cells and running diffusion-based analyses on that graph.

Scanpy emphasizes scalability through efficient sparse-matrix representations and integrates with the broader Python scientific stack.