ComBatseq
ComBat-seq is a statistical method for removing batch effects from RNA sequencing count data. It extends the ComBat framework, originally developed for Gaussian data, to handle count data by modeling gene expression counts with a negative binomial distribution and incorporating batch indicators into the mean structure. The goal is to separate technical variation attributable to batch from genuine biological signal, improving comparability across samples processed in different batches.
The method operates within an empirical Bayes framework that borrows strength across genes to stabilize batch
ComBat-seq is implemented in the sva package as ComBat_seq and is designed for practical use with RNA-seq
Limitations include the potential for residual batch effects if batch is confounded with biological conditions and