Canu

Canu is a free, open-source genome assembler designed for long-read sequencing data, such as those produced by Pacific Biosciences SMRT sequencing and Oxford Nanopore Technologies. It is a continuation of legacy Celera Assembler work and is widely used for de novo genome assembly, including large and complex genomes. Canu focuses on handling the high error rates characteristic of long reads and aims to produce accurate assemblies from single data sets or mixed data.

The Canu workflow consists of three main stages: read error correction, read trimming, and assembly. The software

Canu runs on Unix-like operating systems and is typically deployed on high-performance computing infrastructure due to

Impact and usage: since its introduction, Canu has become a standard tool in de novo genome assembly,

overlap-layout-consensus

reference-grade