1CNRS & D.I., UMR 8548, École Normale Supérieure, Paris, France2Commissariat à l'Energie Atomique et aux Energies Alternatives, Direction de la Recherche Fondamentale, Genoscope3UMR 8030, Centre National de la Recherche Scientifique4Université Paris-Saclay, Evry, France
Checking for direct PDF access through Ovid
MotivationNew long read sequencers promise to transform sequencing and genome assembly by producing reads tens of kilobases long. However, their high error rate significantly complicates assembly and requires expensive correction steps to layout the reads using standard assembly engines.ResultsWe present an original and efficient spectral algorithm to layout the uncorrected nanopore reads, and its seamless integration into a straightforward overlap/layout/consensus (OLC) assembly scheme. The method is shown to assemble Oxford Nanopore reads from several bacterial genomes into good quality (˜99% identity to the reference) genome-sized contigs, while yielding more fragmented assemblies from the eukaryotic microbe Sacharomyces cerevisiae.Availability and implementationhttps://github.com/antrec/spectrassembler.Contactantoine.firstname.lastname@example.orgSupplementary InformationSupplementary data are available at Bioinformatics online.