Next generation sequencing (NGS) may be an alternative to phenotypic susceptibility testing for surveillance and clinical diagnosis. However, current bioinformatics methods may be associated with false positives and negatives. In this study, a novel mapping method was developed and benchmarked to two different methods in current use for identification of antibiotic resistance genes in bacterial WGS data.Methods
A novel method, KmerResistance, which examines the co-occurrence of k-mers between the WGS data and a database of resistance genes, was developed. The performance of this method was compared with two previously described methods; ResFinder and SRST2, which use an assembly/BLAST method and BWA, respectively, using two datasets with a total of 339 isolates, covering five species, originating from the Oxford University Hospitals NHS Trust and Danish pig farms. The predicted resistance was compared with the observed phenotypes for all isolates. To challenge further the sensitivity of the in silico methods, the datasets were also down-sampled to 1% of the reads and reanalysed.Results
The best results were obtained by identification of resistance genes by mapping directly against the raw reads. This indicates that information might be lost during assembly. KmerResistance performed significantly better than the other methods, when data were contaminated or only contained few sequence reads.Conclusions
Read mapping is superior to assembly-based methods and the new KmerResistance seemingly outperforms currently available methods particularly when including datasets with few reads.