
RFPlasmid: Predicting plasmid sequences from short read assembly data using machine learning

By Linda van der Graaf van Bloois, Jaap A Wagenaar, Aldert L. Zomer

Posted 02 Aug 2020
bioRxiv DOI: 10.1101/2020.07.31.230631

Antimicrobial resistance (AMR) genes in bacteria are often carried on plasmids, and these plasmids can transfer AMR genes between bacteria. For molecular epidemiology and risk assessment, it is important to know whether the genes are located on highly transferable plasmids or on the more stable chromosomes. However, draft whole genome sequences are fragmented, making it difficult to discriminate between plasmid and chromosomal contigs. Current methods that predict plasmid sequences from draft genome sequences rely on single features, such as k-mer composition, circularity of the DNA molecule, copy number, or sequence identity to plasmid replication genes. All of these have drawbacks, especially when faced with large single-copy plasmids, which often carry resistance genes. With our newly developed prediction tool RFPlasmid, we use a combination of multiple features, including k-mer composition and databases with plasmid and chromosomal marker proteins, to predict whether the likely source of a contig is plasmid or chromosomal. RFPlasmid supports models for 17 different bacterial species, including Campylobacter, E. coli, and Salmonella, and has a species-agnostic model for metagenomic assemblies or unsupported organisms. RFPlasmid is available both as a standalone tool and via a web interface.

Competing Interest Statement

The authors have declared no competing interest.
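The abstract names k-mer composition as one of the features used to classify contigs. As a rough illustration of what such a feature can look like, the sketch below turns a contig into a fixed-length vector of relative k-mer frequencies. This is not RFPlasmid's code: the function name `kmer_frequencies`, the default `k=4`, and the handling of ambiguous bases are all assumptions made for the example.

```python
from collections import Counter
from itertools import product


def kmer_frequencies(sequence, k=4):
    """Return relative frequencies of all 4**k DNA k-mers in a contig.

    Hypothetical helper for illustration only; k=4 is an arbitrary choice.
    """
    sequence = sequence.upper()
    # Count every overlapping window of length k.
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))

    # Fixed ordering of all k-mers over A, C, G, T, so every contig
    # produces a feature vector of the same length.
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]

    # Windows containing other symbols (e.g. N) are ignored in the total.
    total = sum(counts[kmer] for kmer in kmers)
    if total == 0:
        return {kmer: 0.0 for kmer in kmers}
    return {kmer: counts[kmer] / total for kmer in kmers}


if __name__ == "__main__":
    features = kmer_frequencies("ACGTACGTTTGACA", k=2)
    print(len(features), "features")
```

A vector like this could, in principle, be joined with other per-contig features (for example, counts of hits against marker protein databases) before being passed to a classifier; how RFPlasmid actually combines its features is not described on this page.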

Download data

  • Downloaded 337 times
  • Download rankings, all-time:
    • Site-wide: 65,070 out of 118,598
    • In bioinformatics: 6,391 out of 9,592
  • Year to date:
    • Site-wide: 23,112 out of 118,598
  • Since beginning of last month:
    • Site-wide: 19,101 out of 118,598

