Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 67,284 bioRxiv papers from 296,196 authors.

A computational protocol to characterize elusive Candidate Phyla Radiation bacteria in oral environments using metagenomic data

By Peiqi Meng, Chang Lu, Xinzhe Lou, Qian Zhang, Peizeng Jia, Zhimin Yan, Jiuxiang Lin, Feng Chen

Posted 29 Jun 2018
bioRxiv DOI: 10.1101/358812

Several studies have documented the diversity and potential pathogenic associations of organisms in the human oral cavity. Although much progress has been made in understanding the complex bacterial community inhabiting the human oral cavity, our understanding of some microorganisms is less resolved due to a variety of reasons. One such little-understood group is the candidate phyla radiation (CPR), which is a recently identified, but highly abundant group of ultrasmall bacteria with reduced genomes and unusual ribosomes. Here, we present a computational protocol for the detection of CPR organisms from metagenomic data. Our approach relies on a self-constructed dataset comprising published CPR genomic sequences as a filter to identify CPR sequences from metagenomic sequencing data. After assembly and functional prediction, the taxonomic affiliation of CPR contigs can be identified through phylogenetic analysis with publically available 16S rRNA gene and ribosomal proteins, in addition to sequence similarity analyses (e.g., average nucleotide identity calculations and contig mapping). Using this protocol, we reconstructed two draft genomes of organisms within the TM7 superphylum, that had genome sizes of 0.594 Mb and 0.678 Mb. Among the predicted functional genes of the constructed genomes, a high percentage were related to signal transduction, cell motility, and cell envelope biogenesis, which could contribute to cellular morphological changes in response to environmental cues.

Download data

  • Downloaded 311 times
  • Download rankings, all-time:
    • Site-wide: 35,264 out of 67,284
    • In bioinformatics: 4,374 out of 6,635
  • Year to date:
    • Site-wide: 25,989 out of 67,284
  • Since beginning of last month:
    • Site-wide: 21,673 out of 67,284

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News