Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 57,506 bioRxiv papers from 264,779 authors.

A computational protocol to characterize elusive Candidate Phyla Radiation bacteria in oral environments using metagenomic data

By Peiqi Meng, Chang Lu, Xinzhe Lou, Qian Zhang, Peizeng Jia, Zhimin Yan, Jiuxiang Lin, Feng Chen

Posted 29 Jun 2018
bioRxiv DOI: 10.1101/358812

Several studies have documented the diversity and potential pathogenic associations of organisms in the human oral cavity. Although much progress has been made in understanding the complex bacterial community inhabiting the human oral cavity, our understanding of some microorganisms is less resolved due to a variety of reasons. One such little-understood group is the candidate phyla radiation (CPR), which is a recently identified, but highly abundant group of ultrasmall bacteria with reduced genomes and unusual ribosomes. Here, we present a computational protocol for the detection of CPR organisms from metagenomic data. Our approach relies on a self-constructed dataset comprising published CPR genomic sequences as a filter to identify CPR sequences from metagenomic sequencing data. After assembly and functional prediction, the taxonomic affiliation of CPR contigs can be identified through phylogenetic analysis with publically available 16S rRNA gene and ribosomal proteins, in addition to sequence similarity analyses (e.g., average nucleotide identity calculations and contig mapping). Using this protocol, we reconstructed two draft genomes of organisms within the TM7 superphylum, that had genome sizes of 0.594 Mb and 0.678 Mb. Among the predicted functional genes of the constructed genomes, a high percentage were related to signal transduction, cell motility, and cell envelope biogenesis, which could contribute to cellular morphological changes in response to environmental cues.

Download data

  • Downloaded 254 times
  • Download rankings, all-time:
    • Site-wide: 34,210 out of 57,506
    • In bioinformatics: 4,259 out of 5,863
  • Year to date:
    • Site-wide: 23,184 out of 57,506
  • Since beginning of last month:
    • Site-wide: 26,273 out of 57,506

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News