Rxivist logo

ORF Capture-Seq: a versatile method for targeted identification of full-length isoforms

By Gloria M. Sheynkman, Katharine S Tuttle, Elizabeth Tseng, Jason G Underwood, Liang Yu, Da Dong, Melissa L. Smith, Robert Sebra, Tong Hao, Michael A. Calderwood, David E. Hill, Marc Vidal

Posted 11 Apr 2019
bioRxiv DOI: 10.1101/604157 (published DOI: 10.1038/s41467-020-16174-z)

Most human protein-coding genes are expressed as multiple isoforms. This in turn greatly expands the functional repertoire of the encoded proteome. While at least one reliable open reading frame (ORF) model has been assigned for every gene, the majority of alternative isoforms remains uncharacterized experimentally. This is primarily due to: i) vast differences of overall levels between different isoforms expressed from common genes, and ii) the difficulty of obtaining contiguous full-length ORF sequences. Here, we present ORF Capture-Seq (OCS), a flexible and cost-effective method that addresses both challenges for targeted full-length isoform sequencing applications using collections of cloned ORFs as probes. As proof-of-concept, we show that an OCS pipeline focused on genes coding for transcription factors increases isoform detection by an order of magnitude, compared to unenriched sample. In short, OCS enables rapid discovery of isoforms from custom-selected genes and will allow mapping of the full set of human isoforms at reasonable cost.

Download data

  • Downloaded 1,283 times
  • Download rankings, all-time:
    • Site-wide: 7,564 out of 88,923
    • In genomics: 1,167 out of 5,671
  • Year to date:
    • Site-wide: 10,810 out of 88,923
  • Since beginning of last month:
    • Site-wide: 17,266 out of 88,923

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)