Rxivist logo

Modeling RNA-binding protein specificity in vivo by precisely registering protein-RNA crosslink sites

By Huijuan Feng, Suying Bao, Sebastien M Weyn-Vanhentenryck, Aziz Khan, Justin Wong, Ankeeta Shah, Elise D. Flynn, Chaolin Zhang

Posted 27 Sep 2018
bioRxiv DOI: 10.1101/428615 (published DOI: 10.1016/j.molcel.2019.02.002)

RNA-binding proteins (RBPs) regulate post-transcriptional gene expression by recognizing short and degenerate sequence elements in their target transcripts. Despite the expanding list of RBPs with in vivo binding sites mapped genomewide using crosslinking and immunoprecipitation (CLIP), defining precise RBP binding specificity remains challenging. We previously demonstrated that the exact protein-RNA crosslink sites can be mapped using CLIP data at single-nucleotide resolution and observed that crosslinking frequently occurs at specific positions in RBP motifs. Here we have developed a computational method, named mCross, to jointly model RBP binding specificity while precisely registering the crosslinking position in motif sites. We applied mCross to 112 RBPs using ENCODE eCLIP data and validated the reliability of the resulting motifs by genome-wide analysis of allelic binding sites also detected by CLIP. We found that the prototypical SR protein SRSF1 recognizes GGA clusters to regulate splicing in a much larger repertoire of transcripts than previously appreciated.

Download data

  • Downloaded 1,260 times
  • Download rankings, all-time:
    • Site-wide: 8,127 out of 92,290
    • In molecular biology: 280 out of 3,156
  • Year to date:
    • Site-wide: 21,431 out of 92,290
  • Since beginning of last month:
    • Site-wide: 27,977 out of 92,290

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)