Rxivist logo

SSMART: Sequence-structure motif identification for RNA-binding proteins

By Alina Munteanu, Neelanjan Mukherjee, Uwe Ohler

Posted 23 Mar 2018
bioRxiv DOI: 10.1101/287953 (published DOI: 10.1093/bioinformatics/bty404)

Motivation: RNA-binding proteins (RBPs) regulate every aspect of RNA metabolism and function. There are hundreds of RBPs encoded in the eukaryotic genomes, and each recognize its RNA targets through a specific mixture of RNA sequence and structure properties. For most RBPs, however, only a primary sequence motif has been determined, while the structure of the binding sites is uncharacterized. Results: We developed SSMART, an RNA motif finder that simultaneously models the primary sequence and the structural properties of the RNA targets sites. The sequence-structure motifs are represented as consensus strings over a degenerate alphabet, extending the IUPAC codes for nucleotides to account for secondary structure preferences. Evaluation on synthetic data showed that SSMART is able to recover both sequence and structure motifs implanted into 3'UTR-like sequences, for various degrees of structured/unstructured binding sites. In addition, we successfully used SSMART on high-throughput in vivo and in vitro data, showing that we not only recover the known sequence motif, but also gain insight into the structural preferences of the RBP. Availability: SSMART is freely available at https://ohlerlab.mdc-berlin.de/software/SSMART 137/

Download data

  • Downloaded 616 times
  • Download rankings, all-time:
    • Site-wide: 23,654 out of 88,613
    • In bioinformatics: 3,324 out of 8,383
  • Year to date:
    • Site-wide: 46,583 out of 88,613
  • Since beginning of last month:
    • Site-wide: 22,448 out of 88,613

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)