Rxivist logo

Modeling the functional relationship network at the splice isoform level through heterogeneous data integration

By Hong-Dong Li, Rajasree Menon, Ridvan Eksi, Aysam Guerler, Yang Zhang, Gilbert S. Omenn, Yuanfang Guan

Posted 09 Jan 2014
bioRxiv DOI: 10.1101/001719

Functional relationship networks, which reveal the collaborative roles between genes, have significantly accelerated our understanding of gene functions and phenotypic relevance. However, establishing such networks for alternatively spliced isoforms remains a difficult, unaddressed problem due to the lack of systematic functional annotations at the isoform level, which renders most supervised learning methods difficult to be applied to isoforms. Here we describe a novel multiple instance learning-based probabilistic approach that integrates large-scale, heterogeneous genomic datasets, including RNA-seq, exon array, protein docking and pseudo-amino acid composition, for modeling a global functional relationship network at the isoform level in the mouse. Using this approach, we formulate a gene pair as a set of isoform pairs of potentially different properties. Through simulation and cross-validation studies, we showed the superior accuracy of our algorithm in revealing the isoform-level functional relationships. The local networks reveal functional diversity of the isoforms of the same gene, as demonstrated by both large-scale analyses and experimental and literature evidence for the disparate functions revealed for the isoforms of Ptbp1 and Anxa6 by our network. Our work can assist the understanding of the diversity of functions achieved by alternative splicing of a limited set of genes in mammalian genomes, and may shift the current gene-centered network prediction paradigm to the isoform level.

Download data

  • Downloaded 1,204 times
  • Download rankings, all-time:
    • Site-wide: 28,830
    • In genomics: 2,359
  • Year to date:
    • Site-wide: 207,291
  • Since beginning of last month:
    • Site-wide: 198,138

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide