Rxivist logo

Coevolution-based prediction of protein-protein interactions in polyketide biosynthetic assembly lines

By Yan Wang, Miguel Correa Marrero, Marnix H Medema, Aalt D.J. van Dijk

Posted 13 Jun 2019
bioRxiv DOI: 10.1101/669291 (published DOI: 10.1093/bioinformatics/btaa595)

Polyketide synthases are multimodular enzymes that generate diverse molecules of great pharmaceutical importance, including a range of clinically used antimicrobials and antitumor agents. Many polyketides are synthesized by type I polyketide synthases (PKSs), which are organized in assembly lines, in which multiple enzymes line up in a specific order. This order is defined by specific protein-protein interactions. The unique modular structure and catalyzing mechanism of these assembly lines makes their products predictable and also spurred combinatorial biosynthesis studies to produce novel polyketides using synthetic biology. However, predicting the interactions of PKSs, and thereby inferring the order of their assembly line, is still challenging, especially for cases in which this order is not reflected by the ordering of the PKS-encoding genes in the genome. Here, we introduce PKSpop, which uses a coevolution-based protein-protein interaction prediction algorithm to infer protein order in PKS assembly lines. Our method accurately predicts protein orders (80% accuracy). Additionally, we identify new residue pairs that are key in determining interaction specificity, and show that coevolution of N- and C-terminal docking domains of PKSs is significantly more predictive for protein-protein interactions than coevolution between ketosynthase and acyl carrier protein domains.

Download data

  • Downloaded 472 times
  • Download rankings, all-time:
    • Site-wide: 72,050
    • In bioinformatics: 6,754
  • Year to date:
    • Site-wide: 89,885
  • Since beginning of last month:
    • Site-wide: 87,157

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide