Rxivist logo

eGARD: Extracting associations between genomic anomalies and drug responses from text

By A. S. M. Ashique Mahmood, Shruti Rao, Peter McGarvey, Cathy Wu, Subha Madhavan, K. Vijay-Shanker

Posted 11 Jun 2017
bioRxiv DOI: 10.1101/148833 (published DOI: 10.1371/journal.pone.0189663)

Tumor molecular profiling plays an integral role in identifying genomic anomalies which may help in personalizing cancer treatments, improving patient outcomes and minimizing risks associated with different therapies. However, critical information regarding the evidence of clinical utility of such anomalies is largely buried in biomedical literature. It is becoming prohibitive for biocurators, clinical researchers and oncologists to keep up with the rapidly growing volume and breadth of information, especially those that describe therapeutic implications of biomarkers and therefore relevant for treatment selection. In an effort to improve and speed up the process of manually reviewing and extracting relevant information from literature, we have developed a natural language processing (NLP)-based text mining (TM) system called eGARD (extracting Genomic Anomalies association with Response to Drugs). This system relies on the syntactic nature of sentences coupled with various textual features to extract relations between genomic anomalies and drug response from MEDLINE abstracts. Our system achieved high precision, recall and F-measure of up to 0.95, 0.86 and 0.90, respectively, on annotated evaluation datasets created in-house and obtained externally from PharmGKB. Additionally, the system extracted information that helps determine the confidence level of extraction to support prioritization of curation. Such a system will enable clinical researchers to explore the use of published markers to stratify patients upfront for "best-fit" therapies and readily generate hypotheses for new clinical trials.

Download data

  • Downloaded 514 times
  • Download rankings, all-time:
    • Site-wide: 73,354
    • In bioinformatics: 6,831
  • Year to date:
    • Site-wide: None
  • Since beginning of last month:
    • Site-wide: 151,172

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide