William Koehler

Relation-weighted Link Prediction for Disease Gene Identification

Nov 13, 2020

Srivamshi Pittala, William Koehler, Jonathan Deans, Daniel Salinas, Martin Bringmann, Katharina Sophia Volz, Berk Kapicioglu

Abstract: Identification of disease genes, the set of genes associated with a disease, plays an important role in understanding and curing diseases. In this paper, we present a biomedical knowledge graph designed specifically for this problem, propose a novel machine learning method that identifies disease genes on such graphs by leveraging recent advances in network biology and graph representation learning, study the effects of various relation types on prediction performance, and empirically demonstrate that our algorithm outperforms its closest state-of-the-art competitor in disease gene identification by 24.1%. We also show that we achieve higher precision than Open Targets, the leading initiative for target identification, in predicting drug targets in clinical trials for Parkinson's disease.

* 4th Knowledge Representation and Reasoning Meets Machine Learning Workshop (KR2ML), NeurIPS 2020

Biomedical Information Extraction for Disease Gene Prioritization

Nov 12, 2020

Jupinder Parmar, William Koehler, Martin Bringmann, Katharina Sophia Volz, Berk Kapicioglu

Abstract: We introduce a biomedical information extraction (IE) pipeline that extracts biological relationships from text and demonstrate that its components, such as named entity recognition (NER) and relation extraction (RE), outperform the state of the art in BioNLP. We apply it to tens of millions of PubMed abstracts to extract protein-protein interactions (PPIs) and add these extractions to a biomedical knowledge graph that already contains PPIs extracted from STRING, the leading structured PPI database. We show that, even though the graph already contains PPIs from an established structured source, adding our own IE-based extractions allows us to predict novel disease-gene associations with a 20% relative increase in hit@30, an important step towards developing drug targets for uncured diseases.

* 4th Knowledge Representation and Reasoning Meets Machine Learning Workshop (KR2ML), NeurIPS 2020
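
The abstract above reports a relative increase in hit@30 but does not define the metric. As a rough illustration only, the sketch below uses one common reading of hit@k: the fraction of held-out true genes that appear among the top k ranked candidates. The function names `hits_at_k` and `relative_increase`, the gene identifiers, and this exact definition are assumptions made for illustration; the paper may compute the metric differently.

```python
from typing import Sequence, Set


def hits_at_k(ranked: Sequence[str], relevant: Set[str], k: int = 30) -> float:
    """Fraction of relevant genes found among the top-k ranked candidates.

    Assumed definition for illustration; not taken from the paper.
    """
    if not relevant:
        raise ValueError("relevant must contain at least one gene")
    top_k = set(ranked[:k])
    return len(top_k & relevant) / len(relevant)


def relative_increase(baseline: float, improved: float) -> float:
    """Relative change of `improved` over `baseline`, e.g. 0.2 for +20%."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (improved - baseline) / baseline


# Hypothetical ranking for one disease; the gene names are placeholders.
ranking = ["GENE_A", "GENE_B", "GENE_C", "GENE_D"]
held_out = {"GENE_B", "GENE_D"}

score = hits_at_k(ranking, held_out, k=2)   # only GENE_B is in the top 2
gain = relative_increase(0.5, 0.6)          # hypothetical scores, not paper results
```

Under this reading, a 20% relative increase means the new score is 1.2 times the baseline score, not that the score rose by 20 percentage points.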
