
Laura Graf

Scalable and interpretable rule-based link prediction for large heterogeneous knowledge graphs

Dec 10, 2020

Simon Ott, Laura Graf, Asan Agibetov, Christian Meilicke, Matthias Samwald


Abstract: Neural embedding-based machine learning models have shown promise for predicting novel links in biomedical knowledge graphs. Unfortunately, their practical utility is diminished by their lack of interpretability. Recently, the fully interpretable, rule-based algorithm AnyBURL yielded highly competitive results on many general-purpose link prediction benchmarks. However, its applicability to large-scale prediction tasks on complex biomedical knowledge bases is limited by long inference times and difficulties with aggregating predictions made by multiple rules. We improve upon AnyBURL by introducing the SAFRAN rule application framework which aggregates rules through a scalable clustering algorithm. SAFRAN yields new state-of-the-art results for fully interpretable link prediction on the established general-purpose benchmark FB15K-237 and the large-scale biomedical benchmark OpenBioLink. Furthermore, it exceeds the results of multiple established embedding-based algorithms on FB15K-237 and narrows the gap between rule-based and embedding-based algorithms on OpenBioLink. We also show that SAFRAN increases inference speeds by up to two orders of magnitude.
