Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ramon Fernández Mir

The University of Edinburgh School of Informatics, Artificial Intelligence and its Applications Institute

Neurosymbolic AI for Reasoning on Biomedical Knowledge Graphs

Jul 17, 2023

Lauren Nicole DeLong, Ramon Fernández Mir, Zonglin Ji, Fiona Niamh Coulter Smith, Jacques D. Fleuriot

Abstract:Biomedical datasets are often modeled as knowledge graphs (KGs) because they capture the multi-relational, heterogeneous, and dynamic natures of biomedical systems. KG completion (KGC), can, therefore, help researchers make predictions to inform tasks like drug repositioning. While previous approaches for KGC were either rule-based or embedding-based, hybrid approaches based on neurosymbolic artificial intelligence are becoming more popular. Many of these methods possess unique characteristics which make them even better suited toward biomedical challenges. Here, we survey such approaches with an emphasis on their utilities and prospective benefits for biomedicine.

* Proceedings of the $\mathit{40}^{th}$ International Conference on Machine Learning: Workshop on Knowledge and Logical Reasoning in the Era of Data-driven Learning (https://klr-icml2023.github.io/schedule.html). PMLR 202, 2023. Condensed, workshop-ready version of previous survey, arXiv:2302.07200 , which is under review. 13 pages (9 content, 4 references), 3 figures, 1 table

Via

Access Paper or Ask Questions

Machine-Learned Premise Selection for Lean

Mar 17, 2023

Bartosz Piotrowski, Ramon Fernández Mir, Edward Ayers

Figure 1 for Machine-Learned Premise Selection for Lean

Figure 2 for Machine-Learned Premise Selection for Lean

Figure 3 for Machine-Learned Premise Selection for Lean

Figure 4 for Machine-Learned Premise Selection for Lean

Abstract:We introduce a machine-learning-based tool for the Lean proof assistant that suggests relevant premises for theorems being proved by a user. The design principles for the tool are (1) tight integration with the proof assistant, (2) ease of use and installation, (3) a lightweight and fast approach. For this purpose, we designed a custom version of the random forest model, trained in an online fashion. It is implemented directly in Lean, which was possible thanks to the rich and efficient metaprogramming features of Lean 4. The random forest is trained on data extracted from mathlib -- Lean's mathematics library. We experiment with various options for producing training features and labels. The advice from a trained model is accessible to the user via the suggest_premises tactic which can be called in an editor while constructing a proof interactively.

Via

Access Paper or Ask Questions

Neurosymbolic AI for Reasoning on Graph Structures: A Survey

Feb 14, 2023

Lauren Nicole DeLong, Ramon Fernández Mir, Matthew Whyte, Zonglin Ji, Jacques D. Fleuriot

Figure 1 for Neurosymbolic AI for Reasoning on Graph Structures: A Survey

Figure 2 for Neurosymbolic AI for Reasoning on Graph Structures: A Survey

Figure 3 for Neurosymbolic AI for Reasoning on Graph Structures: A Survey

Figure 4 for Neurosymbolic AI for Reasoning on Graph Structures: A Survey

Abstract:Neurosymbolic AI is an increasingly active area of research which aims to combine symbolic reasoning methods with deep learning to generate models with both high predictive performance and some degree of human-level comprehensibility. As knowledge graphs are becoming a popular way to represent heterogeneous and multi-relational data, methods for reasoning on graph structures have attempted to follow this neurosymbolic paradigm. Traditionally, such approaches have utilized either rule-based inference or generated representative numerical embeddings from which patterns could be extracted. However, several recent studies have attempted to bridge this dichotomy in ways that facilitate interpretability, maintain performance, and integrate expert knowledge. Within this article, we survey a breadth of methods that perform neurosymbolic reasoning tasks on graph structures. To better compare the various methods, we propose a novel taxonomy by which we can classify them. Specifically, we propose three major categories: (1) logically-informed embedding approaches, (2) embedding approaches with logical constraints, and (3) rule-learning approaches. Alongside the taxonomy, we provide a tabular overview of the approaches and links to their source code, if available, for more direct comparison. Finally, we discuss the applications on which these methods were primarily used and propose several prospective directions toward which this new field of research could evolve.

* 21 pages, 8 figures, 1 table, currently under review. Corresponding GitHub page here: https://github.com/NeSymGraphs

Via

Access Paper or Ask Questions