Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yoshiki Vazquez-Baeza

Theoretical Knowledge Graph Reasoning via Ending Anchored Rules

Dec 15, 2020

Canlin Zhang, Yannis Katsis, Yoshiki Vazquez-Baeza, Andrew Bartko, Ho-Cheol Kim, Chun-Nan Hsu

Figure 1 for Theoretical Knowledge Graph Reasoning via Ending Anchored Rules

Figure 2 for Theoretical Knowledge Graph Reasoning via Ending Anchored Rules

Figure 3 for Theoretical Knowledge Graph Reasoning via Ending Anchored Rules

Figure 4 for Theoretical Knowledge Graph Reasoning via Ending Anchored Rules

Abstract:Discovering precise and specific rules from knowledge graphs is regarded as an essential challenge, which can improve the performances of many downstream tasks and even provide new ways to approach some Natural Language Processing research topics. In this paper, we provide a fundamental theory for knowledge graph reasoning based on the ending anchored rules. Our theory provides precise reasons explaining why or why not a triple is correct. Then, we implement our theory by what we call the EARDict model. Results show that our EARDict model significantly outperforms all the benchmark models on two large datasets of knowledge graph completion, including achieving a Hits@10 score of 96.6 percent on WN18RR.

* Comparing to v2, v3 raises the lower bound of the connection set to be 2, which increases the performances on WN18RR for about 20 percent, and increases those on FB15K-237 for about 6 percent. People may refer to our presentation "EARDict_refinement" posted on github.com/ucsd-cmi/eardict for a detailed comparison between v2 and v3. We also revise our expressions a lot in v3

Via

Access Paper or Ask Questions

Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data

Nov 30, 2020

Lingjing Jiang, Niina Haiminen, Anna-Paola Carrieri, Shi Huang, Yoshiki Vazquez-Baeza, Laxmi Parida, Ho-Cheol Kim, Austin D. Swafford, Rob Knight, Loki Natarajan

Figure 1 for Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data

Figure 2 for Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data

Figure 3 for Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data

Figure 4 for Utilizing stability criteria in choosing feature selection methods yields reproducible results in microbiome data

Abstract:Feature selection is indispensable in microbiome data analysis, but it can be particularly challenging as microbiome data sets are high-dimensional, underdetermined, sparse and compositional. Great efforts have recently been made on developing new methods for feature selection that handle the above data characteristics, but almost all methods were evaluated based on performance of model predictions. However, little attention has been paid to address a fundamental question: how appropriate are those evaluation criteria? Most feature selection methods often control the model fit, but the ability to identify meaningful subsets of features cannot be evaluated simply based on the prediction accuracy. If tiny changes to the training data would lead to large changes in the chosen feature subset, then many of the biological features that an algorithm has found are likely to be a data artifact rather than real biological signal. This crucial need of identifying relevant and reproducible features motivated the reproducibility evaluation criterion such as Stability, which quantifies how robust a method is to perturbations in the data. In our paper, we compare the performance of popular model prediction metric MSE and proposed reproducibility criterion Stability in evaluating four widely used feature selection methods in both simulations and experimental microbiome applications. We conclude that Stability is a preferred feature selection criterion over MSE because it better quantifies the reproducibility of the feature selection method.

Via

Access Paper or Ask Questions