Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marta Soare

Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Jul 13, 2017

Pedram Daee, Tomi Peltola, Marta Soare, Samuel Kaski

Figure 1 for Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Figure 2 for Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Figure 3 for Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Figure 4 for Knowledge Elicitation via Sequential Probabilistic Inference for High-Dimensional Prediction

Abstract:Prediction in a small-sized sample with a large number of covariates, the "small n, large p" problem, is challenging. This setting is encountered in multiple applications, such as precision medicine, where obtaining additional samples can be extremely costly or even impossible, and extensive research effort has recently been dedicated to finding principled solutions for accurate prediction. However, a valuable source of additional information, domain experts, has not yet been efficiently exploited. We formulate knowledge elicitation generally as a probabilistic inference process, where expert knowledge is sequentially queried to improve predictions. In the specific case of sparse linear regression, where we assume the expert has knowledge about the values of the regression coefficients or about the relevance of the features, we propose an algorithm and computational approximation for fast and efficient interaction, which sequentially identifies the most informative features on which to query expert knowledge. Evaluations of our method in experiments with simulated and real users show improved prediction accuracy already with a small effort from the expert.

* 22 pages, 9 figures. The paper is published in Machine Learning journal (http://rdcu.be/t9KF). Codes and data available at https://github.com/HIIT/knowledge-elicitation-for-linear-regression, Machine Learning, (2017)

Via

Access Paper or Ask Questions

Improving drug sensitivity predictions in precision medicine through active expert knowledge elicitation

May 09, 2017

Iiris Sundin, Tomi Peltola, Muntasir Mamun Majumder, Pedram Daee, Marta Soare, Homayun Afrabandpey, Caroline Heckman, Samuel Kaski, Pekka Marttinen

Figure 1 for Improving drug sensitivity predictions in precision medicine through active expert knowledge elicitation

Figure 2 for Improving drug sensitivity predictions in precision medicine through active expert knowledge elicitation

Figure 3 for Improving drug sensitivity predictions in precision medicine through active expert knowledge elicitation

Figure 4 for Improving drug sensitivity predictions in precision medicine through active expert knowledge elicitation

Abstract:Predicting the efficacy of a drug for a given individual, using high-dimensional genomic measurements, is at the core of precision medicine. However, identifying features on which to base the predictions remains a challenge, especially when the sample size is small. Incorporating expert knowledge offers a promising alternative to improve a prediction model, but collecting such knowledge is laborious to the expert if the number of candidate features is very large. We introduce a probabilistic model that can incorporate expert feedback about the impact of genomic measurements on the sensitivity of a cancer cell for a given drug. We also present two methods to intelligently collect this feedback from the expert, using experimental design and multi-armed bandit models. In a multiple myeloma blood cancer data set (n=51), expert knowledge decreased the prediction error by 8%. Furthermore, the intelligent approaches can be used to reduce the workload of feedback collection to less than 30% on average compared to a naive approach.

* 24 pages, 3 figures

Via

Access Paper or Ask Questions

Regression with n$\to$1 by Expert Knowledge Elicitation

Feb 07, 2017

Marta Soare, Muhammad Ammad-ud-din, Samuel Kaski

$Figure 1 for Regression with n$\to$1 by Expert Knowledge Elicitation$

$Figure 2 for Regression with n$\to$1 by Expert Knowledge Elicitation$

$Figure 3 for Regression with n$\to$1 by Expert Knowledge Elicitation$

$Figure 4 for Regression with n$\to$1 by Expert Knowledge Elicitation$

Abstract:We consider regression under the "extremely small $n$ large $p$" condition, where the number of samples $n$ is so small compared to the dimensionality $p$ that predictors cannot be estimated without prior knowledge. This setup occurs in personalized medicine, for instance, when predicting treatment outcomes for an individual patient based on noisy high-dimensional genomics data. A remaining source of information is expert knowledge, which has received relatively little attention in recent years. We formulate the inference problem of asking expert feedback on features on a budget, propose an elicitation strategy for a simple "small $n$" setting, and derive conditions under which the elicitation strategy is optimal. Experiments on simulated experts, both on synthetic and genomics data, demonstrate that the proposed strategy can drastically improve prediction accuracy.

* In Proceedings of the 15th IEEE International Conference on Machine Learning and Applications (IEEE ICMLA'16)

Via

Access Paper or Ask Questions

Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Jan 16, 2017

Luana Micallef, Iiris Sundin, Pekka Marttinen, Muhammad Ammad-ud-din, Tomi Peltola, Marta Soare, Giulio Jacucci, Samuel Kaski

Figure 1 for Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Figure 2 for Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Figure 3 for Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Figure 4 for Interactive Elicitation of Knowledge on Feature Relevance Improves Predictions in Small Data Sets

Abstract:Providing accurate predictions is challenging for machine learning algorithms when the number of features is larger than the number of samples in the data. Prior knowledge can improve machine learning models by indicating relevant variables and parameter values. Yet, this prior knowledge is often tacit and only available from domain experts. We present a novel approach that uses interactive visualization to elicit the tacit prior knowledge and uses it to improve the accuracy of prediction models. The main component of our approach is a user model that models the domain expert's knowledge of the relevance of different features for a prediction task. In particular, based on the expert's earlier input, the user model guides the selection of the features on which to elicit user's knowledge next. The results of a controlled user study show that the user model significantly improves prior knowledge elicitation and prediction accuracy, when predicting the relative citation counts of scientific documents in a specific domain.

* in Proceedings of the 22nd International Conference on Intelligent User Interfaces (IUI 2017)

Via

Access Paper or Ask Questions

Best-Arm Identification in Linear Bandits

Nov 04, 2014

Marta Soare, Alessandro Lazaric, Rémi Munos

Figure 1 for Best-Arm Identification in Linear Bandits

Figure 2 for Best-Arm Identification in Linear Bandits

Abstract:We study the best-arm identification problem in linear bandit, where the rewards of the arms depend linearly on an unknown parameter $\theta^*$ and the objective is to return the arm with the largest reward. We characterize the complexity of the problem and introduce sample allocation strategies that pull arms to identify the best arm with a fixed confidence, while minimizing the sample budget. In particular, we show the importance of exploiting the global linear structure to improve the estimate of the reward of near-optimal arms. We analyze the proposed strategies and compare their empirical performance. Finally, as a by-product of our analysis, we point out the connection to the $G$-optimality criterion used in optimal experimental design.

* In Advances in Neural Information Processing Systems 27 (NIPS), 2014

Via

Access Paper or Ask Questions