Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Paola Malsot

Ecole Polytechnique Fédérale de Lausanne

Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information

Jul 18, 2024

Fedor Sergeev, Paola Malsot, Gunnar Rätsch, Vincent Fortuin

Figure 1 for Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information

Figure 2 for Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information

Figure 3 for Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information

Figure 4 for Towards Dynamic Feature Acquisition on Medical Time Series by Maximizing Conditional Mutual Information

Abstract:Knowing which features of a multivariate time series to measure and when is a key task in medicine, wearables, and robotics. Better acquisition policies can reduce costs while maintaining or even improving the performance of downstream predictors. Inspired by the maximization of conditional mutual information, we propose an approach to train acquirers end-to-end using only the downstream loss. We show that our method outperforms random acquisition policy, matches a model with an unrestrained budget, but does not yet overtake a static acquisition strategy. We highlight the assumptions and outline avenues for future work.

* Presented at the ICML 2024 Next Generation of Sequence Modeling Architectures (NGSM) Workshop

Via

Access Paper or Ask Questions

Optirank: classification for RNA-Seq data with optimal ranking reference genes

Jan 11, 2023

Paola Malsot, Filipe Martins, Didier Trono, Guillaume Obozinski

Figure 1 for Optirank: classification for RNA-Seq data with optimal ranking reference genes

Figure 2 for Optirank: classification for RNA-Seq data with optimal ranking reference genes

Figure 3 for Optirank: classification for RNA-Seq data with optimal ranking reference genes

Figure 4 for Optirank: classification for RNA-Seq data with optimal ranking reference genes

Abstract:Classification algorithms using RNA-Sequencing (RNA-Seq) data as input are used in a variety of biological applications. By nature, RNA-Seq data is subject to uncontrolled fluctuations both within and especially across datasets, which presents a major difficulty for a trained classifier to generalize to an external dataset. Replacing raw gene counts with the rank of gene counts inside an observation has proven effective to mitigate this problem. However, the rank of a feature is by definition relative to all other features, including highly variable features that introduce noise in the ranking. To address this problem and obtain more robust ranks, we propose a logistic regression model, optirank, which learns simultaneously the parameters of the model and the genes to use as a reference set in the ranking. We show the effectiveness of this method on simulated data. We also consider real classification tasks, which present different kinds of distribution shifts between train and test data. Those tasks concern a variety of applications, such as cancer of unknown primary classification, identification of specific gene signatures, and determination of cell type in single-cell RNA-Seq datasets. On those real tasks, optirank performs at least as well as the vanilla logistic regression on classical ranks, while producing sparser solutions. In addition, to increase the robustness against dataset shifts, we propose a multi-source learning scheme and demonstrate its effectiveness when used in combination with rank-based classifiers.

Via

Access Paper or Ask Questions