Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Panagiotis Papapetrou

PAX-TS: Model-agnostic multi-granular explanations for time series forecasting via localized perturbations

Aug 26, 2025

Tim Kreuzer, Jelena Zdravkovic, Panagiotis Papapetrou

Abstract:Time series forecasting has seen considerable improvement during the last years, with transformer models and large language models driving advancements of the state of the art. Modern forecasting models are generally opaque and do not provide explanations for their forecasts, while well-known post-hoc explainability methods like LIME are not suitable for the forecasting context. We propose PAX-TS, a model-agnostic post-hoc algorithm to explain time series forecasting models and their forecasts. Our method is based on localized input perturbations and results in multi-granular explanations. Further, it is able to characterize cross-channel correlations for multivariate time series forecasts. We clearly outline the algorithmic procedure behind PAX-TS, demonstrate it on a benchmark with 7 algorithms and 10 diverse datasets, compare it with two other state-of-the-art explanation algorithms, and present the different explanation types of the method. We found that the explanations of high-performing and low-performing algorithms differ on the same datasets, highlighting that the explanations of PAX-TS effectively capture a model's behavior. Based on time step correlation matrices resulting from the benchmark, we identify 6 classes of patterns that repeatedly occur across different datasets and algorithms. We found that the patterns are indicators of performance, with noticeable differences in forecasting error between the classes. Lastly, we outline a multivariate example where PAX-TS demonstrates how the forecasting model takes cross-channel correlations into account. With PAX-TS, time series forecasting models' mechanisms can be illustrated in different levels of detail, and its explanations can be used to answer practical questions on forecasts.

Via

Access Paper or Ask Questions

Counterfactual Explanations for Time Series Forecasting

Oct 12, 2023

Zhendong Wang, Ioanna Miliou, Isak Samsten, Panagiotis Papapetrou

Figure 1 for Counterfactual Explanations for Time Series Forecasting

Figure 2 for Counterfactual Explanations for Time Series Forecasting

Figure 3 for Counterfactual Explanations for Time Series Forecasting

Figure 4 for Counterfactual Explanations for Time Series Forecasting

Abstract:Among recent developments in time series forecasting methods, deep forecasting models have gained popularity as they can utilize hidden feature patterns in time series to improve forecasting performance. Nevertheless, the majority of current deep forecasting models are opaque, hence making it challenging to interpret the results. While counterfactual explanations have been extensively employed as a post-hoc approach for explaining classification models, their application to forecasting models still remains underexplored. In this paper, we formulate the novel problem of counterfactual generation for time series forecasting, and propose an algorithm, called ForecastCF, that solves the problem by applying gradient-based perturbations to the original time series. ForecastCF guides the perturbations by applying constraints to the forecasted values to obtain desired prediction outcomes. We experimentally evaluate ForecastCF using four state-of-the-art deep model architectures and compare to two baselines. Our results show that ForecastCF outperforms the baseline in terms of counterfactual validity and data manifold closeness. Overall, our findings suggest that ForecastCF can generate meaningful and relevant counterfactual explanations for various forecasting tasks.

* 10 pages, 6 figures. Accepted by ICDM 2023

Via

Access Paper or Ask Questions

An End-to-End Workflow using Topic Segmentation and Text Summarisation Methods for Improved Podcast Comprehension

Jul 25, 2023

Andrew Aquilina, Sean Diacono, Panagiotis Papapetrou, Maria Movin

Abstract:The consumption of podcast media has been increasing rapidly. Due to the lengthy nature of podcast episodes, users often carefully select which ones to listen to. Although episode descriptions aid users by providing a summary of the entire podcast, they do not provide a topic-by-topic breakdown. This study explores the combined application of topic segmentation and text summarisation methods to investigate how podcast episode comprehension can be improved. We have sampled 10 episodes from Spotify's English-Language Podcast Dataset and employed TextTiling and TextSplit to segment them. Moreover, three text summarisation models, namely T5, BART, and Pegasus, were applied to provide a very short title for each segment. The segmentation part was evaluated using our annotated sample with the $P_k$ and WindowDiff ($WD$) metrics. A survey was also rolled out ($N=25$) to assess the quality of the generated summaries. The TextSplit algorithm achieved the lowest mean for both evaluation metrics ($\bar{P_k}=0.41$ and $\bar{WD}=0.41$), while the T5 model produced the best summaries, achieving a relevancy score only $8\%$ less to the one achieved by the human-written titles.

Via

Access Paper or Ask Questions

FLICU: A Federated Learning Workflow for Intensive Care Unit Mortality Prediction

May 30, 2022

Lena Mondrejevski, Ioanna Miliou, Annaclaudia Montanino, David Pitts, Jaakko Hollmén, Panagiotis Papapetrou

Figure 1 for FLICU: A Federated Learning Workflow for Intensive Care Unit Mortality Prediction

Figure 2 for FLICU: A Federated Learning Workflow for Intensive Care Unit Mortality Prediction

Figure 3 for FLICU: A Federated Learning Workflow for Intensive Care Unit Mortality Prediction

Figure 4 for FLICU: A Federated Learning Workflow for Intensive Care Unit Mortality Prediction

Abstract:Although Machine Learning (ML) can be seen as a promising tool to improve clinical decision-making for supporting the improvement of medication plans, clinical procedures, diagnoses, or medication prescriptions, it remains limited by access to healthcare data. Healthcare data is sensitive, requiring strict privacy practices, and typically stored in data silos, making traditional machine learning challenging. Federated learning can counteract those limitations by training machine learning models over data silos while keeping the sensitive data localized. This study proposes a federated learning workflow for ICU mortality prediction. Hereby, the applicability of federated learning as an alternative to centralized machine learning and local machine learning is investigated by introducing federated learning to the binary classification problem of predicting ICU mortality. We extract multivariate time series data from the MIMIC-III database (lab values and vital signs), and benchmark the predictive performance of four deep sequential classifiers (FRNN, LSTM, GRU, and 1DCNN) varying the patient history window lengths (8h, 16h, 24h, 48h) and the number of FL clients (2, 4, 8). The experiments demonstrate that both centralized machine learning and federated learning are comparable in terms of AUPRC and F1-score. Furthermore, the federated approach shows superior performance over local machine learning. Thus, the federated approach can be seen as a valid and privacy-preserving alternative to centralized machine learning for classifying ICU mortality when sharing sensitive patient data between hospitals is not possible.

Via

Access Paper or Ask Questions

Robust Explanations for Private Support Vector Machines

Feb 07, 2021

Rami Mochaourab, Sugandh Sinha, Stanley Greenstein, Panagiotis Papapetrou

Figure 1 for Robust Explanations for Private Support Vector Machines

Figure 2 for Robust Explanations for Private Support Vector Machines

Figure 3 for Robust Explanations for Private Support Vector Machines

Figure 4 for Robust Explanations for Private Support Vector Machines

Abstract:We consider counterfactual explanations for private support vector machines (SVM), where the privacy mechanism that publicly releases the classifier guarantees differential privacy. While privacy preservation is essential when dealing with sensitive data, there is a consequent degradation in the classification accuracy due to the introduced perturbations in the classifier weights. For such classifiers, counterfactual explanations need to be robust against the uncertainties in the SVM weights in order to ensure, with high confidence, that the classification of the data instance to be explained is different than its explanation. We model the uncertainties in the SVM weights through a random vector, and formulate the explanation problem as an optimization problem with probabilistic constraint. Subsequently, we characterize the problem's deterministic equivalent and study its solution. For linear SVMs, the problem is a convex second-order cone program. For non-linear SVMs, the problem is non-convex. Thus, we propose a sub-optimal solution that is based on the bisection method. The results show that, contrary to non-robust explanations, the quality of explanations from the robust solution degrades with increasing privacy in order to guarantee a prespecified confidence level for correct classifications.

* 13 pages, 9 figures, 1 table. Under consideration at Pattern Recognition Letters

Via

Access Paper or Ask Questions

Clinical Predictive Keyboard using Statistical and Neural Language Modeling

Jun 22, 2020

John Pavlopoulos, Panagiotis Papapetrou

Figure 1 for Clinical Predictive Keyboard using Statistical and Neural Language Modeling

Figure 2 for Clinical Predictive Keyboard using Statistical and Neural Language Modeling

Abstract:A language model can be used to predict the next word during authoring, to correct spelling or to accelerate writing (e.g., in sms or emails). Language models, however, have only been applied in a very small scale to assist physicians during authoring (e.g., discharge summaries or radiology reports). But along with the assistance to the physician, computer-based systems which expedite the patient's exit also assist in decreasing the hospital infections. We employed statistical and neural language modeling to predict the next word of a clinical text and assess all the models in terms of accuracy and keystroke discount in two datasets with radiology reports. We show that a neural language model can achieve as high as 51.3% accuracy in radiology reports (one out of two words predicted correctly). We also show that even when the models are employed only for frequent words, the physician can save valuable time.

* To appear in CBMS'20

Via

Access Paper or Ask Questions

RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams

Jun 11, 2020

Vasiliki Kougia, John Pavlopoulos, Panagiotis Papapetrou, Max Gordon

Figure 1 for RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams

Figure 2 for RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams

Figure 3 for RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams

Figure 4 for RTEX: A novel methodology for Ranking, Tagging, and Explanatory diagnostic captioning of radiography exams

Abstract:This paper introduces RTEx, a novel methodology for a) ranking radiography exams based on their probability to contain an abnormality, b) generating abnormality tags for abnormal exams, and c) providing a diagnostic explanation in natural language for each abnormal exam. The task of ranking radiography exams is an important first step for practitioners who want to identify and prioritize those radiography exams that are more likely to contain abnormalities, for example, to avoid mistakes due to tiredness or to manage heavy workload (e.g., during a pandemic). We used two publicly available datasets to assess our methodology and demonstrate that for the task of ranking it outperforms its competitors in terms of NDCG@k. For each abnormal radiography exam RTEx generates a set of abnormality tags alongside an explanatory diagnostic text to explain the tags and guide the medical expert. Our tagging component outperforms two strong competitor methods in terms of F1. Moreover, the diagnostic captioning component of RTEx, which exploits the already extracted tags to constrain the captioning process, outperforms all competitors with respect to clinical precision and recall.

Via

Access Paper or Ask Questions

Aggregate-Eliminate-Predict: Detecting Adverse Drug Events from Heterogeneous Electronic Health Records

Jul 13, 2019

Maria Bampa, Panagiotis Papapetrou

Figure 1 for Aggregate-Eliminate-Predict: Detecting Adverse Drug Events from Heterogeneous Electronic Health Records

Figure 2 for Aggregate-Eliminate-Predict: Detecting Adverse Drug Events from Heterogeneous Electronic Health Records

Figure 3 for Aggregate-Eliminate-Predict: Detecting Adverse Drug Events from Heterogeneous Electronic Health Records

Figure 4 for Aggregate-Eliminate-Predict: Detecting Adverse Drug Events from Heterogeneous Electronic Health Records

Abstract:We study the problem of detecting adverse drug events in electronic healthcare records. The challenge in this work is to aggregate heterogeneous data types involving diagnosis codes, drug codes, as well as lab measurements. An earlier framework proposed for the same problem demonstrated promising predictive performance for the random forest classifier by using only lab measurements as data features. We extend this framework, by additionally including diagnosis and drug prescription codes, concurrently. In addition, we employ a recursive feature selection mechanism on top, that extracts the top-k most important features. Our experimental evaluation on five medical datasets of adverse drug events and six different classifiers, suggests that the integration of these additional features provides substantial and statistically significant improvements in terms of AUC, while employing medically relevant features.

* 4 pages

Via

Access Paper or Ask Questions

Cellular Traffic Prediction and Classification: a comparative evaluation of LSTM and ARIMA

Jun 03, 2019

Amin Azari, Panagiotis Papapetrou, Stojan Denic, Gunnar Peters

Figure 1 for Cellular Traffic Prediction and Classification: a comparative evaluation of LSTM and ARIMA

Figure 2 for Cellular Traffic Prediction and Classification: a comparative evaluation of LSTM and ARIMA

Figure 3 for Cellular Traffic Prediction and Classification: a comparative evaluation of LSTM and ARIMA

Figure 4 for Cellular Traffic Prediction and Classification: a comparative evaluation of LSTM and ARIMA

Abstract:Prediction of user traffic in cellular networks has attracted profound attention for improving resource utilization. In this paper, we study the problem of network traffic traffic prediction and classification by employing standard machine learning and statistical learning time series prediction methods, including long short-term memory (LSTM) and autoregressive integrated moving average (ARIMA), respectively. We present an extensive experimental evaluation of the designed tools over a real network traffic dataset. Within this analysis, we explore the impact of different parameters to the effectiveness of the predictions. We further extend our analysis to the problem of network traffic classification and prediction of traffic bursts. The results, on the one hand, demonstrate superior performance of LSTM over ARIMA in general, especially when the length of the training time series is high enough, and it is augmented by a wisely-selected set of features. On the other hand, the results shed light on the circumstances in which, ARIMA performs close to the optimal with lower complexity.

* arXiv admin note: text overlap with arXiv:1906.00951

Via

Access Paper or Ask Questions

User Traffic Prediction for Proactive Resource Management: Learning-Powered Approaches

May 09, 2019

Amin Azari, Panagiotis Papapetrou, Stojan Denic, Gunnar Peters

Figure 1 for User Traffic Prediction for Proactive Resource Management: Learning-Powered Approaches

Figure 2 for User Traffic Prediction for Proactive Resource Management: Learning-Powered Approaches

Figure 3 for User Traffic Prediction for Proactive Resource Management: Learning-Powered Approaches

Figure 4 for User Traffic Prediction for Proactive Resource Management: Learning-Powered Approaches

Abstract:Traffic prediction plays a vital role in efficient planning and usage of network resources in wireless networks. While traffic prediction in wired networks is an established field, there is a lack of research on the analysis of traffic in cellular networks, especially in a content-blind manner at the user level. Here, we shed light into this problem by designing traffic prediction tools that employ either statistical, rule-based, or deep machine learning methods. First, we present an extensive experimental evaluation of the designed tools over a real traffic dataset. Within this analysis, the impact of different parameters, such as length of prediction, feature set used in analyses, and granularity of data, on accuracy of prediction are investigated. Second, regarding the coupling observed between behavior of traffic and its generating application, we extend our analysis to the blind classification of applications generating the traffic based on the statistics of traffic arrival/departure. The results demonstrate presence of a threshold number of previous observations, beyond which, deep machine learning can outperform linear statistical learning, and before which, statistical learning outperforms deep learning approaches. Further analysis of this threshold value represents a strong coupling between this threshold, the length of future prediction, and the feature set in use. Finally, through a case study, we present how the experienced delay could be decreased by traffic arrival prediction.

* arXiv admin note: text overlap with arXiv:1906.00939

Via

Access Paper or Ask Questions