Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dimitris Papatheodoulou

Data augmentation on-the-fly and active learning in data stream classification

Oct 13, 2022

Kleanthis Malialis, Dimitris Papatheodoulou, Stylianos Filippou, Christos G. Panayiotou, Marios M. Polycarpou

Figure 1 for Data augmentation on-the-fly and active learning in data stream classification

Figure 2 for Data augmentation on-the-fly and active learning in data stream classification

Figure 3 for Data augmentation on-the-fly and active learning in data stream classification

Figure 4 for Data augmentation on-the-fly and active learning in data stream classification

Abstract:There is an emerging need for predictive models to be trained on-the-fly, since in numerous machine learning applications data are arriving in an online fashion. A critical challenge encountered is that of limited availability of ground truth information (e.g., labels in classification tasks) as new data are observed one-by-one online, while another significant challenge is that of class imbalance. This work introduces the novel Augmented Queues method, which addresses the dual-problem by combining in a synergistic manner online active learning, data augmentation, and a multi-queue memory to maintain separate and balanced queues for each class. We perform an extensive experimental study using image and time-series augmentations, in which we examine the roles of the active learning budget, memory size, imbalance level, and neural network type. We demonstrate two major advantages of Augmented Queues. First, it does not reserve additional memory space as the generation of synthetic data occurs only at training times. Second, learning models have access to more labelled data without the need to increase the active learning budget and / or the original memory size. Learning on-the-fly poses major challenges which, typically, hinder the deployment of learning models. Augmented Queues significantly improves the performance in terms of learning quality and speed. Our code is made publicly available.

* IEEE Symposium Series on Computational Intelligence (SSCI), 2022
* Keywords: incremental learning, active learning, data streams, class imbalance, neural networks

Via

Access Paper or Ask Questions

A Multi-label Time Series Classification Approach for Non-intrusive Water End-Use Monitoring

Sep 30, 2022

Dimitris Papatheodoulou, Pavlos Pavlou, Stelios G. Vrachimis, Kleanthis Malialis, Demetrios G. Eliades, Theocharis Theocharides

Abstract:Numerous real-world problems from a diverse set of application areas exist that exhibit temporal dependencies. We focus on a specific type of time series classification which we refer to as aggregated time series classification. We consider an aggregated sequence of a multi-variate time series, and propose a methodology to make predictions based solely on the aggregated information. As a case study, we apply our methodology to the challenging problem of household water end-use dissagregation when using non-intrusive water monitoring. Our methodology does not require a-priori identification of events, and to our knowledge, it is considered for the first time. We conduct an extensive experimental study using a residential water-use simulator, involving different machine learning classifiers, multi-label classification methods, and successfully demonstrate the effectiveness of our methodology.

* 18th International Conference on Artificial Intelligence Applications and Innovations (AIAI), 2022

Via

Access Paper or Ask Questions