Shabnam Nazmi

Mitigating shortage of labeled data using clustering-based active learning with diversity exploration

Jul 06, 2022

Xuyang Yan, Shabnam Nazmi, Biniam Gebru, Mohd Anwar, Abdollah Homaifar, Mrinmoy Sarkar, Kishor Datta Gupta

Abstract: In this paper, we propose a new clustering-based active learning framework, Active Learning using a Clustering-based Sampling (ALCS), to address the shortage of labeled data. ALCS employs a density-based clustering approach to explore the cluster structure of the data without requiring exhaustive parameter tuning. A bi-cluster boundary-based sample query procedure is introduced to improve the learning performance for classifying highly overlapped classes. Additionally, we develop an effective diversity exploration strategy to address the redundancy among queried samples. Our experimental results demonstrate the efficacy of the ALCS approach.

* Accepted by the ICML 2022 Workshop on Adaptive Experimental Design and Active Learning in the Real World
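
The abstract above mentions reducing redundancy among queried samples but does not describe the strategy itself. As a generic illustration only, the sketch below uses greedy farthest-point selection, a common stand-in for diversity-aware querying; it is not the ALCS procedure. The function names `euclidean` and `select_diverse` and the toy candidate points are hypothetical.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def select_diverse(candidates, budget):
    """Greedy farthest-point selection (an assumed stand-in, not ALCS).

    Returns the indices of up to `budget` candidates, each chosen to be as far
    as possible from the samples already selected.
    """
    if not candidates or budget <= 0:
        return []
    chosen = [0]  # arbitrary starting point: the first candidate
    # min_dist[i] is the distance from candidate i to its nearest chosen sample.
    min_dist = [euclidean(c, candidates[0]) for c in candidates]
    while len(chosen) < min(budget, len(candidates)):
        nxt = max(range(len(candidates)), key=lambda i: min_dist[i])
        if min_dist[nxt] == 0.0:
            break  # every remaining candidate duplicates a chosen one
        chosen.append(nxt)
        for i, c in enumerate(candidates):
            min_dist[i] = min(min_dist[i], euclidean(c, candidates[nxt]))
    return chosen


# Two near-duplicate pairs plus one isolated point: one sample per region is kept.
pool = [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0), (10.0, 0.0)]
picked = select_diverse(pool, budget=3)
```

With a budget of three, the near-duplicates at indices 1 and 3 are skipped, so the labeling effort is spread across the three distinct regions of the toy pool.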

A Software Tool for Evaluating Unmanned Autonomous Systems

Nov 21, 2021

Abdollah Homaifar, Ali Karimoddini, Mike Heiges, Mubbashar A. Khan, Berat A. Erol, Shabnam Nazmi

Abstract: The North Carolina Agricultural and Technical State University (NC A&T), in collaboration with Georgia Tech Research Institute (GTRI), has developed methodologies for creating simulation-based technology tools that are capable of inferring the perceptions and behavioral states of autonomous systems. These methodologies have the potential to provide the Test and Evaluation (T&E) community at the Department of Defense (DoD) with greater insight into the internal processes of these systems. The methodologies use only external observations and do not require complete knowledge of the internal processing of, or any modifications to, the system under test. This paper presents an example of one such simulation-based technology tool, named the Data-Driven Intelligent Prediction Tool (DIPT). DIPT was developed for testing a multi-platform Unmanned Aerial Vehicle (UAV) system capable of conducting collaborative search missions. DIPT's Graphical User Interface (GUI) enables testers to view the aircraft's current operating state, predicts its current target-detection status, and provides reasoning for why the aircraft exhibits a particular behavior, along with an explanation of why a particular task was assigned to it.

* The ITEA Journal of Test and Evaluation 41 (3), pp. 188-195, 2020

A Supervised Feature Selection Method For Mixed-Type Data using Density-based Feature Clustering

Nov 10, 2021

Xuyang Yan, Mrinmoy Sarkar, Biniam Gebru, Shabnam Nazmi, Abdollah Homaifar

Abstract: Feature selection methods are widely used to address the high computational overhead and curse of dimensionality in classifying high-dimensional data. Most conventional feature selection methods focus on handling homogeneous features, while real-world datasets usually have a mixture of continuous and discrete features. Some recent mixed-type feature selection studies only select features with high relevance to class labels and ignore the redundancy among features. The determination of an appropriate feature subset is also a challenge. In this paper, a supervised feature selection method using density-based feature clustering (SFSDFC) is proposed to obtain an appropriate final feature subset for mixed-type data. SFSDFC decomposes the feature space into a set of disjoint feature clusters using a novel density-based clustering method. Then, an effective feature selection strategy is employed to obtain a subset of important features with minimal redundancy from those feature clusters. Extensive experiments and comparison studies with five state-of-the-art methods are conducted on thirteen real-world benchmark datasets, and the results demonstrate the efficacy of the SFSDFC method.

* 6 pages, 3 figures, 4 tables, accepted by the IEEE SMC 2021

Evolving Multi-label Classification Rules by Exploiting High-order Label Correlation

Jul 22, 2020

Shabnam Nazmi, Xuyang Yan, Abdollah Homaifar, Emily Doucette

Abstract: In multi-label classification tasks, each problem instance is associated with multiple classes simultaneously. In such settings, the correlation between labels contains valuable information that can be used to obtain more accurate classification models. The correlation between labels can be exploited at different levels, such as capturing the pair-wise correlation or exploiting the higher-order correlations. Even though the high-order approach is more capable of modeling the correlation, it is computationally more demanding and has scalability issues. This paper aims at exploiting the high-order label correlation within subsets of labels using a supervised learning classifier system (UCS). For this purpose, the label powerset (LP) strategy is employed, and prediction aggregation within the set of labels relevant to an unseen instance is used to increase the prediction capability of the LP method in the presence of unseen labelsets. Exact match ratio and Hamming loss measures are considered to evaluate the rule performance, and the expected fitness value of a classifier is investigated for both metrics. A computational complexity analysis is also provided for the proposed algorithm. The experimental results of the proposed method are compared with other well-known LP-based methods on multiple benchmark datasets and confirm the competitive performance of this method.

* 13 pages, 1 figure, 14 tables, accepted for publication in the Neurocomputing journal
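
For readers unfamiliar with the terms in the abstract above, the following minimal sketch shows the standard label powerset transformation and the two evaluation measures it names, exact match ratio and Hamming loss. It uses the textbook definitions and is not code from the paper; the function names and the toy label matrices are hypothetical.

```python
def to_label_powerset(label_rows):
    """Map each binary label vector to one class id per distinct labelset."""
    labelset_to_id = {}
    class_ids = []
    for row in label_rows:
        key = tuple(row)
        if key not in labelset_to_id:
            labelset_to_id[key] = len(labelset_to_id)  # next unused id
        class_ids.append(labelset_to_id[key])
    return class_ids, labelset_to_id


def exact_match_ratio(y_true, y_pred):
    """Fraction of instances whose whole predicted labelset is correct."""
    matches = sum(1 for t, p in zip(y_true, y_pred) if tuple(t) == tuple(p))
    return matches / len(y_true)


def hamming_loss(y_true, y_pred):
    """Fraction of individual label assignments that are wrong."""
    n_labels = len(y_true[0])
    wrong = sum(
        t_j != p_j
        for t, p in zip(y_true, y_pred)
        for t_j, p_j in zip(t, p)
    )
    return wrong / (len(y_true) * n_labels)


# Toy data: 4 instances, 3 labels each (illustrative values only).
y_true = [[1, 0, 1], [0, 1, 0], [1, 0, 1], [1, 1, 0]]
y_pred = [[1, 0, 1], [0, 1, 1], [1, 0, 0], [1, 1, 0]]

class_ids, mapping = to_label_powerset(y_true)
emr = exact_match_ratio(y_true, y_pred)   # 2 of 4 rows match exactly
hl = hamming_loss(y_true, y_pred)         # 2 of 12 label bits differ
```

Exact match ratio is the stricter measure: a prediction with a single wrong label counts as a full miss, whereas Hamming loss penalizes only the individual label that is wrong.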
