Abstract: Knowledge distillation has been successfully applied to various audio tasks, but its potential in underwater passive sonar target classification remains relatively unexplored. Existing methods often focus on high-level contextual information while overlooking the low-level audio texture features needed to capture local patterns in sonar data. To address this gap, the Structural and Statistical Audio Texture Knowledge Distillation (SSATKD) framework is proposed for passive sonar target classification. SSATKD combines high-level contextual information with low-level audio textures, using an Edge Detection Module for structural texture extraction and a Statistical Knowledge Extractor Module to capture signal variability and distribution. Experimental results confirm that SSATKD improves classification accuracy while optimizing memory and computational resources, making it well-suited for resource-constrained environments.
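The sketch below illustrates, in PyTorch, the kind of combined objective this abstract describes: standard soft-logit distillation for high-level context, plus a structural texture term (edge maps) and a statistical texture term (feature moments) on intermediate feature maps. The function names, the use of Sobel filtering, and the loss weights are illustrative assumptions, not the authors' exact modules.

```python
# Hypothetical sketch of an SSATKD-style combined distillation loss (PyTorch).
# Sobel filtering and moment matching stand in for the paper's Edge Detection
# and Statistical Knowledge Extractor modules; weights and names are assumed.
import torch
import torch.nn.functional as F

def sobel_edges(fmap):
    """Structural texture proxy: per-channel Sobel edge magnitude of (N, C, H, W)."""
    kx = torch.tensor([[-1., 0., 1.], [-2., 0., 2.], [-1., 0., 1.]],
                      device=fmap.device).view(1, 1, 3, 3)
    ky = kx.transpose(2, 3)
    c = fmap.shape[1]
    gx = F.conv2d(fmap, kx.repeat(c, 1, 1, 1), padding=1, groups=c)
    gy = F.conv2d(fmap, ky.repeat(c, 1, 1, 1), padding=1, groups=c)
    return torch.sqrt(gx ** 2 + gy ** 2 + 1e-8)

def feature_moments(fmap):
    """Statistical texture proxy: per-channel mean and standard deviation."""
    return torch.cat([fmap.mean(dim=(2, 3)), fmap.std(dim=(2, 3))], dim=1)

def ssatkd_style_loss(s_logits, t_logits, s_feat, t_feat,
                      tau=4.0, w_struct=1.0, w_stat=1.0):
    """High-level KD loss plus low-level structural/statistical texture losses.

    Assumes student and teacher feature maps have matching shapes.
    """
    kd = F.kl_div(F.log_softmax(s_logits / tau, dim=1),
                  F.softmax(t_logits / tau, dim=1),
                  reduction="batchmean") * tau ** 2
    struct = F.mse_loss(sobel_edges(s_feat), sobel_edges(t_feat))
    stat = F.mse_loss(feature_moments(s_feat), feature_moments(t_feat))
    return kd + w_struct * struct + w_stat * stat
```

The texture terms operate on feature maps rather than logits, which is what lets the student absorb the local spectrogram patterns that logit-only distillation misses.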
Abstract: Transfer learning is commonly employed to leverage large pre-trained models by fine-tuning them for downstream tasks. The most prevalent pre-trained models are initially trained on ImageNet. However, their ability to generalize can vary across data modalities. This study compares Pre-trained Audio Neural Networks (PANNs) and ImageNet pre-trained models within the context of underwater acoustic target recognition (UATR). It was observed that the ImageNet pre-trained models slightly outperform pre-trained audio models in passive sonar classification. We also analyzed the impact of audio sampling rates on model pre-training and fine-tuning. This study contributes to transfer learning applications in UATR, illustrating the potential of pre-trained models to address limitations caused by scarce labeled data in the UATR domain.
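A minimal sketch of the transfer-learning setup this abstract compares, assuming PyTorch/torchvision and torchaudio. The choice of ResNet-18, the 16 kHz target rate, the file path, and num_classes are illustrative assumptions, not the study's exact configuration.

```python
# Hypothetical fine-tuning setup: ImageNet pre-trained backbone adapted to
# single-channel sonar spectrograms. All specifics below are assumptions.
import torch.nn as nn
import torchaudio
from torchvision import models

num_classes = 4          # assumed number of sonar target classes
target_rate = 16000      # assumed common rate; the study analyzes this choice

# Resample sonar recordings to a common rate before computing spectrograms,
# since sampling rate affects both pre-training and fine-tuning behavior.
waveform, sr = torchaudio.load("example.wav")  # placeholder path
waveform = torchaudio.functional.resample(waveform, sr, target_rate)

# ImageNet pre-trained backbone, adapted to single-channel spectrogram input.
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.conv1 = nn.Conv2d(1, 64, kernel_size=7, stride=2, padding=3, bias=False)
# Replace the 1000-way ImageNet head with a UATR classification head.
model.fc = nn.Linear(model.fc.in_features, num_classes)
# Fine-tune end-to-end, or freeze the backbone and train only model.fc.
```

Swapping the backbone for a PANN follows the same pattern: load the pre-trained audio model, replace its classification head, and fine-tune on the sonar data.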
Abstract: While deep learning has reduced the prevalence of manual feature extraction, transforming data via feature engineering remains essential for improving model performance, particularly for underwater acoustic signals. The methods by which audio signals are converted into time-frequency representations, and the subsequent handling of these spectrograms, can significantly impact performance. This work demonstrates the performance impact of using different combinations of time-frequency features in a histogram layer time delay neural network. An optimal set of features is identified, with results indicating that specific feature combinations outperform individual features.
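The sketch below shows one common way to combine time-frequency features: computing several representations of the same waveform and stacking them as input channels (torchaudio). The specific pair of features (log-mel and MFCC) and all parameter values are illustrative assumptions; the paper evaluates many such combinations.

```python
# Hypothetical feature-combination pipeline: stack two time-frequency
# representations of one waveform as channels. Parameters are assumed.
import torch
import torchaudio.transforms as T

sample_rate = 16000  # assumed
mel = T.MelSpectrogram(sample_rate=sample_rate, n_fft=1024,
                       hop_length=512, n_mels=64)
mfcc = T.MFCC(sample_rate=sample_rate, n_mfcc=64,
              melkwargs={"n_fft": 1024, "hop_length": 512, "n_mels": 64})

def combined_features(waveform):
    """Stack log-mel and MFCC features into a 2-channel input for the network.

    Expects waveform of shape (1, num_samples); returns (2, 64, time_frames).
    """
    log_mel = torch.log(mel(waveform) + 1e-6)  # (1, 64, T)
    coeffs = mfcc(waveform)                    # (1, 64, T)
    return torch.cat([log_mel, coeffs], dim=0)
```

Because both transforms share the same n_fft and hop_length, the resulting feature maps align frame-for-frame and can be concatenated directly along the channel axis.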