Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Mobile Recording Device Recognition Based Cross-Scale and Multi-Level Representation Learning

Nov 06, 2024

Chunyan Zeng, Yuhao Zhao, Zhifeng Wang

Figure 1 for Mobile Recording Device Recognition Based Cross-Scale and Multi-Level Representation Learning

Figure 2 for Mobile Recording Device Recognition Based Cross-Scale and Multi-Level Representation Learning

Figure 3 for Mobile Recording Device Recognition Based Cross-Scale and Multi-Level Representation Learning

Figure 4 for Mobile Recording Device Recognition Based Cross-Scale and Multi-Level Representation Learning

Share this with someone who'll enjoy it:

Abstract:This paper introduces a modeling approach that employs multi-level global processing, encompassing both short-term frame-level and long-term sample-level feature scales. In the initial stage of shallow feature extraction, various scales are employed to extract multi-level features, including Mel-Frequency Cepstral Coefficients (MFCC) and pre-Fbank log energy spectrum. The construction of the identification network model involves considering the input two-dimensional temporal features from both frame and sample levels. Specifically, the model initially employs one-dimensional convolution-based Convolutional Long Short-Term Memory (ConvLSTM) to fuse spatiotemporal information and extract short-term frame-level features. Subsequently, bidirectional long Short-Term Memory (BiLSTM) is utilized to learn long-term sample-level sequential representations. The transformer encoder then performs cross-scale, multi-level processing on global frame-level and sample-level features, facilitating deep feature representation and fusion at both levels. Finally, recognition results are obtained through Softmax. Our method achieves an impressive 99.6% recognition accuracy on the CCNU_Mobile dataset, exhibiting a notable improvement of 2% to 12% compared to the baseline system. Additionally, we thoroughly investigate the transferability of our model, achieving an 87.9% accuracy in a classification task on a new dataset.

* 16 pages

View paper on

Share this with someone who'll enjoy it:

Title:Mobile Recording Device Recognition Based Cross-Scale and Multi-Level Representation Learning

Paper and Code