Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhentao Lin

DGSNA: prompt-based Dynamic Generative Scene-based Noise Addition method

Nov 19, 2024

Zihao Chen, Zhentao Lin, Bi Zeng, Linyi Huang, Zhi Li, Jia Cai

Figure 1 for DGSNA: prompt-based Dynamic Generative Scene-based Noise Addition method

Figure 2 for DGSNA: prompt-based Dynamic Generative Scene-based Noise Addition method

Figure 3 for DGSNA: prompt-based Dynamic Generative Scene-based Noise Addition method

Figure 4 for DGSNA: prompt-based Dynamic Generative Scene-based Noise Addition method

Abstract:This paper addresses the challenges of accurately enumerating and describing scenes and the labor-intensive process required to replicate acoustic environments using non-generative methods. We introduce the prompt-based Dynamic Generative Sce-ne-based Noise Addition method (DGSNA), which innovatively combines the Dynamic Generation of Scene Information (DGSI) with Scene-based Noise Addition for Audio (SNAA). Employing generative chat models structured within the Back-ground-Examples-Task (BET) prompt framework, DGSI com-ponent facilitates the dynamic synthesis of tailored Scene Infor-mation (SI) for specific acoustic environments. Additionally, the SNAA component leverages Room Impulse Response (RIR) fil-ters and Text-To-Audio (TTA) systems to generate realistic, scene-based noise that can be adapted for both indoor and out-door environments. Through comprehensive experiments, the adaptability of DGSNA across different generative chat models was demonstrated. The results, assessed through both objective and subjective evaluations, show that DGSNA provides robust performance in dynamically generating precise SI and effectively enhancing scene-based noise addition capabilities, thus offering significant improvements over traditional methods in acoustic scene simulation. Our implementation and demos are available at https://dgsna.github.io.

Via

Access Paper or Ask Questions

CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

Nov 12, 2024

Zhenkai Wu, Xiaowen Ma, Rongrong Lian, Zhentao Lin, Wei Zhang

Figure 1 for CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

Figure 2 for CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

Figure 3 for CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

Figure 4 for CDXFormer: Boosting Remote Sensing Change Detection with Extended Long Short-Term Memory

Abstract:In complex scenes and varied conditions, effectively integrating spatial-temporal context is crucial for accurately identifying changes. However, current RS-CD methods lack a balanced consideration of performance and efficiency. CNNs lack global context, Transformers have quadratic computational complexity, and Mambas are restricted by CUDA acceleration. In this paper, we propose CDXFormer, with a core component that is a powerful XLSTM-based feature enhancement layer, integrating the advantages of linear computational complexity, global context perception, and strong interpret-ability. Specifically, we introduce a scale-specific Feature Enhancer layer, incorporating a Cross-Temporal Global Perceptron customized for semantic-accurate deep features, and a Cross-Temporal Spatial Refiner customized for detail-rich shallow features. Additionally, we propose a Cross-Scale Interactive Fusion module to progressively interact global change representations with spatial responses. Extensive experimental results demonstrate that CDXFormer achieves state-of-the-art performance across three benchmark datasets, offering a compelling balance between efficiency and accuracy. Code is available at https://github.com/xwmaxwma/rschange.

Via

Access Paper or Ask Questions