Picture for Dongyuan Shi

Dongyuan Shi

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models

Add code
Nov 28, 2024
Viaarxiv icon

Transferable Selective Virtual Sensing Active Noise Control Technique Based on Metric Learning

Add code
Sep 09, 2024
Viaarxiv icon

Latent Diffusion Model-Enabled Real-Time Semantic Communication Considering Semantic Ambiguities and Channel Noises

Add code
Jun 09, 2024
Viaarxiv icon

Computation-efficient Virtual Sensing Approach with Multichannel Adjoint Least Mean Square Algorithm

Add code
May 23, 2024
Viaarxiv icon

A Survey of Integrating Wireless Technology into Active Noise Control

Add code
May 21, 2024
Figure 1 for A Survey of Integrating Wireless Technology into Active Noise Control
Figure 2 for A Survey of Integrating Wireless Technology into Active Noise Control
Figure 3 for A Survey of Integrating Wireless Technology into Active Noise Control
Figure 4 for A Survey of Integrating Wireless Technology into Active Noise Control
Viaarxiv icon

Unsupervised learning based end-to-end delayless generative fixed-filter active noise control

Add code
Feb 08, 2024
Viaarxiv icon

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Add code
Feb 05, 2024
Figure 1 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 2 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 3 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Figure 4 for Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift
Viaarxiv icon

Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

Add code
Jan 11, 2024
Figure 1 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Figure 2 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Figure 3 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Figure 4 for Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music
Viaarxiv icon

A Comprehensive End-to-End Computer Vision Framework for Restoration and Recognition of Low-Quality Engineering Drawings

Add code
Dec 21, 2023
Viaarxiv icon

Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection

Add code
Dec 07, 2023
Viaarxiv icon