Han Yin

Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event Detection

Nov 02, 2024
Via arXiv

Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference

Oct 10, 2024
Via arXiv

Mixstyle based Domain Generalization for Sound Event Detection with Heterogeneous Training Data

Jul 04, 2024
Via arXiv

FMSG-JLESS Submission for DCASE 2024 Task4 on Sound Event Detection with Heterogeneous Training Dataset and Potentially Missing Labels

Jun 29, 2024
Via arXiv

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

Feb 05, 2024
Via arXiv

Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

Jan 11, 2024
Via arXiv

Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection

Dec 07, 2023
Via arXiv

AudioLog: LLMs-Powered Long Audio Logging with Acoustic Scenes and Events Joint Estimation

Nov 21, 2023
Via arXiv

Two-stage Autoencoder Neural Network for 3D Speech Enhancement

Jun 08, 2023
Via arXiv