Picture for Xiang Hao

Xiang Hao

College of Optical Science and Engineering, Zhejiang University, No.38 of Zheda Road, Hangzhou, Zhejiang Province, China

NowYouSee Me: Context-Aware Automatic Audio Description

Add code
Dec 13, 2024
Viaarxiv icon

GEXIA: Granularity Expansion and Iterative Approximation for Scalable Multi-grained Video-language Learning

Add code
Dec 10, 2024
Viaarxiv icon

Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet

Add code
Oct 07, 2024
Figure 1 for Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Figure 2 for Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Figure 3 for Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Figure 4 for Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet
Viaarxiv icon

AI-Generated Content Enhanced Computer-Aided Diagnosis Model for Thyroid Nodules: A ChatGPT-Style Assistant

Add code
Feb 04, 2024
Viaarxiv icon

Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction

Add code
Oct 15, 2023
Viaarxiv icon

Pink-Eggs Dataset V1: A Step Toward Invasive Species Management Using Deep Learning Embedded Solutions

Add code
May 16, 2023
Viaarxiv icon

Two-stage Neural Network for ICASSP 2023 Speech Signal Improvement Challenge

Add code
Mar 14, 2023
Viaarxiv icon

Fast FullSubNet: Accelerate Full-band and Sub-band Fusion Model for Single-channel Speech Enhancement

Add code
Dec 18, 2022
Viaarxiv icon

Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes

Add code
Jun 16, 2022
Figure 1 for Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Figure 2 for Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Figure 3 for Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Figure 4 for Scalable Temporal Localization of Sensitive Activities in Movies and TV Episodes
Viaarxiv icon

Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers

Add code
Mar 30, 2022
Figure 1 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Figure 2 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Figure 3 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Figure 4 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Viaarxiv icon