Picture for Meng Yu

Meng Yu

Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments

Add code
Oct 09, 2024
Viaarxiv icon

Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules

Add code
Oct 02, 2024
Viaarxiv icon

SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis

Add code
Sep 11, 2024
Figure 1 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Figure 2 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Figure 3 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Figure 4 for SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis
Viaarxiv icon

Neural Ambisonic Encoding For Multi-Speaker Scenarios Using A Circular Microphone Array

Add code
Sep 11, 2024
Viaarxiv icon

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment

Add code
Jun 17, 2024
Viaarxiv icon

SMRU: Split-and-Merge Recurrent-based UNet for Acoustic Echo Cancellation and Noise Suppression

Add code
Jun 17, 2024
Viaarxiv icon

Joint Conditional Diffusion Model for Image Restoration with Mixed Degradations

Add code
Apr 11, 2024
Viaarxiv icon

VIFNet: An End-to-end Visible-Infrared Fusion Network for Image Dehazing

Add code
Apr 11, 2024
Viaarxiv icon

Deep Audio Zooming: Beamwidth-Controllable Neural Beamformer

Add code
Nov 22, 2023
Viaarxiv icon

Advancing Acoustic Howling Suppression through Recursive Training of Neural Networks

Add code
Sep 27, 2023
Viaarxiv icon