Picture for Jiaqi Su

Jiaqi Su

Deep Audio Watermarks are Shallow: Limitations of Post-Hoc Watermarking Techniques for Speech

Add code
Apr 15, 2025
Viaarxiv icon

DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers

Add code
Apr 13, 2025
Viaarxiv icon

Code Drift: Towards Idempotent Neural Audio Codecs

Add code
Oct 14, 2024
Viaarxiv icon

Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation

Add code
Aug 28, 2024
Figure 1 for Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
Figure 2 for Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
Figure 3 for Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
Figure 4 for Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation
Viaarxiv icon

TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling

Add code
Mar 18, 2024
Viaarxiv icon

HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Add code
Jun 10, 2020
Figure 1 for HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Figure 2 for HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Figure 3 for HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Figure 4 for HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Viaarxiv icon