Picture for Ritwik Giri

Ritwik Giri

Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure

Add code
Feb 01, 2024
Viaarxiv icon

A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement

Add code
Feb 23, 2023
Viaarxiv icon

Semi-supervised Time Domain Target Speaker Extraction with Attention

Add code
Jun 18, 2022
Figure 1 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Figure 2 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Figure 3 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Figure 4 for Semi-supervised Time Domain Target Speaker Extraction with Attention
Viaarxiv icon

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

Add code
Jun 16, 2022
Figure 1 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Figure 2 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Figure 3 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Figure 4 for To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets
Viaarxiv icon

Improved singing voice separation with chromagram-based pitch-aware remixing

Add code
Mar 28, 2022
Figure 1 for Improved singing voice separation with chromagram-based pitch-aware remixing
Figure 2 for Improved singing voice separation with chromagram-based pitch-aware remixing
Figure 3 for Improved singing voice separation with chromagram-based pitch-aware remixing
Figure 4 for Improved singing voice separation with chromagram-based pitch-aware remixing
Viaarxiv icon

Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement

Add code
Jun 08, 2021
Figure 1 for Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement
Figure 2 for Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement
Figure 3 for Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement
Figure 4 for Personalized PercepNet: Real-time, Low-complexity Target Voice Separation and Enhancement
Viaarxiv icon

Semi-Supervised Singing Voice Separation with Noisy Self-Training

Add code
Feb 16, 2021
Figure 1 for Semi-Supervised Singing Voice Separation with Noisy Self-Training
Figure 2 for Semi-Supervised Singing Voice Separation with Noisy Self-Training
Figure 3 for Semi-Supervised Singing Voice Separation with Noisy Self-Training
Figure 4 for Semi-Supervised Singing Voice Separation with Noisy Self-Training
Viaarxiv icon

Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders

Add code
Feb 12, 2021
Figure 1 for Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Figure 2 for Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Figure 3 for Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Figure 4 for Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders
Viaarxiv icon

PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss

Add code
Aug 11, 2020
Figure 1 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 2 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 3 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Figure 4 for PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Viaarxiv icon

From Speech-to-Speech Translation to Automatic Dubbing

Add code
Feb 02, 2020
Figure 1 for From Speech-to-Speech Translation to Automatic Dubbing
Figure 2 for From Speech-to-Speech Translation to Automatic Dubbing
Figure 3 for From Speech-to-Speech Translation to Automatic Dubbing
Figure 4 for From Speech-to-Speech Translation to Automatic Dubbing
Viaarxiv icon