Alexander H. Liu

A Closer Look at Neural Codec Resynthesis: Bridging the Gap between Codec and Waveform Generation

Oct 29, 2024

Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration

Sep 25, 2024

ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

Sep 24, 2024

Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models

Sep 21, 2024

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Feb 20, 2024

Towards audio language modeling -- an overview

Feb 20, 2024

Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective

Jan 16, 2024

Generative Pre-training for Speech with Flow Matching

Oct 25, 2023

Joint Audio and Speech Understanding

Oct 02, 2023

Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering

May 18, 2023