Picture for Yi-Chiao Wu

Yi-Chiao Wu

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Add code
Feb 07, 2025
Figure 1 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 2 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 3 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Figure 4 for Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound
Viaarxiv icon

ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling

Add code
Feb 04, 2025
Figure 1 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 2 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 3 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Figure 4 for ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling
Viaarxiv icon

Movie Gen: A Cast of Media Foundation Models

Add code
Oct 17, 2024
Figure 1 for Movie Gen: A Cast of Media Foundation Models
Figure 2 for Movie Gen: A Cast of Media Foundation Models
Figure 3 for Movie Gen: A Cast of Media Foundation Models
Figure 4 for Movie Gen: A Cast of Media Foundation Models
Viaarxiv icon

Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models

Add code
Sep 21, 2024
Figure 1 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Figure 2 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Figure 3 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Figure 4 for Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models
Viaarxiv icon

EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 30, 2024
Figure 1 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 2 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 3 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 4 for EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Viaarxiv icon

EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 24, 2024
Figure 1 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 2 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 3 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 4 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Viaarxiv icon

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation

Add code
Jun 11, 2024
Figure 1 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 2 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 3 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 4 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Viaarxiv icon

Multi-speaker Text-to-speech Training with Speaker Anonymized Data

Add code
May 20, 2024
Viaarxiv icon

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Add code
Jan 22, 2024
Viaarxiv icon

Audiobox: Unified Audio Generation with Natural Language Prompts

Add code
Dec 25, 2023
Viaarxiv icon