Haizhou Li

Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context

Mar 19, 2025

Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation

Mar 16, 2025

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Mar 07, 2025

UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook

Feb 27, 2025

Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles

Feb 26, 2025

Recent Advances in Large Language Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation

Feb 23, 2025

Soundwave: Less is More for Speech-Text Alignment in LLMs

Feb 18, 2025

Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends

Feb 05, 2025

PAL: Prompting Analytic Learning with Missing Modality for Multi-Modal Class-Incremental Learning

Jan 16, 2025

ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification

Jan 14, 2025