Haizhou Li

Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing

Apr 08, 2025

Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation

Apr 03, 2025

$C^2$AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction

Apr 01, 2025

Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context

Mar 19, 2025

Context-Aware Two-Step Training Scheme for Domain Invariant Speech Separation

Mar 16, 2025

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Mar 07, 2025

UniCodec: Unified Audio Codec with Single Domain-Adaptive Codebook

Feb 27, 2025

Know You First and Be You Better: Modeling Human-Like User Simulators via Implicit Profiles

Feb 26, 2025

Recent Advances in Large Language Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation

Feb 23, 2025

Soundwave: Less is More for Speech-Text Alignment in LLMs

Feb 18, 2025